---
title: 'Multi-Tenant AI Agents'
sidebar_label: 'Multi-Tenant AI Agents'
sidebar_position: 16
description: 'Deploy AI agents across many SaaS tenants with strict isolation and no per-tenant ETL pipelines.'
pagination_prev: null
pagination_next: null
---

Spice serves as the data substrate for multi-tenant AI agents, giving each tenant real-time access to its own data sources with configurable isolation and no per-tenant ETL pipelines. A lightweight Rust runtime can be deployed once for all tenants, once per tenant, or in a hybrid topology, and every deployment exposes the same SQL, search, and AI inference APIs over HTTP, Arrow Flight, FlightSQL, ODBC, JDBC, and ADBC.

Pipeline-per-integration architectures collapse at scale: every tenant brings its own schema, refresh cadence, and ownership, and ETL orchestration becomes the bottleneck. Spice federates queries across 30+ data sources and accelerates them locally, so adding a tenant is a Spicepod configuration change rather than a new pipeline.

Why Spice.ai?

Zero-ETL federation: Query PostgreSQL, Snowflake, Databricks, S3, Kafka, and 25+ other sources through a single SQL interface. Onboarding a tenant is a dataset declaration, not a pipeline build.
Configurable isolation: Choose logical, config-level, or runtime-level tenant boundaries. Isolation becomes a deployment property rather than something enforced in application code.
Sandboxed runtimes: The Spice runtime is lightweight enough to deploy one instance per tenant or per agent, giving each agent its own sources, acceleration layers, and secrets.
Local acceleration with CDC: Materialize tenant working sets into Arrow, DuckDB, SQLite, or PostgreSQL accelerators and keep them current with change data capture.
Declarative configuration: The spicepod.yaml manifest defines datasets, models, secrets, and acceleration behavior, so tenant topology is version-controlled and auditable.

Deployment Patterns

Four patterns trade off operational simplicity against isolation strength. Most SaaS workloads converge on a hybrid configuration.

Pattern 1: Query-Time Tenant Isolation

One runtime serves many tenants. Datasets reference tenant-partitioned tables, and the application includes a tenant filter in every query.

This is the simplest and cheapest pattern to operate, but isolation is logical: correctness depends on every query path applying the tenant filter.
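
One way to reduce the risk of a query path skipping the filter is to route every query through a single helper that always binds the tenant. The sketch below is illustrative only: it uses Python's built-in `sqlite3` as a stand-in for a shared SQL endpoint, and the `orders` table, the `tenant_id` column, and the `query_for_tenant` helper are hypothetical names, not part of Spice.

```python
import sqlite3


def query_for_tenant(conn, tenant_id, min_total):
    """Run a query that is always scoped to a single tenant.

    The tenant id is bound as a parameter rather than interpolated into
    the SQL string, so callers cannot drop or rewrite the filter.
    """
    sql = (
        "SELECT id, total FROM orders "
        "WHERE tenant_id = ? AND total >= ? "
        "ORDER BY id"
    )
    return conn.execute(sql, (tenant_id, min_total)).fetchall()


def demo_connection():
    """Build an in-memory stand-in for a shared, tenant-partitioned table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (id INTEGER, tenant_id TEXT, total REAL)")
    conn.executemany(
        "INSERT INTO orders VALUES (?, ?, ?)",
        [(1, "acme", 10.0), (2, "globex", 25.0), (3, "acme", 40.0)],
    )
    return conn
```

Calling `query_for_tenant(demo_connection(), "acme", 0)` returns only the rows whose `tenant_id` is `acme`. Isolation is still logical: any code that queries the table without going through the helper is unfiltered.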

View-based variant: Move tenant filtering into the Spicepod using views, so agents query a tenant-specific view rather than constructing the filter at runtime.
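
As a rough sketch of how per-tenant view definitions might be generated, the helper below builds one entry per tenant. The `name` and `sql` keys are assumed here to mirror the shape of a view entry and should be checked against the Views reference; the `orders` dataset, the `tenant_id` column, and the function names are hypothetical.

```python
import re

# Only allow simple identifiers, because the tenant id is embedded in SQL text.
_TENANT_ID = re.compile(r"[a-z0-9_]+")


def tenant_view(tenant_id, base_dataset="orders"):
    """Return a view definition that pins the tenant filter in one place."""
    if not _TENANT_ID.fullmatch(tenant_id):
        raise ValueError(f"unsafe tenant id: {tenant_id!r}")
    return {
        "name": f"{base_dataset}_{tenant_id}",
        "sql": f"SELECT * FROM {base_dataset} WHERE tenant_id = '{tenant_id}'",
    }


def tenant_views(tenant_ids, base_dataset="orders"):
    """Build view definitions for all tenants in a stable, sorted order."""
    return [tenant_view(t, base_dataset) for t in sorted(tenant_ids)]
```

An agent for tenant `acme` would then query `orders_acme` and never construct the filter itself. The identifier check matters because the tenant id ends up inside a SQL string literal.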

Pattern 2: Config-Level Tenant Isolation

One runtime, but each tenant gets its own dataset entry. The manifest is typically generated from tenant-onboarding metadata.

Tenant boundaries are explicit in configuration and easier to audit, but the manifest grows with tenant count, and reload time and memory planning become operational concerns at large scale.
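
Generating the manifest from onboarding metadata could look roughly like the sketch below. The tenant record fields (`id`, `source`, `table`), the pod name, and the helper names are hypothetical; the `from` and `name` keys are assumed to match a dataset entry and should be verified against the datasets reference. Other top-level fields the manifest needs are omitted.

```python
def dataset_entry(tenant):
    """Map one tenant-onboarding record to one dataset entry."""
    return {
        "from": tenant["source"],
        "name": f"{tenant['id']}_{tenant['table']}",
    }


def build_manifest(tenants, pod_name="multi_tenant"):
    """Build a manifest-shaped dict with one dataset entry per tenant."""
    entries = [dataset_entry(t) for t in sorted(tenants, key=lambda t: t["id"])]
    names = [e["name"] for e in entries]
    if len(names) != len(set(names)):
        # Two tenants resolving to one dataset name would blur the boundary.
        raise ValueError("duplicate dataset name in generated manifest")
    return {"name": pod_name, "datasets": entries}
```

The resulting dict can be serialized with a YAML library of choice and committed, so each onboarding change shows up as a reviewable diff.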

Pattern 3: Runtime-Level Tenant Isolation

Run a dedicated Spicepod per tenant and route requests using tenant context from auth or session claims. Each runtime has an independent manifest, compute envelope, and cache.

This gives the clearest isolation model and per-tenant operational control, at the cost of higher operational overhead as tenant count grows. It fits regulated workloads that require strict data boundaries.
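
A minimal sketch of claim-based routing follows, assuming the auth layer has already verified the token and produced a claims dict. The `tenant_id` claim name, the endpoint URLs, and the `runtime_for` helper are hypothetical. The lookup fails closed: an unknown tenant is an error, never a fallback to some shared runtime.

```python
class UnknownTenant(Exception):
    """Raised when a request cannot be mapped to a tenant runtime."""


def runtime_for(claims, runtimes):
    """Return the endpoint of the runtime dedicated to the caller's tenant."""
    tenant_id = claims.get("tenant_id")
    if tenant_id is None:
        raise UnknownTenant("missing tenant_id claim")
    try:
        return runtimes[tenant_id]
    except KeyError:
        raise UnknownTenant(f"no runtime registered for {tenant_id!r}") from None


# Hypothetical registry: one runtime endpoint per tenant.
RUNTIMES = {
    "acme": "http://spice-acme.internal:8090",
    "globex": "http://spice-globex.internal:8090",
}
```

With this registry, `runtime_for({"tenant_id": "acme"}, RUNTIMES)` resolves to the `acme` endpoint, and a request without a tenant claim is rejected before any query is issued.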

Pattern 4: Hybrid Isolation

Large tenants run on dedicated Spicepods while the long tail shares a partitioned deployment. A tenant-aware router decides placement and clients query a single logical interface.

Hybrid isolation keeps the query interface stable while allowing tenant placement by policy, isolating high-load or regulated tenants on dedicated runtimes and using shared capacity for the long tail. It is the recommended starting point for variable SaaS workloads.
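
Placement by policy might be sketched as below. The `Tenant` fields, the queries-per-minute threshold of 1000, and the endpoint URLs are illustrative assumptions rather than recommended values. A tenant that the policy marks as dedicated is never silently served from the shared deployment.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Tenant:
    id: str
    regulated: bool = False
    queries_per_minute: int = 0


def placement(tenant, dedicated_qpm=1000):
    """Decide placement: regulated or high-load tenants get a dedicated runtime."""
    if tenant.regulated or tenant.queries_per_minute >= dedicated_qpm:
        return "dedicated"
    return "shared"


def endpoint_for(tenant, dedicated, shared_endpoint, dedicated_qpm=1000):
    """Resolve the endpoint a tenant-aware router would forward to."""
    if placement(tenant, dedicated_qpm) == "shared":
        return shared_endpoint
    if tenant.id not in dedicated:
        # Fail closed instead of falling back to shared capacity.
        raise LookupError(f"no dedicated runtime provisioned for {tenant.id!r}")
    return dedicated[tenant.id]
```

Clients keep calling one logical interface, and moving a tenant between shared and dedicated capacity becomes a change to the policy inputs or the registry rather than to application code.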

Choosing a Pattern

The right shape depends on isolation requirements, tenant count, and query volume per tenant.

Requirement	Recommended Pattern
Small tenant count, uniform workload	Pattern 1 or its view-based variant
Explicit config-level boundaries, auditable manifest	Pattern 2
Regulated tenants, strict per-tenant SLOs	Pattern 3
Power-law tenant distribution	Pattern 4 (hybrid)

Benefits

Zero-ETL onboarding: Add a tenant by declaring a dataset, not by building a pipeline.
Deployment-level isolation: Match isolation strength to business and compliance requirements without changing application code.
Consistent interface: Agents query SQL, search, and inference APIs the same way regardless of how tenants are partitioned underneath.

Learn More

Multi-Tenancy for AI Agents without the Pipelines — the engineering deep dive behind these patterns.
Spicepod reference and datasets reference.
Views for declarative tenant filtering.
Data Acceleration and Change Data Capture.
Federated SQL Query recipe.