/docs/website/versioned_docs/version-1.11.x/deployment/aws/integrations.md

---
title: 'AWS Integrations'
description: 'Complete guide to Spice.ai integrations with Amazon Web Services, including data connectors, AI models, vector stores, and secret management.'
sidebar_label: 'Integrations'
sidebar_position: 2
pagination_next: null
keywords: [spice.ai, aws, amazon, s3, dynamodb, redshift, bedrock, glue, s3 vectors, secrets manager, eks, ecs]
image: /img/aws-spice.png
---

Spice.ai and AWS

Spice.ai provides deep integrations with Amazon Web Services (AWS), enabling data federation, AI inference, vector search, and secure secret management across the AWS ecosystem. This page consolidates all AWS-compatible components and provides quick access to configuration guides.

Data Connectors

Data connectors federate SQL queries across AWS data sources without data movement.

Connector	Description	Documentation
Amazon S3	Query Parquet, CSV, and JSON files stored in S3 buckets. Supports private buckets with IAM authentication and S3-compatible storage like MinIO.	S3 Data Connector
Amazon S3 Tables	Query Iceberg tables in Amazon S3 Tables using the Glue connector with S3 Tables catalog format.	Glue Data Connector
Amazon DynamoDB	Federated SQL queries on DynamoDB tables with automatic schema inference.	DynamoDB Data Connector
Amazon DynamoDB Streams	Real-time CDC streaming of table changes via DynamoDB Streams.	DynamoDB Data Connector
Amazon Redshift	Connect to Redshift clusters using the PostgreSQL-compatible connector.	Redshift Data Connector
Amazon Aurora PostgreSQL	Connect to Aurora PostgreSQL clusters using the PostgreSQL connector.	PostgreSQL Data Connector
Amazon Aurora MySQL	Connect to Aurora MySQL clusters using the MySQL connector.	MySQL Data Connector
Amazon RDS PostgreSQL	Connect to RDS PostgreSQL instances using the PostgreSQL connector.	PostgreSQL Data Connector
Amazon RDS MySQL	Connect to RDS MySQL instances using the MySQL connector.	MySQL Data Connector
Amazon MSK	Stream data from Amazon MSK (Managed Streaming for Apache Kafka) topics using the Kafka connector.	Kafka Data Connector
Debezium (Amazon MSK)	Change Data Capture (CDC) from databases via Debezium running on Amazon MSK for real-time dataset updates.	Debezium Data Connector
AWS Glue Data Catalog	Query Iceberg tables registered in AWS Glue.	Glue Data Connector
Apache Iceberg (AWS)	Query Iceberg tables stored in S3 with Glue or REST catalog metadata.	Iceberg Data Connector
Delta Lake (S3)	Query Delta Lake tables stored in Amazon S3.	Delta Lake Data Connector
AWS Athena (ODBC)	Connect to Athena using the ODBC connector with Athena SQL dialect support.	ODBC Data Connector

Example: Amazon S3

Example: DynamoDB

Example: AWS Glue with Amazon S3 Tables

Catalog Connectors

Catalog connectors provide schema discovery and unified access to tables in AWS data catalogs.

Connector	Description	Documentation
AWS Glue Catalog	Discover and query tables from AWS Glue Data Catalog with glob pattern filtering.	Glue Catalog Connector
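
To illustrate what glob pattern filtering means for catalog discovery, here is a minimal Python sketch that selects `database.table` names with shell-style patterns. The table names and the `filter_tables` helper are hypothetical, and the exact pattern syntax Spice accepts is not confirmed here; see the Glue Catalog Connector documentation for the real rules.

```python
from fnmatch import fnmatchcase
from typing import Iterable, List


def filter_tables(tables: Iterable[str], patterns: Iterable[str]) -> List[str]:
    """Keep the tables whose name matches at least one glob pattern.

    Illustrative only: uses Python's shell-style globbing, which may differ
    from the matching rules of the actual catalog connector.
    """
    patterns = list(patterns)
    return [t for t in tables if any(fnmatchcase(t, p) for p in patterns)]


if __name__ == "__main__":
    # Hypothetical tables in the form "<database>.<table>".
    discovered = ["sales.orders", "sales.customers", "staging.orders_tmp"]
    print(filter_tables(discovered, ["sales.*"]))
    print(filter_tables(discovered, ["*.orders*"]))
```

The first pattern keeps every table in the `sales` database; the second keeps any table whose name starts with `orders`, regardless of database.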

Example: Glue Catalog

AI Models (Amazon Bedrock)

Spice integrates with Amazon Bedrock for large language model inference, supporting Amazon Nova and other foundation models.

Provider	Supported Models	Documentation
Amazon Bedrock	Amazon Nova (Micro, Lite, Pro, Premier), cross-region inference profiles	Bedrock Models

Example: Amazon Nova

Guardrails Support

Bedrock Guardrails can filter model inputs and outputs:

Embeddings (Amazon Bedrock)

Generate vector embeddings using Amazon Bedrock embedding models for semantic search and RAG applications.

Provider	Supported Models	Documentation
Amazon Bedrock	Amazon Titan Embeddings, Amazon Nova Multimodal Embeddings, Cohere Embed	Bedrock Embeddings

Example: Amazon Titan Embeddings

Example: Amazon Nova Multimodal Embeddings

Vector Stores (Amazon S3 Vectors)

Amazon S3 Vectors is a new S3 bucket type for storing and querying vector embeddings at scale. Spice integrates S3 Vectors as a vector index backend for hybrid search applications.

Engine	Description	Documentation
Amazon S3 Vectors	Sub-second similarity queries on billions of vectors with up to 90% cost reduction compared to traditional vector databases.	S3 Vectors Engine

Example: S3 Vectors with Bedrock Embeddings

Data Accelerators (S3 Express One Zone)

The Spice Cayenne data accelerator can store accelerated data in Amazon S3 Express One Zone, which offers single-digit millisecond latency. This suits latency-sensitive query workloads that need persistent storage without giving up fast access.

:::tip Storage Recommendation
For best performance, store Cayenne data files on local NVMe storage. Use S3 Express One Zone only when persistence of accelerations is required, such as preserving accelerated data across restarts or sharing data between multiple Spice instances.
:::

Accelerator	Description	Documentation
Spice Cayenne	High-performance data accelerator that uses the Vortex file format and S3 Express One Zone storage for sub-10 ms query latency.	Cayenne Accelerator

Why S3 Express One Zone?

S3 Express One Zone directory buckets provide:

- Single-digit millisecond latency: Up to 10x faster than S3 Standard for first-byte latency
- High request throughput: Up to 10x higher request rates than S3 Standard
- Cost efficiency: Lower per-request costs for high-frequency access patterns
- Durability: Designed for 99.999999999% (11 9s) durability, but data is stored in a single Availability Zone rather than replicated across several as in S3 Standard

Example: Cayenne with S3 Express One Zone

Example: Auto-generated Bucket with IAM Role

Supported AWS Regions

S3 Express One Zone is available in select regions. Spice automatically derives the region from zone IDs:

Zone ID Prefix	Region
`use1`	us-east-1
`use2`	us-east-2
`usw1`	us-west-1
`usw2`	us-west-2
`euw1`	eu-west-1
`euc1`	eu-central-1
`apne1`	ap-northeast-1
`apse1`	ap-southeast-1

See AWS documentation for the complete list of S3 Express One Zone availability zones.
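
The prefix lookup in the table can be sketched in a few lines of Python. This only illustrates the mapping, assuming zone IDs of the usual AWS form `<prefix>-az<N>` (for example `use1-az4`); the `region_from_zone_id` helper is hypothetical and is not Spice's own implementation.

```python
# Prefix-to-region pairs copied from the table above.
ZONE_PREFIX_TO_REGION = {
    "use1": "us-east-1",
    "use2": "us-east-2",
    "usw1": "us-west-1",
    "usw2": "us-west-2",
    "euw1": "eu-west-1",
    "euc1": "eu-central-1",
    "apne1": "ap-northeast-1",
    "apse1": "ap-southeast-1",
}


def region_from_zone_id(zone_id: str) -> str:
    """Return the AWS region for a zone ID such as 'use1-az4'.

    Illustrative only: the lookup inside Spice may handle more prefixes
    or validate the zone suffix differently.
    """
    prefix = zone_id.split("-", 1)[0].lower()
    try:
        return ZONE_PREFIX_TO_REGION[prefix]
    except KeyError:
        raise ValueError(f"unrecognized zone ID prefix: {prefix!r}") from None


if __name__ == "__main__":
    print(region_from_zone_id("use1-az4"))  # us-east-1
```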

Secret Management

Securely store and retrieve credentials using AWS Secrets Manager.

Store	Description	Documentation
AWS Secrets Manager	Read secrets from AWS Secrets Manager by secret name.	AWS Secrets Manager

Example: Using Secrets Manager

Authentication

All AWS integrations support the standard AWS SDK credential chain. When credentials are not explicitly configured, Spice loads them from the following sources in order:

1. Environment Variables: `AWS_ACCESS_KEY_ID`, `AWS_SECRET_ACCESS_KEY`, `AWS_SESSION_TOKEN`
2. Shared Credentials Files: `~/.aws/credentials` and `~/.aws/config`
3. AWS SSO Sessions: Configured via `aws configure sso`
4. Web Identity Token: For OIDC/OAuth (common with EKS IRSA)
5. ECS Container Credentials: Automatic IAM role for ECS tasks
6. EC2 Instance Metadata (IMDSv2): Automatic IAM role for EC2 instances
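
When debugging authentication, it can help to check which of these sources is likely to win on a given host. The Python sketch below walks the same order using only the standard library. It is a simplified diagnostic, not the SDK's resolution logic: it does not validate credentials, it treats SSO sessions as part of the shared config file, and the web identity and ECS variable names (`AWS_WEB_IDENTITY_TOKEN_FILE`, `AWS_ROLE_ARN`, `AWS_CONTAINER_CREDENTIALS_RELATIVE_URI`, `AWS_CONTAINER_CREDENTIALS_FULL_URI`) are the standard AWS SDK variables rather than names taken from this page.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def likely_credential_source(
    env: Optional[Mapping[str, str]] = None,
    home: Optional[Path] = None,
) -> str:
    """Guess which credential source would be used first on this host.

    Hypothetical helper for troubleshooting; it only checks for the
    presence of variables and files, never their validity.
    """
    env = os.environ if env is None else env
    home = Path.home() if home is None else home

    # 1. Static keys supplied through environment variables.
    if env.get("AWS_ACCESS_KEY_ID") and env.get("AWS_SECRET_ACCESS_KEY"):
        return "environment variables"

    # 2-3. Shared files; SSO sessions are declared in ~/.aws/config too.
    aws_dir = home / ".aws"
    if (aws_dir / "credentials").is_file() or (aws_dir / "config").is_file():
        return "shared credentials/config files (including SSO sessions)"

    # 4. Web identity token, as injected by EKS IRSA.
    if env.get("AWS_WEB_IDENTITY_TOKEN_FILE") and env.get("AWS_ROLE_ARN"):
        return "web identity token"

    # 5. ECS task role, advertised through a container credentials URI.
    if env.get("AWS_CONTAINER_CREDENTIALS_RELATIVE_URI") or env.get(
        "AWS_CONTAINER_CREDENTIALS_FULL_URI"
    ):
        return "ECS container credentials"

    # 6. Nothing else found: the instance metadata service is the last resort.
    return "EC2 instance metadata (IMDSv2)"


if __name__ == "__main__":
    print(likely_credential_source())
```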

IAM Permissions

Ensure the IAM role or user has appropriate permissions for all AWS services used:
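
As a minimal sketch of a least-privilege starting point, the following Python builds an IAM policy document that grants read access to one S3 bucket and one Secrets Manager secret. The bucket name, secret ARN, and the choice of actions are assumptions for illustration; the permissions each Spice component actually needs should be taken from the connector documentation linked in the tables above.

```python
import json


def build_read_policy(bucket: str, secret_arn: str) -> dict:
    """Build an example read-only IAM policy document.

    Hypothetical helper: the action list is a plausible minimum for reading
    S3 objects and one secret, not an official Spice requirement.
    """
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Sid": "ReadS3Objects",
                "Effect": "Allow",
                "Action": ["s3:GetObject"],
                "Resource": f"arn:aws:s3:::{bucket}/*",
            },
            {
                "Sid": "ListS3Bucket",
                "Effect": "Allow",
                "Action": ["s3:ListBucket"],
                "Resource": f"arn:aws:s3:::{bucket}",
            },
            {
                "Sid": "ReadSecret",
                "Effect": "Allow",
                "Action": ["secretsmanager:GetSecretValue"],
                "Resource": secret_arn,
            },
        ],
    }


if __name__ == "__main__":
    # Placeholder bucket and secret; replace with real resource names.
    policy = build_read_policy(
        bucket="example-bucket",
        secret_arn="arn:aws:secretsmanager:us-east-1:123456789012:secret:example",
    )
    print(json.dumps(policy, indent=2))
```

Scoping each statement to a specific resource ARN, rather than `*`, keeps the role limited to the data the Spicepod actually reads.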

Deployment Options

Deploy Spice on AWS infrastructure for optimal performance and integration:

Option	Description	Documentation
Amazon EKS	Kubernetes orchestration with Helm chart deployment	AWS Deployment
Amazon ECS	Container service with Fargate or EC2 launch types	AWS Deployment
Amazon EC2	Direct deployment with Docker or binary	AWS Deployment

Resources

Marketplace

Spice.ai on AWS Marketplace - Deploy Spice.ai from AWS Marketplace

Quick Start

Get started with Spice on AWS in minutes:

Install Spice CLI:

Configure AWS credentials:

Create a Spicepod with S3 data:

Start the runtime:

Query your data:
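
One possible way to run a query from Python is to post SQL to the runtime's HTTP endpoint. This sketch assumes the runtime is listening on `http://localhost:8090` and accepts a plain-text SQL body at `/v1/sql`, returning JSON rows; the helper names and the dataset name in the usage comment are hypothetical, so check the Spice API reference for your version before relying on it.

```python
import json
import urllib.request

# Assumption: default address of the Spice runtime HTTP API.
DEFAULT_BASE_URL = "http://localhost:8090"


def build_sql_request(
    sql: str, base_url: str = DEFAULT_BASE_URL
) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying a SQL statement."""
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/v1/sql",
        data=sql.encode("utf-8"),
        headers={"Content-Type": "text/plain"},
        method="POST",
    )


def run_sql(sql: str, base_url: str = DEFAULT_BASE_URL, timeout: float = 30.0):
    """Send the query and decode the response, assumed to be JSON."""
    request = build_sql_request(sql, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Usage, with a running runtime and a hypothetical dataset named my_s3_data:
#   rows = run_sql("SELECT * FROM my_s3_data LIMIT 10")
```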