
Spice.ai provides deep integrations with Amazon Web Services (AWS), enabling data federation, AI inference, vector search, and secure secret management across the AWS ecosystem. This page consolidates all AWS-compatible components and provides quick access to configuration guides.
Data connectors federate SQL queries across AWS data sources without data movement.
| Connector | Description | Documentation |
|---|---|---|
| Amazon S3 | Query Parquet, CSV, and JSON files stored in S3 buckets. Supports private buckets with IAM authentication and S3-compatible storage like MinIO. | S3 Data Connector |
| Amazon S3 Tables | Query Iceberg tables in Amazon S3 Tables using the Glue connector with S3 Tables catalog format. | Glue Data Connector |
| Amazon DynamoDB | Federated SQL queries on DynamoDB tables with automatic schema inference. | DynamoDB Data Connector |
| Amazon DynamoDB Streams | Real-time CDC streaming of table changes via DynamoDB Streams. | DynamoDB Data Connector |
| Amazon Redshift | Connect to Redshift clusters using the PostgreSQL-compatible connector. | Redshift Data Connector |
| Amazon Aurora PostgreSQL | Connect to Aurora PostgreSQL clusters using the PostgreSQL connector. | PostgreSQL Data Connector |
| Amazon Aurora MySQL | Connect to Aurora MySQL clusters using the MySQL connector. | MySQL Data Connector |
| Amazon RDS PostgreSQL | Connect to RDS PostgreSQL instances using the PostgreSQL connector. | PostgreSQL Data Connector |
| Amazon RDS MySQL | Connect to RDS MySQL instances using the MySQL connector. | MySQL Data Connector |
| Amazon MSK | Stream data from Amazon MSK (Managed Streaming for Apache Kafka) topics using the Kafka connector. | Kafka Data Connector |
| Debezium (Amazon MSK) | Change Data Capture (CDC) from databases via Debezium running on Amazon MSK for real-time dataset updates. | Debezium Data Connector |
| AWS Glue Data Catalog | Query Iceberg tables registered in AWS Glue. | Glue Data Connector |
| Apache Iceberg (AWS) | Query Iceberg tables stored in S3 with Glue or REST catalog metadata. | Iceberg Data Connector |
| Delta Lake (S3) | Query Delta Lake tables stored in Amazon S3. | Delta Lake Data Connector |
| AWS Athena (ODBC) | Connect to Athena using the ODBC connector with Athena SQL dialect support. | ODBC Data Connector |
Catalog connectors provide schema discovery and unified access to tables in AWS data catalogs.
| Connector | Description | Documentation |
|---|---|---|
| AWS Glue Catalog | Discover and query tables from AWS Glue Data Catalog with glob pattern filtering. | Glue Catalog Connector |
Spice integrates with Amazon Bedrock for large language model inference, supporting Amazon Nova and other foundation models.
| Provider | Supported Models | Documentation |
|---|---|---|
| Amazon Bedrock | Amazon Nova (Micro, Lite, Pro, Premier), cross-region inference profiles | Bedrock Models |
Bedrock Guardrails can filter model inputs and outputs:
Generate vector embeddings using Amazon Bedrock embedding models for semantic search and RAG applications.
| Provider | Supported Models | Documentation |
|---|---|---|
| Amazon Bedrock | Amazon Titan Embeddings, Amazon Nova Multimodal Embeddings, Cohere Embed | Bedrock Embeddings |
Amazon S3 Vectors is a new S3 bucket type for storing and querying vector embeddings at scale. Spice integrates S3 Vectors as a vector index backend for hybrid search applications.
| Engine | Description | Documentation |
|---|---|---|
| Amazon S3 Vectors | Sub-second similarity queries on billions of vectors with up to 90% cost reduction compared to traditional vector databases. | S3 Vectors Engine |
Spice Cayenne data accelerator supports AWS S3 Express One Zone for storing accelerated data with single-digit millisecond latency. This is ideal for latency-sensitive query workloads that require persistent storage while maintaining fast access.
:::tip Storage Recommendation For best performance, store Cayenne data files on local NVMe storage. Use S3 Express One Zone only when persistence of accelerations is required, such as preserving accelerated data across restarts or sharing data between multiple Spice instances. :::
| Accelerator | Description | Documentation |
|---|---|---|
| Spice Cayenne | High-performance data accelerator using Vortex file format with S3 Express One Zone for sub-10ms latency query performance. | Cayenne Accelerator |
S3 Express One Zone directory buckets provide:
S3 Express One Zone is available in select regions. Spice automatically derives the region from zone IDs:
| Zone ID Prefix | Region |
|---|---|
use1 | us-east-1 |
use2 | us-east-2 |
usw1 | us-west-1 |
usw2 | us-west-2 |
euw1 | eu-west-1 |
euc1 | eu-central-1 |
apne1 | ap-northeast-1 |
apse1 | ap-southeast-1 |
See AWS documentation for the complete list of S3 Express One Zone availability zones.
Securely store and retrieve credentials using AWS Secrets Manager.
| Store | Description | Documentation |
|---|---|---|
| AWS Secrets Manager | Read secrets from AWS Secrets Manager by secret name. | AWS Secrets Manager |
All AWS integrations support the standard AWS SDK credential chain. When credentials are not explicitly configured, Spice loads them from the following sources in order:
AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, AWS_SESSION_TOKEN~/.aws/credentials and ~/.aws/configaws configure ssoEnsure the IAM role or user has appropriate permissions for all AWS services used:
Deploy Spice on AWS infrastructure for optimal performance and integration:
| Option | Description | Documentation |
|---|---|---|
| Amazon EKS | Kubernetes orchestration with Helm chart deployment | AWS Deployment |
| Amazon ECS | Container service with Fargate or EC2 launch types | AWS Deployment |
| Amazon EC2 | Direct deployment with Docker or binary | AWS Deployment |
Getting started with Amazon S3 Vectors and Spice - YouTube walkthrough
How Spice AI operationalizes data lakes for AI using Amazon S3 - Spice presentation at re:Invent
Get started with Spice on AWS in minutes: