Connect your data to Spice
Our data connectors enable you to query data where it already lives using standard SQL, accelerate working sets locally, and integrate operational and analytical datasets into one unified runtime.
Integrations with 30+ data sources
Spice connects to the leading databases, warehouses, data lakes, streaming systems, and more.
AWS Glue
AWS Glue is a serverless data integration service that simplifies the process of discovering, preparing, moving, and integrating data from multiple sources. The Glue Data Connector enables federated SQL querying on tables in an AWS Glue Data Catalog.
Learn more
Amazon S3
Amazon Simple Storage Service (S3) is a service offered by Amazon Web Services (AWS) that provides object storage. The S3 Data Connector enables federated SQL querying on files stored in S3 or S3-compatible systems (e.g., MinIO, Cloudflare R2, NetApp OnTap).
Learn more
Amazon DynamoDB
Amazon DynamoDB is a fully managed NoSQL database service that provides fast and predictable performance with seamless scalability. This connector enables using DynamoDB tables as data sources for federated SQL queries in Spice.
Learn more
Apache Kafka
Kafka is an open-source distributed event streaming platform used for high-performance data pipelines, streaming analytics, and data integration. The Kafka Data Connector enables direct acceleration of data from Apache Kafka topics.
Learn more
Debezium
Debezium is an open-source platform that enables Change Data Capture (CDC) for efficient real-time updates of locally accelerated datasets. Spice supports connecting to a Kafka topic managed by Debezium to keep datasets up-to-date with source data.
Learn more
ClickHouse
ClickHouse is a fast, open-source columnar database management system designed for online analytical processing (OLAP) and real-time analytics. This connector enables federated SQL queries from a ClickHouse server.
Learn more
Oracle
Oracle offers a comprehensive and fully integrated cloud platform service. The Oracle Data Connector enables SQL queries on data stored in Oracle databases, including on-premises instances, Oracle Cloud User-Managed Databases, and Oracle Cloud Autonomous Databases (ADB).
Learn more
Apache Iceberg
Apache Iceberg is a high-performance format for huge analytic tables. The Iceberg Data Connector helps query Apache Iceberg tables using federated SQL.
Learn more
Apache Spark
Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning. Use the connector for federated SQL queries against a Spark Cluster using Spark Connect.
Learn more
ODBC
ODBC (Open Database Connectivity) is a standard API that allows applications to connect to and interact with various database management systems using a common interface. Connect to any ODBC database for federated/accelerated SQL queries with the Spice ODBC Connector.
Learn more
Microsoft SQL Server
Microsoft SQL Server (MSSQL) is a proprietary relational database management system (RDBMS). The Microsoft SQL Server Data Connector enables federated/accelerated SQL queries on data stored in MSSQL databases.
Learn more
FlightSQL
FlightSQL is an open-source protocol built on Apache Arrow Flight for high-performance, columnar data transfer between SQL databases and clients. Use the connector for federated SQL queries across any Flight SQL compatible server.
Learn more
GraphQL
GraphQL is an open-source query language for APIs and a server-side runtime. The GraphQL Data Connector enables federated SQL queries on any GraphQL endpoint.
Learn more
Delta Lake
An open-source storage layer that improves the reliability and performance of data lakes by bringing database-like features such as ACID transactions and schema enforcement. The Delta Lake data connector enables SQL queries from Delta Lake tables.
Learn more
Dremio
Dremio is a data lake engine that enables high-performance SQL queries directly on data lake storage. This connector enables using Dremio as a data source for federated SQL queries.
Learn more
DuckDB
DuckDB is an in-process SQL OLAP (Online Analytical Processing) database management system designed for analytical query workloads. This connector supports DuckDB persistent databases as a data source for federated SQL queries.
Learn more
MySQL
MySQL is an open-source relational database management system that uses structured query language (SQL) for managing and manipulating databases. The MySQL Data Connector enables federated/accelerated SQL queries on data stored in MySQL databases.
Learn more
PostgreSQL
PostgreSQL is an advanced open-source relational database management system known for its robustness, extensibility, and support for SQL compliance. The PostgreSQL Server Data Connector enables federated/accelerated SQL queries on data stored in PostgreSQL databases.
Learn more
GitHub
GitHub is a developer platform that allows developers to create, store, manage, and share their code. The GitHub Data Connector enables federated SQL queries on various GitHub resources such as files, issues, pull requests, and commits.
Learn more
MongoDB
MongoDB is an open-source NoSQL database that stores data in flexible, JSON-like documents, allowing for dynamic schemas and easy scalability. The MongoDB Data Connector enables federated/accelerated SQL queries on data stored in MongoDB databases.
Learn more
Snowflake
Snowflake is a cloud data platform that allows for data analysis and simultaneous access of data sets with minimal latency. The Snowflake Data Connector enables federated SQL queries across datasets in the Snowflake Cloud Data Warehouse.
Learn more
Databricks
Databricks is a cloud-based platform for data analytics and artificial intelligence. Use the connector for federated SQL queries against Databricks using Spark Connect, directly from Delta Lake tables, or using the SQL Statement Execution API.
Learn more
ABFS
The Azure Blob Filesystem driver (ABFS) is a dedicated Azure Storage driver for Hadoop. The Azure BlobFS (ABFS) Data Connector enables federated SQL queries on files stored in Azure Blob-compatible endpoints.
Learn more
Amazon Redshift
Amazon Redshift is a columnar OLAP database compatible with PostgreSQL. Connect Redshift to Spice using the PostgreSQL data connector with Redshift cluster connection parameters for federated SQL queries.
Learn more
File
The File Data Connector enables federated SQL queries on files stored by locally accessible filesystems. It supports querying individual files or entire directories with Parquet, CSV, JSON, and document formats.
Learn more
FTP/SFTP
FTP and SFTP are network protocols for transferring files between a client and server. The FTP/SFTP Data Connector enables federated/accelerated SQL query across supported file formats stored in FTP/SFTP servers.
Learn more
HTTP/HTTPS
The HTTP(s) Data Connector enables federated SQL query across supported file formats stored at an HTTP(s) endpoint. The connector supports dynamic query and data refresh through SQL-based filtering.
Learn more
IMAP
The IMAP Data Connector enables federated SQL query across emails stored in an IMAP email server. Query email headers, subjects, senders, recipients, and message content with standard SQL.
Learn more
SharePoint
The SharePoint Data Connector enables federated SQL queries on documents stored in Microsoft SharePoint. Connect to SharePoint drives, sites, and groups to query unstructured documents.
Learn more
Data Accelerators
Locally materialize and accelerate data using embedded engines like DuckDB or SQLite for high-performance queries.
Apache Arrow
In-Memory Arrow Records accelerator provides the fastest query performance by storing data in memory using Apache Arrow columnar format. Default accelerator for datasets.
Learn more
DuckDB
DuckDB is an embedded analytical database that supports both in-memory and file-based acceleration. Ideal for larger datasets that need to persist across restarts.
Learn more
SQLite
SQLite accelerator provides embedded database acceleration with support for both in-memory and file-based storage. Great for lightweight, persistent acceleration.
Learn more
PostgreSQL
PostgreSQL accelerator enables acceleration to an attached PostgreSQL database. Use for enterprise deployments requiring robust database infrastructure.
Learn more
Spice Cayenne
Spice Cayenne (Vortex) is a high-performance file-based accelerator optimized for analytical workloads. Available in Spice v1.9.0-rc.1 and later.
Learn more
Catalog Connectors
Connect to external catalog providers and make their tables available for federated SQL query in Spice.
Unity Catalog
Unity Catalog is an open-source data catalog for data and AI governance. Connect to Unity Catalog to access Delta Lake tables with fine-grained access control.
Learn more
Databricks Catalog
Connect to a Databricks Unity Catalog provider to access tables across your Databricks workspace using Spark Connect or S3/Delta Lake.
Learn more
Apache Iceberg Catalog
Connect to an Apache Iceberg catalog provider to query Iceberg tables. Supports REST, AWS Glue, and Hive Metastore catalog backends.
Learn more
AWS Glue Catalog
Connect to an AWS Glue Data Catalog to access tables across your data lake. Supports Parquet and Iceberg table formats.
Learn more
Model Providers
Connect to leading AI model providers for LLM inference, embeddings, and traditional ML models.
OpenAI
Connect to OpenAI or OpenAI-compatible endpoints for GPT models. Supports chat completions, embeddings, and tool use with models like GPT-4o.
Learn more
Azure OpenAI
Connect to Azure OpenAI Service for enterprise-grade OpenAI models with Azure security and compliance features.
Learn more
Claude
Connect to Anthropic for Claude models. Supports chat completions and tool use with Claude 3 family of models.
Learn more
HuggingFace
Load and serve machine learning models from HuggingFace Hub. Supports ONNX, GGUF, GGML, and SafeTensor formats.
Learn more
Amazon Bedrock
Connect to Amazon Bedrock for access to foundation models from AI21 Labs, Anthropic, Cohere, Meta, and Amazon via AWS infrastructure.
Learn more
Perplexity
Connect to Perplexity AI for models optimized for search and information retrieval tasks with real-time web access.
Learn more
xAI
Connect to xAI for Grok models. Supports chat completions with OpenAI-compatible API interface.
Learn more
Databricks Mosaic AI
Connect to Databricks Mosaic AI for models deployed on the Databricks platform with enterprise security and governance.
Learn more
Filesystem
Load and serve models from the local filesystem. Supports ONNX, GGUF, GGML, and SafeTensor model formats.
Learn more
Vector Engines
Index and efficiently search vector embeddings for semantic search and AI applications.
Amazon S3 Vectors
Store and search vector embeddings using Amazon S3 Vectors. Provides efficient vector search operations and avoids unnecessary recomputation of embeddings.
Learn more
DuckDB
Use DuckDB as a vector engine for efficient in-memory or file-based vector search. Supports similarity search with embedded analytical capabilities.
Learn more
SQLite
Lightweight embedded vector engine using SQLite. Ideal for edge deployments and applications requiring portable vector search capabilities.
Learn more
PostgreSQL (pgvector)
Use PostgreSQL with pgvector and pgvectorscale extensions for production-grade vector search. Supports IVFFlat and HNSW indexing for high-performance similarity queries.
Learn more
Deployed in production
Teams trust Spice to bring inference closer to their data, enabling low-latency, enterprise-grade AI across industries.

"Partnering with Spice AI has transformed how NRC Health delivers AI-driven insights. By unifying siloed data across systems, we accelerated AI feature development, reducing time-to-market from months to weeks - and sometimes days. With predictable costs and faster innovation, Spice is not just solving some of our data and AI challenges - it is helping us redefine personalized healthcare."
Tim Ottersburg
VP of Technology, NRC Health

"Spice AI grounds AI in our actual data, using SQL queries across all our data. This brings accuracy to probabilistic AI systems, which are very prone to hallucinations."
Rachel Wong
CTO, Basis Set
Integrations across all of your data sources
Accelerate your data and AI stack with a library of 30+ prebuilt connectors for the most common databases, warehouses, and file stores - from Databricks and S3 to MySQL and PostgreSQL. Govern AI sandboxes with the same policies as your production data environments.

See Spice in action
Get a guided walkthrough of how development teams use Spice to query, accelerate, and integrate AI for mission-critical workloads.
Get a demo




















