Connect your data to Spice

Our data connectors enable you to query data where it already lives using standard SQL, accelerate working sets locally, and integrate operational and analytical datasets into one unified runtime.

Integrations with 30+ data sources

Spice connects to the leading databases, warehouses, data lakes, streaming systems, and more.

AWS Glue
AWS Glue

AWS Glue is a serverless data integration service that simplifies the process of discovering, preparing, moving, and integrating data from multiple sources. The Glue Data Connector enables federated SQL querying on tables in an AWS Glue Data Catalog.

Learn more

Amazon S3
Amazon S3

Amazon Simple Storage Service (S3) is a service offered by Amazon Web Services (AWS) that provides object storage. The S3 Data Connector enables federated SQL querying on files stored in S3 or S3-compatible systems (e.g., MinIO, Cloudflare R2, NetApp OnTap).

Learn more

Amazon DynamoDB
Amazon DynamoDB

Amazon DynamoDB is a fully managed NoSQL database service that provides fast and predictable performance with seamless scalability. This connector enables using DynamoDB tables as data sources for federated SQL queries in Spice.

Learn more

Apache Kafka
Apache Kafka

Kafka is an open-source distributed event streaming platform used for high-performance data pipelines, streaming analytics, and data integration. The Kafka Data Connector enables direct acceleration of data from Apache Kafka topics.

Learn more

Debezium
Debezium

Debezium is an open-source platform that enables Change Data Capture (CDC) for efficient real-time updates of locally accelerated datasets. Spice supports connecting to a Kafka topic managed by Debezium to keep datasets up-to-date with source data.

Learn more

ClickHouse
ClickHouse

ClickHouse is a fast, open-source columnar database management system designed for online analytical processing (OLAP) and real-time analytics. This connector enables federated SQL queries from a ClickHouse server.

Learn more

Oracle
Oracle

Oracle offers a comprehensive and fully integrated cloud platform service. The Oracle Data Connector enables SQL queries on data stored in Oracle databases, including on-premises instances, Oracle Cloud User-Managed Databases, and Oracle Cloud Autonomous Databases (ADB).

Learn more

Apache Iceberg
Apache Iceberg

Apache Iceberg is a high-performance format for huge analytic tables. The Iceberg Data Connector helps query Apache Iceberg tables using federated SQL.

Learn more

Apache Spark
Apache Spark

Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning. Use the connector for federated SQL queries against a Spark Cluster using Spark Connect.

Learn more

ODBC
ODBC

ODBC (Open Database Connectivity) is a standard API that allows applications to connect to and interact with various database management systems using a common interface. Connect to any ODBC database for federated/accelerated SQL queries with the Spice ODBC Connector.

Learn more

Microsoft SQL Server
Microsoft SQL Server

Microsoft SQL Server (MSSQL) is a proprietary relational database management system (RDBMS). The Microsoft SQL Server Data Connector enables federated/accelerated SQL queries on data stored in MSSQL databases.

Learn more

FlightSQL
FlightSQL

FlightSQL is an open-source protocol built on Apache Arrow Flight for high-performance, columnar data transfer between SQL databases and clients. Use the connector for federated SQL queries across any Flight SQL compatible server.

Learn more

GraphQL
GraphQL

GraphQL is an open-source query language for APIs and a server-side runtime. The GraphQL Data Connector enables federated SQL queries on any GraphQL endpoint.

Learn more

Delta Lake
Delta Lake

An open-source storage layer that improves the reliability and performance of data lakes by bringing database-like features such as ACID transactions and schema enforcement. The Delta Lake data connector enables SQL queries from Delta Lake tables.

Learn more

Dremio
Dremio

Dremio is a data lake engine that enables high-performance SQL queries directly on data lake storage. This connector enables using Dremio as a data source for federated SQL queries.

Learn more

DuckDB
DuckDB

DuckDB is an in-process SQL OLAP (Online Analytical Processing) database management system designed for analytical query workloads. This connector supports DuckDB persistent databases as a data source for federated SQL queries.

Learn more

MySQL
MySQL

MySQL is an open-source relational database management system that uses structured query language (SQL) for managing and manipulating databases. The MySQL Data Connector enables federated/accelerated SQL queries on data stored in MySQL databases.

Learn more

PostgreSQL
PostgreSQL

PostgreSQL is an advanced open-source relational database management system known for its robustness, extensibility, and support for SQL compliance. The PostgreSQL Server Data Connector enables federated/accelerated SQL queries on data stored in PostgreSQL databases.

Learn more

GitHub
GitHub

GitHub is a developer platform that allows developers to create, store, manage, and share their code. The GitHub Data Connector enables federated SQL queries on various GitHub resources such as files, issues, pull requests, and commits.

Learn more

MongoDB
MongoDB

MongoDB is an open-source NoSQL database that stores data in flexible, JSON-like documents, allowing for dynamic schemas and easy scalability. The MongoDB Data Connector enables federated/accelerated SQL queries on data stored in MongoDB databases.

Learn more

Snowflake
Snowflake

Snowflake is a cloud data platform that allows for data analysis and simultaneous access of data sets with minimal latency. The Snowflake Data Connector enables federated SQL queries across datasets in the Snowflake Cloud Data Warehouse.

Learn more

Databricks
Databricks

Databricks is a cloud-based platform for data analytics and artificial intelligence. Use the connector for federated SQL queries against Databricks using Spark Connect, directly from Delta Lake tables, or using the SQL Statement Execution API.

Learn more

ABFS
ABFS

The Azure Blob Filesystem driver (ABFS) is a dedicated Azure Storage driver for Hadoop. The Azure BlobFS (ABFS) Data Connector enables federated SQL queries on files stored in Azure Blob-compatible endpoints.

Learn more

Amazon Redshift
Amazon Redshift

Amazon Redshift is a columnar OLAP database compatible with PostgreSQL. Connect Redshift to Spice using the PostgreSQL data connector with Redshift cluster connection parameters for federated SQL queries.

Learn more

File
File

The File Data Connector enables federated SQL queries on files stored by locally accessible filesystems. It supports querying individual files or entire directories with Parquet, CSV, JSON, and document formats.

Learn more

FTP/SFTP
FTP/SFTP

FTP and SFTP are network protocols for transferring files between a client and server. The FTP/SFTP Data Connector enables federated/accelerated SQL query across supported file formats stored in FTP/SFTP servers.

Learn more

HTTP/HTTPS
HTTP/HTTPS

The HTTP(s) Data Connector enables federated SQL query across supported file formats stored at an HTTP(s) endpoint. The connector supports dynamic query and data refresh through SQL-based filtering.

Learn more

IMAP
IMAP

The IMAP Data Connector enables federated SQL query across emails stored in an IMAP email server. Query email headers, subjects, senders, recipients, and message content with standard SQL.

Learn more

SharePoint
SharePoint

The SharePoint Data Connector enables federated SQL queries on documents stored in Microsoft SharePoint. Connect to SharePoint drives, sites, and groups to query unstructured documents.

Learn more

Data Accelerators

Locally materialize and accelerate data using embedded engines like DuckDB or SQLite for high-performance queries.

Apache Arrow
Apache Arrow

In-Memory Arrow Records accelerator provides the fastest query performance by storing data in memory using Apache Arrow columnar format. Default accelerator for datasets.

Learn more

DuckDB
DuckDB

DuckDB is an embedded analytical database that supports both in-memory and file-based acceleration. Ideal for larger datasets that need to persist across restarts.

Learn more

SQLite
SQLite

SQLite accelerator provides embedded database acceleration with support for both in-memory and file-based storage. Great for lightweight, persistent acceleration.

Learn more

PostgreSQL
PostgreSQL

PostgreSQL accelerator enables acceleration to an attached PostgreSQL database. Use for enterprise deployments requiring robust database infrastructure.

Learn more

Spice Cayenne
Spice Cayenne

Spice Cayenne (Vortex) is a high-performance file-based accelerator optimized for analytical workloads. Available in Spice v1.9.0-rc.1 and later.

Learn more

Catalog Connectors

Connect to external catalog providers and make their tables available for federated SQL query in Spice.

Unity Catalog
Unity Catalog

Unity Catalog is an open-source data catalog for data and AI governance. Connect to Unity Catalog to access Delta Lake tables with fine-grained access control.

Learn more

Databricks
Databricks Catalog

Connect to a Databricks Unity Catalog provider to access tables across your Databricks workspace using Spark Connect or S3/Delta Lake.

Learn more

Apache Iceberg
Apache Iceberg Catalog

Connect to an Apache Iceberg catalog provider to query Iceberg tables. Supports REST, AWS Glue, and Hive Metastore catalog backends.

Learn more

AWS Glue
AWS Glue Catalog

Connect to an AWS Glue Data Catalog to access tables across your data lake. Supports Parquet and Iceberg table formats.

Learn more

Model Providers

Connect to leading AI model providers for LLM inference, embeddings, and traditional ML models.

OpenAI
OpenAI

Connect to OpenAI or OpenAI-compatible endpoints for GPT models. Supports chat completions, embeddings, and tool use with models like GPT-4o.

Learn more

Azure OpenAI
Azure OpenAI

Connect to Azure OpenAI Service for enterprise-grade OpenAI models with Azure security and compliance features.

Learn more

Claude
Claude

Connect to Anthropic for Claude models. Supports chat completions and tool use with Claude 3 family of models.

Learn more

HuggingFace
HuggingFace

Load and serve machine learning models from HuggingFace Hub. Supports ONNX, GGUF, GGML, and SafeTensor formats.

Learn more

Amazon Bedrock
Amazon Bedrock

Connect to Amazon Bedrock for access to foundation models from AI21 Labs, Anthropic, Cohere, Meta, and Amazon via AWS infrastructure.

Learn more

Perplexity
Perplexity

Connect to Perplexity AI for models optimized for search and information retrieval tasks with real-time web access.

Learn more

xAI
xAI

Connect to xAI for Grok models. Supports chat completions with OpenAI-compatible API interface.

Learn more

Databricks
Databricks Mosaic AI

Connect to Databricks Mosaic AI for models deployed on the Databricks platform with enterprise security and governance.

Learn more

Filesystem
Filesystem

Load and serve models from the local filesystem. Supports ONNX, GGUF, GGML, and SafeTensor model formats.

Learn more

Vector Engines

Index and efficiently search vector embeddings for semantic search and AI applications.

Amazon S3 Vectors
Amazon S3 Vectors

Store and search vector embeddings using Amazon S3 Vectors. Provides efficient vector search operations and avoids unnecessary recomputation of embeddings.

Learn more

DuckDB
DuckDB

Use DuckDB as a vector engine for efficient in-memory or file-based vector search. Supports similarity search with embedded analytical capabilities.

Learn more

SQLite
SQLite

Lightweight embedded vector engine using SQLite. Ideal for edge deployments and applications requiring portable vector search capabilities.

Learn more

PostgreSQL pgvector
PostgreSQL (pgvector)

Use PostgreSQL with pgvector and pgvectorscale extensions for production-grade vector search. Supports IVFFlat and HNSW indexing for high-performance similarity queries.

Learn more

Deployed in production

Teams trust Spice to bring inference closer to their data, enabling low-latency, enterprise-grade AI across industries.

NRC Health
Basis Set
Tim Ottersburg

"Partnering with Spice AI has transformed how NRC Health delivers AI-driven insights. By unifying siloed data across systems, we accelerated AI feature development, reducing time-to-market from months to weeks - and sometimes days. With predictable costs and faster innovation, Spice is not just solving some of our data and AI challenges - it is helping us redefine personalized healthcare."

Tim Ottersburg

VP of Technology, NRC Health

Rachel Wong

"Spice AI grounds AI in our actual data, using SQL queries across all our data. This brings accuracy to probabilistic AI systems, which are very prone to hallucinations."

Rachel Wong

CTO, Basis Set

Integrations across all of your data sources

Accelerate your data and AI stack with a library of 30+ prebuilt connectors for the most common databases, warehouses, and file stores - from Databricks and S3 to MySQL and PostgreSQL. Govern AI sandboxes with the same policies as your production data environments.

Data Connectors

See Spice in action

Get a guided walkthrough of how development teams use Spice to query, accelerate, and integrate AI for mission-critical workloads.

Get a demo

content stat graphiccontent stat graphiccontent stat orb