Your Data and AI Stack in One Engine

Federated SQL query, hybrid search, and LLM inference in a portable and open-source runtime

Deployed in Production at

Accelerated Data.
Hybrid Search.
LLM Inference.

Federate and Accelerate Data with Zero ETL
Connect to operational databases, data lakes, and warehouses across the enterprise. Use DuckDB and Apache Arrow acceleration to deliver sub-second performance.

Hybrid Search Across Data Estates
Run vector, keyword, and full-text search in the same SQL statement for more advanced retrieval pipelines.

Serve and Ground Any LLM Model
Serve local and hosted LLMs from OpenAI, Anthropic, xAI, or NVIDIA NIM. Combine inference with acceleration and hybrid search to power latency-sensitive AI applications.

Spice dashboard with playground tab selected

Built on open-source

The Operational Data Lakehouse

Spice is the only data lakehouse purpose-built for operational data use cases, not just analytics. Massively improve performance and eliminate the need for ETL pipelines, caches, and specialized databases - all in a portable 140MB runtime.

SQL Query Federation

  • Join data across databases, data warehouses, data lakes, and APIs in a single SQL query

  • Connectors for 30+ modern and legacy sources from Databricks, to MySQL, and CSV files on FTP servers

  • Industry-standard protocols including ODBC, JDBC, ADBC, HTTP, and Apache Arrow Flight (gRPC)

Data Acceleration

  • Fast, low-latency, high-concurrency query, search, and AI retrieval

  • Materialize and accelerate data in-memory or using embedded databases including DuckDB or SQLite

  • Keep accelerations updated in real-time with Change-Data-Capture (CDC) using Debezium

Hybrid SQL Search

  • Run vector, keyword, and full-text search in the same SQL query

  • Build retrieval pipelines that combine structured and unstructured data

  • Leverage open table formats (Iceberg, Delta, Hudi) and S3 Vectors without extra infrastructure

LLM Inference

  • Serve local or hosted LLM models (OpenAI, Anthropic, xAI, NVIDIA NIM)

  • Combine inference with search and retrieval for latency-sensitive apps

  • Integrate agentic RAG workflows with full observability and distributed tracing

Built by developers, for developers

Get Started with Just Three Lines of Code

Connect to your first dataset with a few lines of YAML and start querying or serving models in minutes. Developer-friendly Node.js, Go, Python, and Rust SDKs make accessing and using petabyte-scale data easy.
Code snippets with just a few lines of code to install

How Spice.ai
Works

Focus on your application. Spice.ai brings together enterprise-grade data and AI infrastructure, serverless compute, storage, ZK & ML GPU clusters, blockchain nodes, and indexing into a single, developer-focused platform.

SDKS
Node.js SDK
Python SDK
GO SDK
Rust SDK
More SDKs coming soon
Example Ecosystem Libraries
Pandas
PyTorch
Tensor Flow
Compatible with many more
1st part of diagram with enterprise-grade
Pandas
Web3.js
PyTorch
Tensor Flow
Compatible with many more
Data
AI
Compute
Storage
Blockchains
ZK & ML Accelerators
Ethereum
Polygon
Bitcoin
OP Stack
Flow
1TB Ram
8x GPU
Many more
Real-Time Data
Indexing • SQL Query • Firecache
Complete
Historical Data
Indexing • SQL Query • Firecache
Hosted AI & ML
ML Pipelines • Inferencing Models
Compute
Functions • Actors
Storage
DuckDB • Blob
AI Models
Predictions
Create
Your Own
DeFi
Uniswap • Sushiswap • Aave
NFTs
NFT Metadata • NFT Owners
Contract
Indexing
Token Prices
ENS
Wallet Balances
Native token balances
More coming soon
Training
Inferencing
Model Registry
Bare-Metal ZK/ML Acceleration
GPUs
FPGAs (Coming soon)
The Spice Cloud Platform vs. Building Yourself  
Spice AI

The Spice Cloud Platform

Multi-cloud, high-availability SOC2 deployments
Building & Operating In-House
Managed Infrastructure
Data & AI Infrastructure Cost
Included$5k to $50k
per month
Enterprise-grade high-availability and compliance
Included in Enterprise
Multi-cloud, high-availability SOC2 deployments
Twice as much as the Spice Cloud Platform
2x the total cost of infra and ops
High-performance caching for frontend & inferencing queries
Managed Spice.ai Open Source
$1k to $5k
per month
Engineering
Data and Infrastructure Engineering Cost
Included$15k to $20k
per month, per engineer
Time to Implement
Get started in minutes
Typically 3 to 6 months
Operations & Support
On-Going Operational Cost
Included
with Pro for Teams and Enterprise plans
$15k
per month, per ops engineer
99.9%+ Enterprise SLA & Support
Included in Enterprise
Self-managed 24/7 on-call
Testimonials goes here.
"I explored AI, embeddings, search algorithms, and features with our own database. I read a lot about this, but it was so much easier to use Spice than doing it from scratch."
Taher Ahmed
Software Engineering Manager at NRC Health
"With a simple drop in container, we are able to double our data redundancy by using Spice."
David Blum
Principal Software Engineer at Twilio
"What I like the most about Spice, it's very easy to collect data from different data sources and I am able to chat with this data and do everything in one place."
Dustin Warner
Director of Software Engineering at NRC Health
"It just kinda spins up and it just works, which is really nice."
Darin Douglass
Principal Software Engineer at Barracuda
"Spice opened the door to take these critical control-plane datasets and move them next to our services in the runtime path."
Peter Janovsky
Software Architect at Twilio
"It's just a huge gain in responsiveness for the customer."
David Stancu
Senior Principal Software Engineer at Barracuda
"Initially, we thought it's such a departure from where they are, it's going to take them a long time. But in a few weeks, it's like magic. We had everything working exactly as we wanted, and it's amazing."
Sreeram Kannan
Founder and CEO of EigenLayer
"Nethermind is committed to advancing the Ethereum re-staking ecosystem. We're excited that our re-staking dashboard, an early contribution to the EigenLayer community, now thrives on data powered by Spice AI."
Eliot Stock
CPO of Nethermind
"Other solutions were prohibitively expensive - what we could do in Spice with a single query would have taken millions of API calls in other platforms."
Andrew Dworschak
CEO & Co-founder of Yakoa
“We are a team of crypto and fintech product experts, not cloud and developer infrastructure , so we needed to evaluate solutions based on their ease of implementation and time to market. We value solutions that deliver performance of transactions and pricing data without compromising the developer experience
and accuracy.”
Kareem Khattab
CEO and Founder of Entendre
“Spice AI makes it insanely easy for to leverage high quality, reliable datasets on our platform. This is critical for our developers to gain insight into how the APIs are being used so we can enable Web3 developers to build their first dApp in minutes.”
Anir Agarwal
Founder and CEO of Niftory
“XMTP is enabling web3 messaging between wallets. Spice enables us to leverage the rich Python ecosystem of data science and ML tools to build the best web3 data-driven experiences possible.”
Peter Denton
Head of Growth, XMTP
arrow icon pointing left
arrow icon pointing right

Designed for Scale


A compute engine built to meet enterprise demands for security, reliability, and compliance.

Enterprise-Grade Performance & Security

The Spice Cloud Platform is built for enterprise, 99.9%+ high-availability, security, performance, and compliance backed by an enterprise-level SLA and support. SOC 2 Type II certified.

Apache Arrow

Access developer, ML, and Data-Science friendly APIs with HTTP and Apache Arrow Flight (gRPC) endpoints. Query across datasets with filtering and aggregation in seconds.

Combine SQL with code

Use developer-friendly SDKs to query data with ease, including joins across multiple datasets with filtering and aggregations, in just three lines of code.

Ecosystem Compatible

Fetch millions of rows of data and use libraries such as NumPy, Pandas, Tensorflow, and PyTorch with ease.

Datasets & Views

Define and create your own private datasets and views, available in SQL query and optionally shareable with anyone else.

Petabyte-Scale Data For Applications & ML

The Spice Cloud Platform is the fastest way to access real-time, time-series data. Get millions of records in seconds.
arrow icon pointing left
arrow icon pointing right

Built on Open-Source

Spice is powered by Apache Arrow, Flight, ADBC, DataFusion, Parquet, Iceberg, and more. Explore the platform, see where it’s headed, and join the community shaping its future.