Spice Cloud v1.7.0: DataFusion v49, Full-Text Search Updates & More
Spice Cloud & Spice.ai Enterprise 1.7.0 are now live, bringing performance upgrades with DataFusion v49, real-time full-text search indexing, EmbeddingGemma support, and improvements across search, embeddings, and API integrations. Spice Cloud customers will automatically upgrade to v1.7.0 on deployment, while Spice.ai Enterprise customers can consume the Enterprise v1.7.0 image from the Spice AWS Marketplace listing.
What’s New in v1.7.0
DataFusion v49 Upgrade
Spice now runs on DataFusion v49, delivering lower latency and improved query optimization.

DataFusion v49 highlights include:
- Dynamic filters and pushdown to skip unnecessary reads in
ORDER BY & LIMITqueries - Compressed spill files to reduce disk usage during large sorts and aggregations
- Support for ordered-set aggregates with
WITHIN GROUP - New
REGEXP_INSTRfunction to identify regex match positions
EmbeddingGemma Support
Spice now supports EmbeddingGemma, Google’s latest embedding model for text and documents. It delivers high-quality embeddings for semantic search, retrieval, and recommendation tasks. Configure it directly in your Spicepod via HuggingFace.
Embedding Request Caching
Repeated embedding requests can now be cached in the Spice runtime. This reduces both latency and costs, with configurable cache size and TTL options. Check out the caching documentation for more details.
Real-Time Indexing for Full Text Search
Full-text indexing now supports real-time changes from CDC streams such as Debezium. New events are searchable as they arrive, ensuring continuously fresh results.
OpenAI Responses API Tool Calls with Streaming
The OpenAI Responses API in Spice now supports tool calls with streaming. Results from tools like web_search and code_interpreter are streamed as they’re generated, enabling more responsive agent and application experiences.
Bug & Stability Fixes
v1.7.0 includes numerous fixes and improvements:
- CDC streams readiness and full-text indexing reliability
- Vector search pipeline and
vector_searchUDTF fixes - Kafka schema inference, consumer group persistence, and cooperative mode
- Error reporting improvements (e.g., ThrottlingException handling)
- Iceberg connector support for
LIMITpushdown - S3 Vector ingestion reliability and tracing fixes
v1.7 Release Community Call
We’ll walk through highlights of v1.7 live on our Release Community Call. Join us to see the new functionality in action and bring your questions! Register here.
.png)
Explore more Spice resources
Tutorials, docs, and blog posts to help you go deeper with Spice.
How we use Apache DataFusion at Spice AI
Why we chose to build on DataFusion and how we extended it with custom TableProviders, optimizer rules, and UDFs for federated SQL

Introducing Spice Cayenne: The Next-Generation Data Accelerator Built on Vortex for Performance and Scale
Spice Cayenne is the next-generation Spice.ai data accelerator built for high-scale and low latency data lake workloads. It combines the Vortex columnar format with an embedded metadata engine to deliver faster queries and significantly lower memory usage than existing Spice data accelerators, including DuckDB and SQLite.

Spice Cloud v1.10: Caching Acceleration Mode, DynamoDB Streams Support, & More!
Spice v1.10 includes a new caching acceleration mode, a new DynamoDB Streams data connector in preview, Amazon S3 location-based pruning, S3 Tables write support, and several performance and security improvements.

See Spice in action
Get a guided walkthrough of how development teams use Spice to query, accelerate, and integrate AI for mission-critical workloads.
Get a demo

