Learn how to define and augment datasets with embedding columns for advanced search capabilities.
Spice provides three distinct methods for handling embedding columns in datasets:
Before configuring dataset embeddings define the embedding models in the spicepod.yaml, for example:
See Embedding components for more information on embedding models.
Spice supports complex searches by utilizing embeddings. Both local and remote embedding models can be used for vector searches.
To run a vector search, embeddings must be defined for the relevant columns in your dataset. Once configured, similarity searches can be performed using the defined embeddings.
For detailed instructions and examples on running vector searches, refer to the Vector-Based Search documentation.
The embed() scalar function allows you to generate embeddings directly within SQL queries. This function can process both single text strings and arrays of text, making it useful for ad-hoc embedding generation and comparison operations.