spiceai/docs

trunk

/docs/website/versioned_docs/version-1.8.x/features/semantic-model/index.md

title: 'Semantic Model' sidebar_label: 'Semantic Model' description: 'Learn how to define and use semantic data models with Spice.' sidebar_position: 9 pagination_prev: null pagination_next: null

Semantic data models in Spice are defined using the datasets[*].columns configuration. These models provide structured and meaningful data representations, which are beneficial for both AI large language models (LLMs) and traditional data analysis.

Use-Cases

Large Language Models (LLMs)

The semantic model is automatically used by Spice Models as context to produce more accurate and context-aware AI responses.

Defining a Semantic Model

Semantic data models are defined within the spicepod.yaml file, specifically under the datasets section. Each dataset supports description, metadata, and a columns field where individual columns are described with metadata and features for utility and clarity.

Example Configuration

Example spicepod.yaml:

Dataset Metadata

Datasets can be defined with the following metadata:

instructions: Optional. Instructions to provide to a language model when using this dataset.
reference_url_template: Optional. A URL template for citation links.

For detailed metadata configuration, see the Dataset Reference

Column Definitions

Each column in the dataset can be defined with the following attributes:

description: Optional. A description of the column's contents and purpose.
embeddings: Optional. Vector embeddings configuration for this column.

For detailed columns configuration, see the Dataset Reference