The Spice.ai Cloud Platform includes a comprehensive set of free, ready-to-query sample datasets.
The Spice runtime can query these datasets using the Spice.ai Data Connector.
To access these community datasets, go to spice.ai and create an account by clicking 'Start for Free'.
After logging in, create an app to get an API key.
This quickstart uses the taxi_trips dataset from the Spice.ai app at https://spice.ai/spiceai/quickstart.
Step 1. Initialize a new project:
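As a minimal sketch of this step, the snippet below wraps project creation in a small function. It assumes that `spice init <name>` creates a directory named after the project. The name `spice_app` is a placeholder chosen for illustration, and `init_project` is a hypothetical helper, not part of the Spice CLI.

```shell
# Hypothetical helper: create a Spice project and change into its directory.
# Assumes `spice init <name>` creates a directory called <name>.
init_project() {
  name="${1:-spice_app}"   # placeholder project name
  spice init "$name" && cd "$name"
}
```

Calling `init_project spice_app` should leave the shell inside the new project directory, ready for the login step.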
Step 2. Log in to the Spice.ai Cloud Platform from the command line using the spice login command. A pop-up browser window will prompt you to authenticate:
Logging in will create or update a .env file in the project directory with the API key.
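To confirm that the login step wrote a key without printing the secret itself, a check along these lines may help. This is a sketch: it assumes the .env file uses plain `NAME=value` lines, and `env_key_names` is a hypothetical helper. The exact variable name the CLI writes is not stated here, so the function simply lists whatever names it finds.

```shell
# Hypothetical helper: print the variable names defined in a .env file,
# one per line, without revealing their values.
# Assumes NAME=value lines; comments, blank lines, and empty values are skipped.
env_key_names() {
  file="${1:-.env}"
  [ -f "$file" ] || { echo "missing: $file" >&2; return 1; }
  grep -E '^[A-Za-z_][A-Za-z0-9_]*=.+' "$file" | cut -d= -f1
}
```

Running `env_key_names .env` from the project directory should list at least one name if the login succeeded.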
Step 3. Start the runtime:
Step 4. Configure the dataset:
In a new terminal window, configure a new dataset using the spice dataset configure command:
Enter a dataset name that will be used to reference the dataset in queries. This name does not need to match the name in the dataset source.
Enter the description of the dataset:
description: Taxi trips dataset
Enter the location of the dataset:
Select y when prompted whether to accelerate the data:
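The location entered in this step points at a dataset inside a Spice.ai app. As a sketch, assuming the Spice.ai Data Connector addresses datasets as `spice.ai/<org>/<app>/datasets/<dataset>` (an assumption based on the app URL mentioned earlier, so check it against the connector documentation), a hypothetical helper can build the value:

```shell
# Hypothetical helper: build a dataset location for the Spice.ai Data Connector.
# Assumes the format spice.ai/<org>/<app>/datasets/<dataset>.
dataset_location() {
  printf 'spice.ai/%s/%s/datasets/%s\n' "$1" "$2" "$3"
}

# Org, app, and dataset taken from the quickstart app URL and dataset name.
dataset_location spiceai quickstart taxi_trips
```

Under that assumption, the value for this quickstart would be `spice.ai/spiceai/quickstart/datasets/taxi_trips`.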
You should see the following output from your runtime terminal:
Step 5. In a new terminal window, use the Spice SQL REPL to query the dataset:
The output displays the results of the query along with the query execution time:
To compare query times against a non-accelerated dataset, change the acceleration setting from true to false in the datasets.yaml file and run the query again.
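One way to flip the setting from the command line is sketched below. It assumes the setting appears as an `enabled: true` line nested under an `acceleration:` key, which is an assumption about the file layout. `disable_acceleration` is a hypothetical helper, and the file path is passed in as an argument rather than guessed.

```shell
# Hypothetical helper: change `enabled: true` to `enabled: false` within the
# `acceleration:` block of a YAML file.
# Assumes an `enabled:` line follows each `acceleration:` key.
disable_acceleration() {
  file="$1"
  # The sed range runs from an `acceleration:` line to the next `enabled:` line.
  sed '/^[[:space:]]*acceleration:/,/enabled:/ s/enabled:[[:space:]]*true/enabled: false/' \
    "$file" > "$file.tmp" && mv "$file.tmp" "$file"
}
```

Pass the path of the file that holds the dataset definition, then rerun the same query and compare the reported execution time. Editing the file by hand works equally well; the helper only saves a step when switching back and forth.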
The output displays the average gas used: