Explore the TPC-H Benchmark dataset using Spice
The TPC Benchmark™ H (TPC-H) is widely used to evaluate the analytic query capabilities of databases.
The components of TPC-H consist of eight separate and individual tables (the Base Tables). The relationships between columns in these tables are illustrated in the following ER diagram (source: TPC Benchmark H Standard Specification):
Step 1. Initialize and start Spice
Step 2. Connect the TPC-H Benchmark pod
The following output is shown in the Spice runtime terminal:
Step 3. Run queries against the dataset using the Spice SQL REPL.
In a new terminal, start the Spice SQL REPL.
Check that TPC-H tables exist:
Run Pricing Summary Report Query (Q1). More information about TPC-H and all the queries involved can be found in the official TPC Benchmark H Standard Specification.
Step4 (Optional) Enable Data Acceleration for TPC-H Benchmark Sample Data
Use text editor to open ./spicepods/spiceai/tpch/spicepod.yaml
file and enable acceleration
flags for each table. Save.
Before:
- from: s3://spiceai-demo-datasets/tpch/customer/ name: customer acceleration: enabled: false
After:
- from: s3://spiceai-demo-datasets/tpch/customer/ name: customer acceleration: enabled: true
Run Pricing Summary Report Query using the Spice SQL REPL again.
Observe query execution time decreased from 4.178523666 to 0.108190459 seconds.