title: 'File Data Connector Deployment Guide' sidebar_label: 'Deployment Guide' description: 'Operating guide for the File data connector in production: permissions, formats, performance, and observability.' sidebar_position: 10 pagination_prev: null pagination_next: null tags:
Production operating guide for the File data connector (reading files from the local or mounted filesystem).
The File connector has no authentication layer. Access control is enforced by the operating system:
ConfigMap / Secret / PersistentVolumeClaim mounts over host paths.For secrets embedded in data files (credentials, tokens), encrypt at rest and restrict filesystem ACLs to the Spice process user.
The File connector reads local files synchronously; there is no network layer, retry, or concurrency semaphore. Failures are filesystem errors (ENOENT, EACCES, EIO) and surface directly to the caller. Filesystem issues (e.g., an NFS mount going stale) must be handled at the infrastructure layer.
For hot-reloading of updated data files, accelerate the dataset and configure a refresh_interval — the connector re-reads the file on each refresh.
hive_partitioning_enabled: true when reading partitioned directories to prune at plan time.See File Formats for format-specific parameters. Choose based on access pattern:
has_header and delimiter explicitly.The File connector does not register connector-specific instruments. Monitor via Spice's query execution metrics (query_duration_ms, query_processed_rows). See Component Metrics for general configuration.
For filesystem-level issues (disk utilization, IOPS), use the underlying OS metrics (Prometheus node_exporter, CloudWatch agent, etc.).
File reads participate in task history through DataFusion's execution-plan spans. Listings, opens, and reads are attributed to the enclosing sql_query or accelerated_table_refresh task.
refresh_interval on an accelerated dataset to pick up changes.file:// paths in a spicepod are non-portable across environments; parameterize via env vars or use network-mounted paths with consistent mount points.| Symptom | Likely cause | Resolution |
|---|---|---|
No such file or directory | Path typo, wrong working directory, or missing mount. | Verify the file exists from the Spice process context (ls inside the container). |
Permission denied | Spice process user lacks read permission. | Adjust file ACLs or mount with appropriate UID/GID. |
| Schema inference is slow for JSON | Large file with sparse fields sampled. | Provide an explicit schema, or sample fewer records. |
| Planning time dominates for glob patterns | Very large directory listings. | Prune with Hive partitioning or break the dataset into narrower prefixes. |
| Query returns old data after file was replaced | No file watch; Spice sees cached schema. | Set refresh_interval on an accelerated dataset, or restart the runtime. |