Skip to main content

Dremio Data Connector

Dremio is a data lake engine that enables high-performance SQL queries directly on data lake storage. It provides a unified interface for querying and analyzing data from various sources without the need for complex data movement or transformation.

This connector enables using Dremio as a data source for federated/accelerated SQL queries.

- from: dremio:datasets.dremio_dataset
name: dremio_dataset
params:
dremio_endpoint: grpc://127.0.0.1:32010
dremio_username: demo
dremio_password: ${secrets:my_dremio_pass}

Configuration​

from​

The from field takes the form dremio:dataset where dataset is the fully qualified name of the dataset to read from.

[Limitations]

Currently, only up to three levels of nesting are supported for dataset names (e.g., a.b.c). Additional levels are not supported at this time.

name​

The dataset name. This will be used as the table name within Spice.

Example:

datasets:
- from: dremio:datasets.dremio_dataset
name: cool_dataset
params:
...
SELECT COUNT(*) FROM cool_dataset;
+----------+
| count(*) |
+----------+
| 6001215 |
+----------+

params​

Parameter NameDescription
dremio_endpointThe endpoint used to connect to the Dremio server.
dremio_usernameThe username to connect with.
dremio_passwordThe password to connect with. Use the secret replacement syntax to load the password from a secret store, e.g. ${secrets:my_dremio_pass}.

Examples​

Connecting to a GRPC endpoint​

- from: dremio:datasets.dremio_dataset
name: dremio_dataset
params:
dremio_endpoint: grpc://127.0.0.1:32010
dremio_username: demo
dremio_password: ${secrets:my_dremio_pass}

Secrets​

Spice integrates with multiple secret stores to help manage sensitive data securely. For detailed information on supported secret stores, refer to the secret stores documentation. Additionally, learn how to use referenced secrets in component parameters by visiting the using referenced secrets guide.