Delta Lake Data Connector
Query/accelerate Delta Lake tables in Spice.
datasets:
# Example for local Delta Lake
- from: delta_lake:/path/to/local/delta/table # A local filesystem path to a Delta Lake table
name: my_delta_lake_table
# Example for Delta Lake on S3
- from: delta_lake:s3://my_bucket/path/to/s3/delta/table/ # A reference to a table in S3
name: my_delta_lake_table
params:
delta_lake_aws_access_key_id: ${secrets:aws_access_key_id}
delta_lake_aws_secret_access_key: ${secrets:aws_secret_access_key}
# Example for Delta Lake on Azure Blob
- from: delta_lake:abfss://my_container@my_account.dfs.core.windows.net/path/to/azure/delta/table/ # A reference to a table in Azure Blob
name: my_delta_lake_table
params:
delta_lake_azure_storage_account_name: my_account
delta_lake_azure_storage_account_key: ${secrets:my_key}
Configuration​
- Delta Lake + Local
- Delta Lake + S3
- Delta Lake + Azure Blob
- Delta Lake + Google Storage
- from: delta_lake:/path/to/local/delta/table # A local filesystem path to a Delta Lake table
name: my_delta_lake_table
- from: delta_lake:s3://my_bucket/path/to/s3/delta/table/ # A reference to a table in S3
name: my_delta_lake_table
params:
delta_lake_aws_region: us-west-2 # Optional
delta_lake_aws_access_key_id: ${secrets:aws_access_key_id}
delta_lake_aws_secret_access_key: ${secrets:aws_secret_access_key}
delta_lake_aws_endpoint: s3.us-west-2.amazonaws.com # Optional
- from: delta_lake:abfss://my_container@my_account.dfs.core.windows.net/path/to/azure/delta/table/ # A reference to a table in Azure Blob
name: my_delta_lake_table
params:
# Account Name + Key
delta_lake_azure_storage_account_name: my_account
delta_lake_azure_storage_account_key: ${secrets:my_key}
# OR Service Principal + Secret
delta_lake_azure_storage_client_id: my_client_id
delta_lake_azure_storage_client_secret: ${secrets:my_secret}
# OR SAS Key
delta_lake_azure_storage_sas_key: my_sas_key
params:
delta_lake_google_service_account_path: /path/to/service-account.json
Delta Lake object store parameters​
Use the secret replacement syntax to reference a secret, e.g. ${secrets:aws_access_key_id}
.
client_timeout
: Optional. Specifies timeout for object store operations. Default value is30s
E.g.client_timeout: 60s
AWS S3​
delta_lake_aws_region
: Optional. The AWS region for the S3 object store. E.g.us-west-2
.delta_lake_aws_access_key_id
: The access key ID for the S3 object store.delta_lake_aws_secret_access_key
: The secret access key for the S3 object store.delta_lake_aws_endpoint
: Optional. The endpoint for the S3 object store. E.g.s3.us-west-2.amazonaws.com
.
Azure Blob​
Note
One of the following auth values must be provided for Azure Blob:
delta_lake_azure_storage_account_key
,delta_lake_azure_storage_client_id
andazure_storage_client_secret
, ordelta_lake_azure_storage_sas_key
.
delta_lake_azure_storage_account_name
: The Azure Storage account name.delta_lake_azure_storage_account_key
: The Azure Storage master key for accessing the storage account.delta_lake_azure_storage_client_id
: The service principal client id for accessing the storage account.delta_lake_azure_storage_client_secret
: The service principal client secret for accessing the storage account.delta_lake_azure_storage_sas_key
: The shared access signature key for accessing the storage account.delta_lake_azure_storage_endpoint
: Optional. The endpoint for the Azure Blob storage account.
Google Storage (GCS)​
google_service_account
: Filesystem path to the Google service account JSON key file.