Skip to main content

Delta Lake Data Connector

Query/accelerate Delta Lake tables in Spice.

datasets:
# Example for local Delta Lake
- from: delta_lake:/path/to/local/delta/table # A local filesystem path to a Delta Lake table
name: my_delta_lake_table
# Example for Delta Lake on S3
- from: delta_lake:s3://my_bucket/path/to/s3/delta/table/ # A reference to a table in S3
name: my_delta_lake_table
params:
delta_lake_aws_access_key_id: ${secrets:aws_access_key_id}
delta_lake_aws_secret_access_key: ${secrets:aws_secret_access_key}
# Example for Delta Lake on Azure Blob
- from: delta_lake:abfss://my_container@my_account.dfs.core.windows.net/path/to/azure/delta/table/ # A reference to a table in Azure Blob
name: my_delta_lake_table
params:
delta_lake_azure_storage_account_name: my_account
delta_lake_azure_storage_account_key: ${secrets:my_key}

Configuration​

- from: delta_lake:/path/to/local/delta/table  # A local filesystem path to a Delta Lake table
name: my_delta_lake_table

Delta Lake object store parameters​

Use the secret replacement syntax to reference a secret, e.g. ${secrets:aws_access_key_id}.

  • client_timeout: Optional. Specifies timeout for object store operations. Default value is 30s E.g. client_timeout: 60s

AWS S3​

  • delta_lake_aws_region: Optional. The AWS region for the S3 object store. E.g. us-west-2.
  • delta_lake_aws_access_key_id: The access key ID for the S3 object store.
  • delta_lake_aws_secret_access_key: The secret access key for the S3 object store.
  • delta_lake_aws_endpoint: Optional. The endpoint for the S3 object store. E.g. s3.us-west-2.amazonaws.com.

Azure Blob​

Note

One of the following auth values must be provided for Azure Blob:

  • delta_lake_azure_storage_account_key,
  • delta_lake_azure_storage_client_id and azure_storage_client_secret, or
  • delta_lake_azure_storage_sas_key.
  • delta_lake_azure_storage_account_name: The Azure Storage account name.
  • delta_lake_azure_storage_account_key: The Azure Storage master key for accessing the storage account.
  • delta_lake_azure_storage_client_id: The service principal client id for accessing the storage account.
  • delta_lake_azure_storage_client_secret: The service principal client secret for accessing the storage account.
  • delta_lake_azure_storage_sas_key: The shared access signature key for accessing the storage account.
  • delta_lake_azure_storage_endpoint: Optional. The endpoint for the Azure Blob storage account.

Google Storage (GCS)​

  • google_service_account: Filesystem path to the Google service account JSON key file.