Databricks
Databricks is a platform for big data analytics and artificial intelligence.
ingestr supports Databricks as both a source and destination.
URI format
Access Token Authentication
The traditional URI format for Databricks using an access token:
databricks://token:<access_token>@<server_hostname>?http_path=<http_path>&catalog=<catalog>&schema=<schema>
URI parameters:
- access_token: the access token used to connect to the Databricks instance
- server_hostname: the hostname of the Databricks instance
- http_path: the HTTP path of the Databricks compute resource (SQL warehouse or cluster)
- catalog: the catalog to connect to
- schema: the schema to connect to
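Assembling this URI by hand is error-prone when a value contains reserved characters. The sketch below shows one way to build it with Python's standard library. The helper name `databricks_token_uri` and all example values are hypothetical, and the sketch assumes that ingestr decodes percent-encoded values the way standard URL parsers do.

```python
from urllib.parse import quote, urlencode


def databricks_token_uri(access_token, server_hostname, http_path, catalog, schema):
    """Build a token-based Databricks URI (hypothetical helper, not part of ingestr)."""
    # Keep "/" unescaped so http_path stays readable; everything else that is
    # reserved in a query string gets percent-encoded.
    query = urlencode(
        {"http_path": http_path, "catalog": catalog, "schema": schema},
        safe="/",
    )
    # Percent-encode the token in case it contains characters such as "@" or ":".
    return f"databricks://token:{quote(access_token, safe='')}@{server_hostname}?{query}"


# Placeholder values for illustration only.
uri = databricks_token_uri(
    access_token="dapi123",
    server_hostname="dbc-example.cloud.databricks.com",
    http_path="/sql/1.0/warehouses/abc123",
    catalog="main",
    schema="default",
)
print(uri)
```

The resulting string can be passed wherever a Databricks source or destination URI is expected.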
OAuth M2M Authentication (Service Principal)
You can also authenticate using OAuth machine-to-machine (M2M) credentials with a service principal's client ID and client secret:
databricks://@<server_hostname>?http_path=<http_path>&catalog=<catalog>&schema=<schema>&client_id=<client_id>&client_secret=<client_secret>
URI parameters:
- server_hostname: the hostname of the Databricks instance
- http_path: the HTTP path of the Databricks compute resource (SQL warehouse or cluster)
- catalog: the catalog to connect to
- schema: the schema to connect to
- client_id: the service principal's client ID (application ID)
- client_secret: the OAuth secret for the service principal
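OAuth secrets often contain characters such as `&` or `=` that would break a hand-written query string. As a minimal sketch, the OAuth M2M form of the URI can be built and checked like this. The helper name `databricks_oauth_uri` and every example value are hypothetical, and the sketch assumes that ingestr decodes percent-encoded query values as standard URL parsers do.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def databricks_oauth_uri(server_hostname, http_path, catalog, schema,
                         client_id, client_secret):
    """Build an OAuth M2M Databricks URI (hypothetical helper, not part of ingestr)."""
    query = urlencode(
        {
            "http_path": http_path,
            "catalog": catalog,
            "schema": schema,
            "client_id": client_id,
            "client_secret": client_secret,
        },
        safe="/",  # leave "/" in http_path unescaped; encode other reserved characters
    )
    # No user or password before "@": the credentials travel in the query string.
    return f"databricks://@{server_hostname}?{query}"


# Placeholder values for illustration only.
oauth_uri = databricks_oauth_uri(
    server_hostname="dbc-example.cloud.databricks.com",
    http_path="/sql/1.0/warehouses/abc123",
    catalog="main",
    schema="default",
    client_id="my-client-id",
    client_secret="s3cr&t=",
)

# Round-trip check: a standard parser recovers the original secret.
params = parse_qs(urlsplit(oauth_uri).query)
print(params["client_secret"][0])
```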
To set up OAuth M2M authentication:
- Create a service principal in your Databricks workspace
- Generate an OAuth secret for the service principal
- Ensure the service principal has the necessary permissions to access your workspace resources
You can read more about OAuth M2M authentication in the Databricks documentation.
The same URI structure can be used for both sources and destinations. You can read more about SQLAlchemy's Databricks dialect in its documentation.