Azure ADLS as Storage for Data Products

Category: Data Platform Platform: Databricks, Azure Synapse Analytics, Generic Data Lake

Context

We use Databricks as our data platform (databricks-as-data-platform).

Data product producers need to store data products, so that other domains can easily access and query the data.

Ease of use, performance, and egress-costs are to be considered.

Decision

We store the data of data products as files on Azure Data Lake Storage Gen2.

We use the same Azure region for all data products (Germany West Central) in the same VPC to avoid egress costs.

Consequences

Considered Alternatives

Automation