Documentation | Sample Data Bundles
The Lakeflow Framework is a metadata-driven framework for building Databricks Lakeflow Spark Declarative Pipelines. It uses a configuration-driven, pattern-based approach to support both batch and streaming workloads across the medallion architecture.
The framework supports centralized and domain-oriented operating models, and accommodates multiple modelling paradigms (including dimensional, Data Vault, and enterprise canonical models). It is designed for simplicity, performance, maintainability, and extensibility as the Databricks product evolves.
- Configuration-driven pattern based pipeline delivery with reusable implementation patterns
- Support for batch and streaming pipelines across Bronze/Silver/Gold, aligned to your chosen modelling pattern
- Flexible for centralized and domain-oriented operating models
git clone https://github.com/databricks-solutions/lakeflow_framework.git
cd lakeflow_framework
pip install -r requirements-dev.txtThen:
- Open the hosted docs: https://databricks-solutions.github.io/lakeflow_framework/
- Deploy the framework using the
Deploy Frameworkguide - Deploy samples from
samples/using the documentation walkthroughs - Build your first pipeline bundle using the
Build a Pipeline Bundleguide
- Access to a Databricks workspace
- Databricks CLI installed and configured
- Python environment with project dependencies installed
- Familiarity with Databricks Lakeflow Spark Declarative Pipelines concepts
docs/- Sphinx documentation and versioned docs build toolingsamples/- example framework and pipeline bundlessrc/- framework source code and runtime components
This project tracks Databricks Lakeflow Spark Declarative Pipelines capabilities and evolves with platform changes. Validate runtime, feature, and API compatibility against your target Databricks workspace and the latest project documentation before production rollout.
The framework is actively maintained. Databricks support does not cover this repository; issue support is best effort through GitHub issues.
- Releases: https://github.com/databricks-solutions/lakeflow_framework/releases
- Tags: https://github.com/databricks-solutions/lakeflow_framework/tags
Please refer to the documentation for further details and an explanation of the samples. The documentation needs to be deployed as HTML or Markdown within your org before it can be used.
pip install -r requirements-docs.txt
make -C docs htmlDatabricks support doesn't cover this content. For questions or bugs, please open a GitHub issue and the team will help on a best effort basis.
© 2025 Databricks, Inc. All rights reserved. The source in this notebook is provided subject to the Databricks License [https://databricks.com/db-license-source]. All included or referenced third party libraries are subject to the licenses set forth below.