Lineage
Recent items mentioning Lineage across the Databricks ecosystem — releases, news, videos, and community Q&A. Updated hourly.
Recent Databricks content highlights automated lineage tracking as a key component of modern data governance. Unity Catalog now provides automated lineage tracking for data and AI assets, enabling context-aware search and data protection 1. This aligns with a complete data governance architecture blueprint emphasizing automated lineage, RBAC, and federated models for data quality and compliance at scale 2.
Generated daily from the 3 most recent items mentioning Lineage. Click any [N] to jump to the source.
TutorialsDatabricks Unity Catalog: The Safe Way to Govern AI
Databricks Unity Catalog provides a single governance layer for data and AI assets, enabling discovery, classification, protection, and certification of data. It demonstrates how to use Unity Catalog for context-aware search, automated lineage tracking, tagging sensitive data with govern and non-govern tags, and applying column masking for data protection.
Data Governance Architecture: A Complete Blueprint for Modern Organizations
This blueprint details a complete data governance architecture, outlining the policies, roles, and technologies needed to manage data assets. It emphasizes a modern strategy combining automated lineage, RBAC, and federated models to ensure data quality and regulatory compliance at scale.
TutorialsThe Future of Finance Operations Starts Here
The video demonstrates how Databricks' financial lakehouse solution addresses common finance data challenges like fragmentation and slow analysis. It showcases features like Unity Catalog for data governance, Lake Flow for pipeline management, and Genie Spaces for natural language querying of financial data.
AI Applications: Tools, Use Cases, and Platforms
AI applications span four capability tiers, each with distinct data requirements and evaluation frameworks, and enterprise deployments often stall due to inadequate data infrastructure. Production-grade model development, from prompt engineering to pretraining, is increasingly accessible with open-source LLMs, but requires pre-built governance and monitoring infrastructure for successful deployment at scale.
How AI improves data lineage at scale
Discover how AI accelerates data lineage with automated docs, testing, and scalable governance.
TutorialsHow to use Recursive CTEs in Databricks
The video demonstrates how to use recursive CTEs in Databricks to traverse hierarchical data structures of unknown depth, such as data lineage or organizational charts. It shows how to write a recursive CTE in SQL, highlighting the `RECURSIVE` keyword and the union of an anchor member and a recursive member.
NewsFrom Raw Data to Real-Time Retention: Powering Customer Health Scores on Databricks
Tutorials










