The Databricks reading list.
Hand-picked books that actually move the needle when you're learning, levelling up, or shipping on Databricks. Not affiliated with any of these authors or publishers.

Modern Data Lakehouse Architectures
Denny Lee, Tristen Wentling, Scott Haines, Prashanth Babu V. · O'Reilly · 2024

Data Lakehouse Functionality, Performance, and Scalability on the Data Lake
Tomer Shiran, Jason Hughes, Alex Merced · O'Reilly · 2024

Modern Data Lakehouse Architectures with Delta Lake
Bennie Haelen, Dan Davis · O'Reilly · 2023

Build effective data and AI solutions using Apache Spark, Databricks, and Delta Lake
Pulkit Chadha · Packt · 2024

Seamlessly transition ML models and MLOps on Databricks
Debu Sinha · Packt · 2023

Create scalable pipelines that ingest, curate, and aggregate complex data in a timely and secure way
Manoj Kukreja · Packt · 2021

Lightning-Fast Data Analytics
Jules S. Damji, Brooke Wenig, Tathagata Das, Denny Lee · O'Reilly · 2020

Big Data Processing Made Simple
Bill Chambers, Matei Zaharia · O'Reilly · 2018

A Hands-On Guide for Building Mission-Critical Streaming Applications
Scott Haines · Apress · 2022

Covers Apache Spark 3 with Examples in Java, Python, and Scala
Jean-Georges Perrin · Manning · 2020

Best Practices for Scaling and Optimizing Apache Spark
Holden Karau, Rachel Warren · O'Reilly · 2017

Plan and Build Robust Data Systems
Joe Reis, Matt Housley · O'Reilly · 2022

The Big Ideas Behind Reliable, Scalable, and Maintainable Systems
Martin Kleppmann · O'Reilly · 2017

The What, Where, When, and How of Large-Scale Data Processing
Tyler Akidau, Slava Chernyak, Reuven Lax · O'Reilly · 2018

The Definitive Guide to Dimensional Modeling
Ralph Kimball, Margy Ross · Wiley · 2013