Where we read.
Every signal you see in the brief traces back to one of these public sources: currently 40 streaming inputs across five streams, plus 5 hand-curated catalogs informed by 16 more sources. No private sources, no scraped paywalls, nothing fabricated. The last-seen timestamps below update with the daily pipeline.
News feeds
3 RSS feeds from the platform team and the open-source projects most teams pair with Databricks.
GitHub repositories
12 repos whose releases are newsworthy for Databricks practitioners — the platform CLI, the SDKs, Terraform, the OSS projects (Delta, MLflow, Unity Catalog), and the dbt adapter.
YouTube channels
10 channels — the official Databricks channel plus a curated list of independent creators we track for tutorial and deep-dive content.
Data + AI Summit archive
11 official YouTube playlists across 6 years, tagged at ingest so DAIS talks are filterable in the assistant and on /videos.
Data + AI Summit 2025
Data + AI Summit 2024
Data + AI Summit 2023
Data + AI Summit 2022
Data + AI Summit 2021
Data + AI Summit 2020
Community Q&A
4 forums where Databricks practitioners ask real questions. Indexed so the assistant can cite community discussion alongside official docs.
Editorial reference
These pages are hand-maintained, not crawled. Each entry traces back to one of the public sources listed below — refreshed when we re-curate the catalog, not on a pipeline cadence.
Startups
84 entries. Companies in the Databricks ecosystem: Built on / Built for / Adjacent. Curated from these public sources; each entry cites its evidence.
Certifications
18 entries. Every public Databricks certification and accreditation, with cost, difficulty, study hours, and active vouchers. Drawn from the official learning surfaces.
Books
15 entries. Reading list for the Databricks ecosystem: Delta, Iceberg, Apache Spark, foundational data engineering. Each title links to the publisher's canonical page.
Events
3 entries. Upcoming Databricks community events: Summit, regional roadshows, meetups. Drawn from the official events calendar and the relevant conference sites.
Roadmap
80 entries. Synthesized from Databricks Blog posts via Gemini-driven feature-status extraction plus manual editorial review. The upstream source is listed under News feeds above; this entry credits the derivation step.
Missing a source you trust?
The list is curated, not crawled — we read every source ourselves before adding it. Tell us what we're missing and we'll evaluate it for the next batch.