brickster.ai

Databricks-ecosystem startups

Companies building on, for, or around Databricks. Curated from the Databricks Ventures portfolio, Partner Directory, AI Accelerator cohort, and the 2025 Partner Awards. This is an independent reference, not affiliated with Databricks; ask us to add or remove a company.

72 active · 36 built for · 14 built on · 22 adjacent · 12 acquired

Showing 72 of 72 startups
  • Alation

    Series D+ · $340M raised

    Enterprise data intelligence and catalog platform with active metadata and agentic AI capabilities.

    Data catalog and governance platform with Unity Catalog integrations. Listed in the Databricks Ventures portfolio.

    2012 · Redwood City, USA · 501+
    Ecosystem-adjacent · Catalog · Databricks Ventures portfolio; Technology Partner
  • dbt Labs

    Series D+ · $416M raised

    SQL-based data transformation framework that has become the standard for analytics engineering.

    SQL-based transformation framework that runs as a first-class citizen on Databricks. Both a Databricks Ventures portfolio company and a 2025 Partner Award winner. Now merging with Fivetran.

    2016 · Philadelphia, USA · 501+
    Ecosystem-adjacent · Dev Tooling · Databricks Ventures portfolio; Data Integration Partner of the Year 2025
  • Glean

    Series D+ · $600M raised

    AI-powered enterprise knowledge assistant that searches and reasons across company applications.

    Enterprise AI search and agents platform. Part of the Databricks Ventures portfolio; valued at $7.2B in its 2025 Series F.

    2019 · Palo Alto, USA · 501+
    Built on Databricks · Vertical AI · Databricks Ventures portfolio
  • Hightouch

    Series D+ · $322M raised

    Composable customer data platform and reverse-ETL that activates lakehouse data into business tools.

    Reverse-ETL pioneer that activates Databricks data into 200+ business tools. Both a Databricks Ventures portfolio company and a 2025 Partner Award winner.

    2018 · San Francisco, USA · 201-500
    Ecosystem-adjacent · Reverse-ETL · Databricks Ventures portfolio; Retail and CG Data Partner of the Year 2025
  • Immuta

    Series D+ · $267M raised

    Data security platform providing access control, policy enforcement and audit on cloud data platforms.

    Data access control and policy automation that integrates deeply with Databricks Unity Catalog. Part of the Databricks Ventures portfolio.

    2015 · Boston, USA · 201-500
    Built for Databricks · Security · Databricks Ventures portfolio; Technology Partner
  • Labelbox

    Series D+ · $188M raised

    Training data and human-in-the-loop platform for building ML and frontier-model datasets.

    Training data platform with native Databricks integration. Part of the Databricks Ventures portfolio.

    2018 · San Francisco, USA · 201-500
    Built for Databricks · ML Ops · Databricks Ventures portfolio; Technology Partner
  • Matillion

    Series D+ · $310M raised

    Data productivity cloud with low-code ETL pipelines and orchestration for cloud data platforms.

    Low-code ETL with native Databricks integration. Part of the Databricks Ventures portfolio.

    2010 · Manchester, UK · 201-500
    Ecosystem-adjacent · Dev Tooling · Databricks Ventures portfolio; Technology Partner
  • Perplexity

    Series D+ · $1.5B raised

    AI-powered conversational search engine that synthesizes web answers with cited sources.

    Consumer AI answer engine. Part of the Databricks Ventures portfolio; valued at $20B in 2025.

    2022 · San Francisco, USA · 201-500
    Built on Databricks · Vertical AI · Databricks Ventures portfolio
  • Sigma

    Series D+ · $742M raised

    Cloud BI and analytics with a spreadsheet-style interface that runs natively on cloud data platforms.

    Spreadsheet-style cloud BI on Databricks. Both a Databricks Ventures portfolio company and the 2025 BI Partner of the Year.

    2014 · San Francisco, USA · 501+
    Ecosystem-adjacent · BI · Databricks Ventures portfolio; BI Partner of the Year 2025
  • Anvilogic

    Series C · $85M raised

    AI-powered SOC platform for security detection engineering on top of data lakes including Databricks.

    Provides a detection engineering layer that runs natively on Databricks when it is used as a security data lake. Won the 2025 Databricks Growth Built on Partner of the Year award.

    2019 · Palo Alto, USA · 51-200
    Built for Databricks · Security · Databricks Ventures portfolio; 2025 Growth Built Partner of the Year
  • Hex

    Series C · $122M raised

    Collaborative analytics and data science notebook platform for teams working with SQL and Python.

    Collaborative analytics workspace with deep Databricks integration. Part of the Databricks Ventures portfolio.

    2019 · San Francisco, USA · 51-200
    Ecosystem-adjacent · BI · Databricks Ventures portfolio
  • Hunters

    Series C · $118M raised

    Modern SOC platform that runs detection and response on a customer's Databricks security data lake.

    Modern SIEM alternative that ingests security data directly into a customer's Databricks lakehouse. First security partner in the open security lakehouse ecosystem.

    2018 · Tel Aviv, Israel · 51-200
    Built for Databricks · Security · Databricks Ventures portfolio; first SOC platform built on Databricks
  • Mistral AI

    Series C · $2.3B raised

    European generative AI lab building frontier open-weight LLMs and enterprise AI products.

    European frontier LLM lab whose models are served via Databricks Mosaic AI Model Serving. Part of the Databricks Ventures portfolio.

    2023 · Paris, France · 201-500
    Built for Databricks · Vertical AI · Databricks Ventures portfolio; models hosted on Mosaic AI
  • Anomalo

    Series B · $82M raised

    Automated data quality monitoring using ML to detect anomalies in data warehouses and lakehouses.

    Builds automated data quality monitoring with native Databricks integration. Part of the Databricks Ventures portfolio.

    2018 · Palo Alto, USA · 51-200
    Built for Databricks · Data Quality · Databricks Ventures portfolio
  • Cube

    Series B

    Universal semantic layer that delivers consistent metrics across BI tools and AI applications.

    Open-source semantic layer that runs on Databricks SQL warehouses. Part of the Databricks Ventures portfolio.

    2019 · San Francisco, USA · 51-200
    Built for Databricks · Semantic Layer · Databricks Ventures portfolio
  • Galileo

    Series B · $68M raised

    GenAI evaluation and observability platform for measuring LLM and agent quality in production.

    Evaluation intelligence for AI teams. Databricks Ventures led participation in the Series B; later acquired by Cisco.

    2021 · San Francisco, USA · 51-200
    Built for Databricks · Observability · Databricks Ventures portfolio (acquired by Cisco)
  • LangChain

    Series B · $160M raised

    Open-source agent framework and observability platform (LangSmith) for building LLM applications.

    Agent framework and LLM observability with native Databricks Mosaic AI integration. Part of the Databricks Ventures portfolio; unicorn at $1.25B.

    2022 · San Francisco, USA · 51-200
    Built for Databricks · ML Ops · Databricks Ventures portfolio; first-class integration with Mosaic AI
  • Lovable

    Series B · $552M raised

    AI software creation platform that turns natural-language prompts into shippable web applications.

    Vibe-coding platform for building apps from natural language. Part of the Databricks Ventures portfolio; valued at $6.6B in late 2025.

    2023 · Stockholm, Sweden · 51-200
    Built on Databricks · Dev Tooling · Databricks Ventures portfolio
  • Noma Security

    Series B · $132M raised

    AI security platform for governing models, agents and third-party AI applications across the enterprise.

    AI model and agent security platform with native Databricks integration. Part of the Databricks Ventures portfolio.

    2023 · Palo Alto, USA · 51-200
    Built for Databricks · Security · Databricks Ventures portfolio
  • Omni

    Series B · $95M raised

    Business intelligence platform combining a semantic model with self-serve exploration for analysts.

    BI platform from ex-Looker and Stitch leaders. Part of the Databricks Ventures portfolio.

    2022 · San Francisco, USA · 51-200
    Ecosystem-adjacent · BI · Databricks Ventures portfolio
  • Prophecy

    Series B · $78M raised

    Low-code data engineering and self-serve transformation platform optimized for Databricks SQL.

    Visual data engineering and AI-driven data prep, deeply integrated with Databricks Lakeflow and SQL. Part of the Databricks Ventures portfolio.

    2017 · San Ramon, USA · 51-200
    Built for Databricks · Dev Tooling · Databricks Ventures portfolio
  • Replit

    Series B

    Browser-based software creation platform that lets anyone build and deploy apps using natural language.

    AI software creation platform. Part of the Databricks Ventures portfolio.

    2016 · Foster City, USA · 51-200
    Built on Databricks · Dev Tooling · Databricks Ventures portfolio
  • Snowplow

    Series B · $60M raised

    Behavioral data platform that captures rich event streams and lands them in lakehouses.

    Behavioral data pipeline that lands AI-ready customer events into Databricks. Part of the Databricks Ventures portfolio.

    2012 · London, UK · 51-200
    Ecosystem-adjacent · Streaming · Databricks Ventures portfolio; Technology Partner
  • SuperAnnotate

    Series B · $75M raised

    AI pipeline platform for building high-quality multimodal datasets for ML and LLM training.

    Training data platform that won the 2025 Databricks Customer Impact Partner of the Year award.

    2018 · San Francisco, USA · 201-500
    Built for Databricks · ML Ops · Databricks Ventures portfolio; 2025 Customer Impact Partner of the Year
  • Unstructured

    Series B · $65M raised

    Data transformation tools that turn unstructured documents into LLM-ready structured data.

    Unstructured-data ETL for LLMs and RAG. Part of the Databricks Ventures portfolio.

    2022 · San Francisco, USA · 51-200
    Built for Databricks · Dev Tooling · Databricks Ventures portfolio
  • Cleanlab

    Series A · $30M raised

    Automated data curation and quality platform that fixes label errors and trustworthiness issues for AI.

    Data quality platform for ML and LLM training data. Part of the Databricks Ventures portfolio; later acquired by Handshake.

    2021 · San Francisco, USA · 11-50
    Built for Databricks · Data Quality · Databricks Ventures portfolio (acquired by Handshake)
  • Gable

    Series A · $20M raised

    Data contracts and governance platform to manage source-data changes before they break downstream pipelines.

    Data contracts platform that powers the 'shift-left' movement. Part of the Databricks Ventures portfolio.

    2023 · Seattle, USA · 11-50
    Built for Databricks · Governance · Databricks Ventures portfolio
  • LanceDB

    Series A · $38M raised

    AI-native multimodal data lakehouse with vector indexing on top of an open columnar format.

    Multimodal lakehouse purpose-built for AI workloads. Part of the Databricks Ventures portfolio.

    2022 · San Francisco, USA · 11-50
    Ecosystem-adjacent · Other · Databricks Ventures portfolio
  • LlamaIndex

    Series A · $28M raised

    Framework and cloud platform for building data-backed agentic LLM applications over unstructured data.

    Open-source framework and managed cloud for RAG and agentic AI. Part of the Databricks Ventures portfolio.

    2023 · San Francisco, USA · 11-50
    Built for Databricks · ML Ops · Databricks Ventures portfolio
  • OpenRouter

    Series A

    Unified API and marketplace for accessing hundreds of large language models with usage billing.

    Multi-model LLM router and marketplace. Part of the Databricks Ventures portfolio.

    2023 · New York, USA · 11-50
    Built for Databricks · ML Ops · Databricks Ventures portfolio
  • Twelve Labs

    Series A · $107M raised

    Video understanding foundation models for search, classification and content generation.

    Video understanding foundation models. Part of the Databricks Ventures portfolio; partners with LanceDB for multimodal RAG.

    2021 · San Francisco, USA · 51-200
    Built for Databricks · Vertical AI · Databricks Ventures portfolio
  • Pomo

    Seed · $4.5M raised

    Agentic marketing intelligence platform for mid-market companies, built on Databricks.

    Marketing intelligence agents from ex-Google DeepMind and Databricks engineers. Databricks Ventures led the seed round alongside Kindred Ventures.

    2024 · San Francisco, USA · 1-10
    Built on Databricks · Vertical AI · Databricks Ventures portfolio; founded by ex-Databricks engineers
  • Data and AI services partner specializing in Databricks implementation across APJ enterprises.

    Databricks-focused data and AI implementation partner. A Databricks Ventures portfolio company and the 2025 APJ SI Partner of the Year.

    2016 · Jaipur, India · 501+
    Ecosystem-adjacent · Other · Databricks Ventures portfolio; 2025 APJ SI Partner of the Year
  • AI co-worker for revenue teams that automates customer management workflows.

    AI co-worker for customer management built for revenue teams. Listed in the Databricks Ventures portfolio.

    11-50
    Built on Databricks · Vertical AI · Databricks Ventures portfolio
  • Builds diffusion-based large language models for faster, more efficient text generation.

    Diffusion-based LLM research lab. Part of the Databricks Ventures portfolio.

    11-50
    Built for Databricks · Vertical AI · Databricks Ventures portfolio
  • Generates large-scale single-cell AI datasets and Virtual Cell Models for drug discovery.

    Biology AI startup generating proprietary single-cell datasets. Part of the Databricks Ventures portfolio.

    2023 · South San Francisco, USA · 11-50
    Built on Databricks · Vertical AI · Databricks Ventures portfolio
  • Data and AI operations security platform for sensitive data discovery, posture and runtime protection.

    Data and AI security posture management on Databricks. Part of the Databricks Ventures portfolio.

    11-50
    Built for Databricks · Security · Databricks Ventures portfolio
  • Deep Sync

    Stage unknown

    Identity and data enrichment platform that operates on top of customer Databricks environments.

    Identity and audience data enrichment built on Databricks. Co-recipient of the 2025 Data Built Partner of the Year award.

    2020 · Tulsa, USA · 51-200
    Ecosystem-adjacent · Other · 2025 Data Built Partner of the Year (with T-Mobile Advertising Solutions)
  • Abnormal Security

    Series D+ · $546M raised

    AI-native email security platform protecting enterprises from advanced socially-engineered attacks.

    Email security at scale, built on Databricks. Featured in the original Built on Databricks announcement.

    2018 · San Francisco, USA · 501+
    Built on Databricks · Security · Built on Databricks Partner
  • Anthropic

    Series D+ · $18B raised

    AI safety company building the Claude family of frontier large language models.

    Frontier model lab whose Claude models are served on Databricks Mosaic AI. Won the 2025 AI Visionary award.

    2021 · San Francisco, USA · 501+
    Built for Databricks · Vertical AI · AI Visionary Partner of the Year 2025; Claude models available on Mosaic AI
  • Fivetran

    Series D+ · $728M raised

    Automated data movement platform with hundreds of pre-built connectors into cloud lakehouses.

    Managed ELT into Databricks lakehouses. 2025 Databricks Data Integration Partner of the Year; merging with dbt Labs.

    2012 · Oakland, USA · 501+
    Ecosystem-adjacent · Dev Tooling · 2025 Data Integration Partner of the Year; Technology Partner
  • Monte Carlo

    Series D+ · $236M raised

    Data observability platform that monitors freshness, volume and quality of data and AI pipelines.

    Data and AI observability with deep Databricks integration. Won the 2025 Databricks Data Governance Partner of the Year award.

    2019 · San Francisco, USA · 201-500
    Built for Databricks · Observability · 2025 Data Governance Partner of the Year; native Databricks integration
  • People.ai

    Series D+ · $200M raised

    Revenue intelligence platform that uses AI to surface deal insights for enterprise sales teams.

    Revenue intelligence on Databricks. Featured in the Built on Databricks program.

    2016 · San Francisco, USA · 201-500
    Built on Databricks · Vertical AI · Built on Databricks Partner
  • Quantexa

    Series D+ · $410M raised

    Decision intelligence platform using graph analytics for KYC, AML and fraud across financial services.

    Decision intelligence platform deployed on Databricks for financial services. Won the 2025 Enterprise Built on Partner of the Year award.

    2016 · London, UK · 501+
    Built for Databricks · Vertical AI · 2025 Enterprise Built on Partner of the Year
  • YipitData

    Series D+ · $525M raised

    Alternative-data and analytics provider serving institutional investors and corporate clients.

    Alternative data analytics built on Databricks. Long-tenured Built on Databricks partner.

    2011 · New York, USA · 501+
    Built on Databricks · BI · Built on Databricks Partner
  • Retool

    Series C · $141M raised

    Low-code platform for building internal tools and AI agents on top of databases and APIs.

    Internal-tools low-code platform that connects to Databricks SQL and Mosaic AI. Won the 2025 Databricks Emerging Partner of the Year award.

    2017 · San Francisco, USA · 201-500
    Ecosystem-adjacent · Dev Tooling · 2025 Emerging Partner of the Year
  • Kubit

    Series A

    Warehouse-native product analytics built directly on Databricks and other lakehouses.

    Product analytics that uses zero-copy data sharing on Databricks instead of replicating events. Built on Databricks partner.

    2020 · San Francisco, USA · 11-50
    Built on Databricks · BI · Built on Databricks Partner
  • nOps

    Series A

    Cloud cost optimization platform that automates rightsizing and commitment management on AWS.

    Cloud cost optimization platform managing $4B+ annual spend, with its production app rebuilt on Databricks Lakebase.

    2017 · San Francisco, USA · 51-200
    Built for Databricks · FinOps · Built on Databricks Partner; rebuilt on Databricks Lakebase
  • Aithon

    Seed

    AI-native go-to-market platform for regulated financial services with a 360-degree customer graph.

    GTM intelligence built on Databricks for financial-services sellers. Built on Databricks partner with the Context360 product.

    2024 · New York, USA · 11-50
    Built on Databricks · Vertical AI · Built on Databricks Partner
  • Cloud data platform for energy asset operators that standardizes OT data for AI analytics.

    Industrial data platform for the energy sector. Validated Built on Databricks partner.

    2023 · Boston, USA · 11-50
    Built on Databricks · Vertical AI · Built on Databricks Partner; Validated Technology Partner
  • Posit

    Bootstrapped

    Open-source data science platform (formerly RStudio) for R, Python and Quarto workflows.

    Maker of RStudio and Posit Workbench, with Databricks Connect integrations. Won the 2025 Databricks Developer Tools Partner of the Year award.

    2009 · Boston, USA · 201-500
    Ecosystem-adjacent · Dev Tooling · 2025 Developer Tools Partner of the Year
  • Security analytics that uses pattern recognition to reduce alert noise and false positives in real time.

    AI-native SOC tooling. Member of the Databricks AI Accelerator cohort.

    1-10
    Built for Databricks · Security · Databricks AI Accelerator Program portfolio
  • Domain-specific AI platform (Orbital) for the energy sector built on lakehouse infrastructure.

    Vertical AI for energy operators. Member of the Databricks AI Accelerator cohort.

    2023 · 11-50
    Built on Databricks · Vertical AI · Databricks AI Accelerator Program portfolio
  • AI data refinery that transforms raw consumer data into activatable marketing intelligence.

    Consumer data refinery for marketing. Member of the Databricks AI Accelerator cohort.

    11-50
    Built for Databricks · Reverse-ETL · Databricks AI Accelerator Program portfolio
  • Unified investigation platform that connects code and telemetry to debug software issues with AI.

    AI-powered investigation across code and telemetry. Member of the inaugural Databricks AI Accelerator cohort.

    2023 · 11-50
    Built for Databricks · Observability · Databricks AI Accelerator Program portfolio
  • Composable transformation layer for connecting IoT, security and operational systems with AI.

    Composable transformation for operational data. Member of the Databricks AI Accelerator cohort.

    1-10
    Built for Databricks · Dev Tooling · Databricks AI Accelerator Program portfolio
  • Data automation and intelligence platform that consolidates and acts on operational data.

    Data automation for security operations. Member of the Databricks AI Accelerator cohort.

    1-10
    Built for Databricks · Security · Databricks AI Accelerator Program portfolio
  • Starburst

    Series D+ · $414M raised

    Distributed query engine (commercial Trino) with federated analytics across lakehouses and warehouses.

    Federated query engine that interoperates with Databricks Delta and Iceberg tables.

    2017 · Boston, USA · 501+
    Ecosystem-adjacent · BI · Technology Partner; federated query over Delta/Iceberg
  • Unravel Data

    Series D+ · $108M raised

    AI-powered data observability and FinOps for Databricks, Spark, and modern data pipelines.

    Observability and cost optimization purpose-built for Databricks workloads. Partner in the Databricks Lakehouse Observability program.

    2013 · Menlo Park, USA · 11-50
    Built for Databricks · FinOps · Technology Partner; Lakehouse Observability and FinOps integration
  • Acceldata

    Series C · $106M raised

    Data observability platform monitoring data quality, pipelines and infrastructure across modern stacks.

    Multi-platform data observability with first-party Databricks integration. Databricks Technology Partner.

    2018 · Campbell, USA · 201-500
    Built for Databricks · Observability · Technology Partner; native Databricks monitoring
  • Arize AI

    Series C · $131M raised

    ML and LLM observability platform for monitoring, tracing and evaluating AI applications in production.

    ML and LLM observability that connects directly to MLflow and Databricks Mosaic AI.

    2020 · Berkeley, USA · 51-200
    Built for Databricks · Observability · Technology Partner; deep MLflow and Mosaic AI integration
  • Atlan

    Series C · $220M raised

    Active metadata and data catalog platform that connects to modern data stacks including Databricks.

    Active metadata and catalog with bidirectional Databricks sync. Databricks Technology Partner.

    2020 · New York, USA · 201-500
    Ecosystem-adjacent · Catalog · Technology Partner; deep Unity Catalog integration
  • Redpanda

    Series C · $165M raised

    Streaming data platform with a Kafka-compatible API and a single-binary, no-JVM architecture.

    Kafka-compatible streaming platform that lands events directly into Databricks Unity Catalog via Iceberg Topics.

    2019 · San Francisco, USA · 201-500
    Ecosystem-adjacent · Streaming · Technology Partner; native Iceberg + Unity Catalog integration
  • Coalesce

    Series B · $81M raised

    Visual data transformation platform for building governed pipelines on cloud data warehouses.

    Metadata-driven, GUI-based transformation for Databricks SQL. Databricks Technology Partner.

    2020 · San Francisco, USA · 51-200
    Ecosystem-adjacent · Dev Tooling · Technology Partner; first-class Databricks SQL support
  • Protect AI

    Series B · $108M raised

    AI security platform for red-teaming, model vulnerability scanning and runtime LLM protection.

    AI red-teaming and model security integrated with Databricks Mosaic AI Model Serving endpoints.

    2022 · Seattle, USA · 51-200
    Built for Databricks · Security · Technology Partner; Recon integration with Mosaic AI Model Serving
  • RudderStack

    Series B · $82M raised

    Warehouse-native customer data platform that captures events and reverse-ETLs from lakehouses.

    Warehouse-native CDP and reverse-ETL with direct Databricks Delta integration. Databricks Technology Partner.

    2019 · San Francisco, USA · 51-200
    Ecosystem-adjacent · Reverse-ETL · Technology Partner; Databricks as reverse-ETL source
  • Soda

    Series B · $43M raised

    Open-source data quality and observability framework for testing data within pipelines and notebooks.

    Open-source data quality framework that integrates directly into Databricks notebooks and Delta tables.

    2019 · Brussels, Belgium · 51-200
    Built for Databricks · Data Quality · Technology Partner; native Databricks notebook integration
  • Datafold

    Series A · $27M raised

    AI-driven data migration and diff tooling that validates parity across legacy and lakehouse systems.

    Data diff and AI migration agent for moving workloads onto Databricks. Databricks Technology Partner.

    2020 · New York, USA · 11-50
    Built for Databricks · Migration · Technology Partner; AI Migration Agent for Databricks
  • Estuary

    Series A

    Real-time data integration platform with CDC pipelines from operational systems into lakehouses.

    Real-time CDC into Databricks Delta tables. Databricks Technology Partner.

    2019 · New York, USA · 11-50
    Ecosystem-adjacent · Streaming · Technology Partner; native Databricks destination connector
  • lakeFS (Treeverse)

    Series A · $23M raised

    Git-like version control for data lakes, providing branching, merging and rollback for object storage.

    Git for data lakes with native Databricks workflows. Databricks Technology Partner.

    2020 · Tel Aviv, Israel · 11-50
    Ecosystem-adjacent · Dev Tooling · Technology Partner; native Databricks integration
  • Secoda

    Series A

    AI-powered data discovery and catalog for modern data teams, with Databricks Unity Catalog sync.

    AI-powered data catalog with Unity Catalog sync. Databricks Technology Partner.

    2021 · Toronto, Canada · 11-50
    Ecosystem-adjacent · Catalog · Technology Partner; Unity Catalog integration
  • SYNQ

    Series A

    Data reliability platform that unifies testing, ownership and alerting across dbt and Databricks.

    Data reliability platform for analytics engineers, with Databricks support.

    2022 · Copenhagen, Denmark · 11-50
    Built for Databricks · Observability · Technology Partner; native Databricks integration