Alation
Series D+ · $340M raised
Enterprise data intelligence and catalog platform with active metadata and agentic AI capabilities.
Data catalog and governance platform with Unity Catalog integrations. Listed in the Databricks Ventures portfolio.
Companies building on, for, or around Databricks. Curated from the Databricks Ventures portfolio, Partner Directory, AI Accelerator cohort, and the 2025 Partner Awards. Independent reference, not affiliated; ask us to add or remove a company.
72 active · 36 built for · 14 built on · 22 adjacent · 12 acquired
Series D+ · $340M raised
Enterprise data intelligence and catalog platform with active metadata and agentic AI capabilities.
Data catalog and governance platform with Unity Catalog integrations. Listed in the Databricks Ventures portfolio.
Series D+ · $416M raised
SQL-based data transformation framework that has become the standard for analytics engineering.
SQL-based transformation framework that runs as a first-class citizen on Databricks. Both a Databricks Ventures portfolio company and a 2025 Partner Award winner. Now merging with Fivetran.
Series D+ · $600M raised
AI-powered enterprise knowledge assistant that searches and reasons across company applications.
Enterprise AI search and agents platform. Part of the Databricks Ventures portfolio; valued at $7.2B in its 2025 Series F.
Series D+ · $322M raised
Composable customer data platform and reverse-ETL that activates lakehouse data into business tools.
Reverse-ETL pioneer that activates Databricks data into 200+ business tools. Both a Databricks Ventures portfolio company and 2025 Partner Award winner.
Series D+ · $267M raised
Data security platform providing access control, policy enforcement and audit on cloud data platforms.
Data access control and policy automation that integrates deeply with Databricks Unity Catalog. Part of the Databricks Ventures portfolio.
Series D+ · $188M raised
Training data and human-in-the-loop platform for building ML and frontier-model datasets.
Training data platform with native Databricks integration. Part of the Databricks Ventures portfolio.
Series D+ · $310M raised
Data productivity cloud with low-code ETL pipelines and orchestration for cloud data platforms.
Low-code ETL with native Databricks integration. Part of the Databricks Ventures portfolio.
Series D+ · $1.5B raised
AI-powered conversational search engine that synthesizes web answers with cited sources.
Consumer AI answer engine. Part of the Databricks Ventures portfolio; valued at $20B in 2025.
Series D+ · $742M raised
Cloud BI and analytics with a spreadsheet-style interface that runs natively on cloud data platforms.
Spreadsheet-style cloud BI on Databricks. Both Databricks Ventures portfolio and 2025 BI Partner of the Year.
Series C · $85M raised
AI-powered SOC platform for security detection engineering on top of data lakes including Databricks.
Provides a detection engineering layer that runs natively on Databricks-as-security-data-lake. Won the 2025 Databricks Growth Built on Partner of the Year award.
Series C · $122M raised
Collaborative analytics and data science notebook platform for teams working with SQL and Python.
Collaborative analytics workspace with deep Databricks integration. Part of the Databricks Ventures portfolio.
Series C · $118M raised
Modern SOC platform that runs detection and response on a customer's Databricks security data lake.
Modern SIEM alternative that ingests security data directly into a customer's Databricks lakehouse. First security partner on the open security lakehouse ecosystem.
Series C · $2.3B raised
European generative AI lab building frontier open-weight LLMs and enterprise AI products.
European frontier LLM lab whose models are served via Databricks Mosaic AI Model Serving. Part of the Databricks Ventures portfolio.
Series B · $82M raised
Automated data quality monitoring using ML to detect anomalies in data warehouses and lakehouses.
Builds automated data quality monitoring with native Databricks integration. Part of the Databricks Ventures portfolio.
Series B
Universal semantic layer that delivers consistent metrics across BI tools and AI applications.
Open-source semantic layer that runs on Databricks SQL warehouses. Part of the Databricks Ventures portfolio.
Series B · $68M raised
GenAI evaluation and observability platform for measuring LLM and agent quality in production.
Evaluation intelligence for AI teams. Databricks Ventures led participation in the Series B; later acquired by Cisco.
Series B · $160M raised
Open-source agent framework and observability platform (LangSmith) for building LLM applications.
Agent framework and LLM observability with native Databricks Mosaic AI integration. Part of the Databricks Ventures portfolio; unicorn at $1.25B.
Series B · $552M raised
AI software creation platform that turns natural-language prompts into shippable web applications.
Vibe-coding platform for building apps from natural language. Part of the Databricks Ventures portfolio; valued at $6.6B in late 2025.
Series B · $132M raised
AI security platform for governing models, agents and third-party AI applications across the enterprise.
AI model and agent security platform with native Databricks integration. Part of the Databricks Ventures portfolio.
Series B · $95M raised
Business intelligence platform combining a semantic model with self-serve exploration for analysts.
BI platform from ex-Looker and Stitch leaders. Part of the Databricks Ventures portfolio.
Series B · $78M raised
Low-code data engineering and self-serve transformation platform optimized for Databricks SQL.
Visual data engineering and AI-driven data prep, deeply integrated with Databricks Lakeflow and SQL. Part of the Databricks Ventures portfolio.
Series B
Browser-based software creation platform that lets anyone build and deploy apps using natural language.
AI software creation platform. Part of the Databricks Ventures portfolio.
Series B · $60M raised
Behavioral data platform that captures rich event streams and lands them in lakehouses.
Behavioral data pipeline that lands AI-ready customer events into Databricks. Part of the Databricks Ventures portfolio.
Series B · $75M raised
AI pipeline platform for building high-quality multimodal datasets for ML and LLM training.
Training data platform that won the 2025 Databricks Customer Impact Partner of the Year award.
Series B · $65M raised
Data transformation tools that turn unstructured documents into LLM-ready structured data.
Unstructured-data ETL for LLMs and RAG. Part of the Databricks Ventures portfolio.
Series A · $30M raised
Automated data curation and quality platform that fixes label errors and trustworthiness issues for AI.
Data quality platform for ML and LLM training data. Part of the Databricks Ventures portfolio; later acquired by Handshake.
Series A · $20M raised
Data contracts and governance platform to manage source-data changes before they break downstream pipelines.
Data contracts platform that powers the 'shift-left' movement. Part of the Databricks Ventures portfolio.
Series A · $38M raised
AI-native multimodal data lakehouse with vector indexing on top of an open columnar format.
Multimodal lakehouse purpose-built for AI workloads. Part of the Databricks Ventures portfolio.
Series A · $28M raised
Framework and cloud platform for building data-backed agentic LLM applications over unstructured data.
Open-source framework and managed cloud for RAG and agentic AI. Part of the Databricks Ventures portfolio.
Series A
Unified API and marketplace for accessing hundreds of large language models with usage billing.
Multi-model LLM router and marketplace. Part of the Databricks Ventures portfolio.
Series A · $107M raised
Video understanding foundation models for search, classification and content generation.
Video understanding foundation models. Part of the Databricks Ventures portfolio; partners with LanceDB for multimodal RAG.
Seed · $4.5M raised
Agentic marketing intelligence platform for mid-market companies, built on Databricks.
Marketing intelligence agents from ex-Google DeepMind and Databricks engineers. Databricks Ventures led seed alongside Kindred Ventures.
Bootstrapped
Data and AI services partner specializing in Databricks implementation across APJ enterprises.
Databricks-focused data and AI implementation partner. Databricks Ventures portfolio and 2025 APJ SI Partner of the Year.
AI co-worker for revenue teams that automates customer management workflows.
AI co-worker for customer management built for revenue teams. Listed in the Databricks Ventures portfolio.
Builds diffusion-based large language models for faster, more efficient text generation.
Diffusion-based LLM research lab. Part of the Databricks Ventures portfolio.
Generates large-scale single-cell AI datasets and Virtual Cell Models for drug discovery.
Biology AI startup generating proprietary single-cell datasets. Part of the Databricks Ventures portfolio.
Data and AI operations security platform for sensitive data discovery, posture and runtime protection.
Data and AI security posture management on Databricks. Part of the Databricks Ventures portfolio.
Stage unknown
Identity and data enrichment platform that operates on top of customer Databricks environments.
Identity and audience data enrichment built on Databricks. Co-recipient of the 2025 Data Built Partner of the Year award.
Series D+ · $546M raised
AI-native email security platform protecting enterprises from advanced socially-engineered attacks.
Email security at scale, built on Databricks. Featured in the original Built on Databricks announcement.
Series D+ · $18B raised
AI safety company building the Claude family of frontier large language models.
Frontier model lab whose Claude models are served on Databricks Mosaic AI. Won the 2025 AI Visionary award.
Series D+ · $728M raised
Automated data movement platform with hundreds of pre-built connectors into cloud lakehouses.
Managed ELT into Databricks lakehouses. 2025 Databricks Data Integration Partner of the Year; merging with dbt Labs.
Series D+ · $236M raised
Data observability platform that monitors freshness, volume and quality of data and AI pipelines.
Data and AI observability with deep Databricks integration. Won the 2025 Databricks Data Governance Partner of the Year award.
Series D+ · $200M raised
Revenue intelligence platform that uses AI to surface deal insights for enterprise sales teams.
Revenue intelligence on Databricks. Featured in the Built on Databricks program.
Series D+ · $410M raised
Decision intelligence platform using graph analytics for KYC, AML and fraud across financial services.
Decision intelligence platform deployed on Databricks for financial services. Won the 2025 Enterprise Built on Partner of the Year award.
Series D+ · $525M raised
Alternative-data and analytics provider serving institutional investors and corporate clients.
Alternative data analytics built on Databricks. Long-tenured Built on Databricks partner.
Series C · $141M raised
Low-code platform for building internal tools and AI agents on top of databases and APIs.
Internal-tools low-code platform that connects to Databricks SQL and Mosaic AI. Won the 2025 Databricks Emerging Partner of the Year award.
Series A
Warehouse-native product analytics built directly on Databricks and other lakehouses.
Product analytics that uses zero-copy data sharing on Databricks instead of replicating events. Built on Databricks partner.
Series A
Cloud cost optimization platform that automates rightsizing and commitment management on AWS.
Cloud cost optimization platform managing $4B+ annual spend, with its production app rebuilt on Databricks Lakebase.
Seed
AI-native go-to-market platform for regulated financial services with a 360-degree customer graph.
GTM intelligence built on Databricks for financial-services sellers. Built on Databricks partner with the Context360 product.
Seed
Cloud data platform for energy asset operators that standardizes OT data for AI analytics.
Industrial data platform for the energy sector. Validated Built on Databricks partner.
Bootstrapped
Open-source data science platform (formerly RStudio) for R, Python and Quarto workflows.
Maker of RStudio and Posit Workbench, with Databricks Connect integrations. Won the 2025 Databricks Developer Tools Partner of the Year award.
Security analytics that uses pattern recognition to reduce alert noise and false positives in real time.
AI-native SOC tooling. Member of the Databricks AI Accelerator cohort.
Domain-specific AI platform (Orbital) for the energy sector built on lakehouse infrastructure.
Vertical AI for energy operators. Member of the Databricks AI Accelerator cohort.
AI data refinery that transforms raw consumer data into activatable marketing intelligence.
Consumer data refinery for marketing. Member of the Databricks AI Accelerator cohort.
Unified investigation platform that connects code and telemetry to debug software issues with AI.
AI-powered investigation across code and telemetry. Member of the inaugural Databricks AI Accelerator cohort.
Composable transformation layer for connecting IoT, security and operational systems with AI.
Composable transformation for operational data. Member of the Databricks AI Accelerator cohort.
Data automation and intelligence platform that consolidates and acts on operational data.
Data automation for security operations. Member of the Databricks AI Accelerator cohort.
Series D+ · $414M raised
Distributed query engine (commercial Trino) with federated analytics across lakehouses and warehouses.
Federated query engine that interoperates with Databricks Delta and Iceberg tables.
Series D+ · $108M raised
AI-powered data observability and FinOps for Databricks, Spark, and modern data pipelines.
Observability and cost optimization purpose-built for Databricks workloads. Partner in the Databricks Lakehouse Observability program.
Series C · $106M raised
Data observability platform monitoring data quality, pipelines and infrastructure across modern stacks.
Multi-platform data observability with first-party Databricks integration. Databricks Technology Partner.
Series C · $131M raised
ML and LLM observability platform for monitoring, tracing and evaluating AI applications in production.
ML and LLM observability that connects directly to MLflow and Databricks Mosaic AI.
Series C · $220M raised
Active metadata and data catalog platform that connects to modern data stacks including Databricks.
Active metadata and catalog with bidirectional Databricks sync. Databricks Technology Partner.
Series C · $165M raised
Streaming data platform with a Kafka-compatible API and a single-binary, no-JVM architecture.
Kafka-compatible streaming platform that lands events directly into Databricks Unity Catalog via Iceberg Topics.
Series B · $81M raised
Visual data transformation platform for building governed pipelines on cloud data warehouses.
Metadata-driven, GUI-based transformation for Databricks SQL. Databricks Technology Partner.
Series B · $108M raised
AI security platform for red-teaming, model vulnerability scanning and runtime LLM protection.
AI red-teaming and model security integrated with Databricks Mosaic AI Model Serving endpoints.
Series B · $82M raised
Warehouse-native customer data platform that captures events and reverse-ETLs from lakehouses.
Warehouse-native CDP and reverse-ETL with direct Databricks Delta integration. Databricks Technology Partner.
Series B · $43M raised
Open-source data quality and observability framework for testing data within pipelines and notebooks.
Open-source data quality framework that integrates directly into Databricks notebooks and Delta tables.
Series A · $27M raised
AI-driven data migration and diff tooling that validates parity across legacy and lakehouse systems.
Data diff and AI migration agent for moving workloads onto Databricks. Databricks Technology Partner.
Series A
Real-time data integration platform with CDC pipelines from operational systems into lakehouses.
Real-time CDC into Databricks Delta tables. Databricks Technology Partner.
Series A · $23M raised
Git-like version control for data lakes, providing branching, merging and rollback for object storage.
Git for data lakes with native Databricks workflows. Databricks Technology Partner.
Series A
AI-powered data discovery and catalog for modern data teams, with Databricks Unity Catalog sync.
AI-powered data catalog with Unity Catalog sync. Databricks Technology Partner.
Series A
Data reliability platform that unifies testing, ownership and alerting across dbt and Databricks.
Data reliability platform for analytics engineers, with Databricks support.