AI Agents
Recent items mentioning AI Agents across the Databricks ecosystem — releases, news, videos, and community Q&A. Updated hourly.
Databricks is rapidly expanding the practical applications of AI agents, embedding Genie AI agents directly into Veeva Vault CRM for real-time answers in life sciences 2 and transforming electric grid operations from manual to autonomous control 3. The platform also emphasizes building and evaluating high-quality, trustworthy AI agents, leveraging MLflow for observability, automated evaluation, and prompt optimization 7, and enabling personalized responses through user-delegated actions via Model Context Protocol (MCP) servers 4. Furthermore, Databricks Genie Code is emerging as an agentic coding system, allowing users to build custom "skills" for repeatable and consistent engineering tasks 109.
Generated daily from the 10 most recent items mentioning AI Agents. Click any [N] to jump to the source.
Knowledge bases in medallion architecture
Would you put knowledge bases in the bronze/silver/gold layer? The raw documents definitely reside in the bronze layer. But if I create AI Agents atop a volume storing the raw documents, then the knowledge base remains in bronze. However, if I create vector embeddings/do chunking/create a vector search index, then these tables should be in the silver layer. Am I on the right track?
The question your commercial data should already be able to answer
Databricks and Veeva now embed Genie AI agents and AI/BI dashboards directly into Veeva Vault CRM, enabling life sciences commercial teams to get real-time answers to their questions without leaving their workflow. This unified Databricks lakehouse with Unity Catalog delivers governed commercial data to every persona, from sales reps to MSLs, in the format and depth their role requires.
From manual to autonomous: how AI agents are transforming electric grid operations
AI agents are transforming electric grid operations from manual to autonomous control, addressing unprecedented demand and complexity. Hawaiian Electric saw a 60X reduction in document query times, from five minutes to five seconds, in just two weeks with early AI agent deployment.
NewsAI for Data Intelligence Demo: Real-time fraud Detection with Databricks
Databricks demonstrates a real-time fraud detection solution for identifying mule accounts in banking, leveraging a unified data architecture, advanced AI/ML, and graph analytics to uncover complex fraud networks. The solution provides investigators with a single pane of glass application and AI-powered querying (Genie) to analyze risk scores, transaction patterns, and shared device access for efficient fraud investigation and reporting.
TutorialsMaking AI Feel Personal: User-Delegated Actions in MCP Agent Systems
The video demonstrates how to build an AI agent in Databricks that provides personalized responses by integrating user-delegated actions through Model Context Protocol (MCP) servers. It walks through setting up Unity Catalog functions, external MCP tools like web search, and custom MCP servers to access internal APIs, all while maintaining user context for relevant information retrieval.
NewsData + AI Executive Series: Fast 5 — Scaling Real-Time Ops with Databricks at Aer Lingus
Aer Lingus uses Databricks to scale real-time operations, particularly for making critical decisions in their operation control center regarding flight delays and cancellations. They are also exploring using "Agentic" to automate business case creation and review, aiming for a single, governed platform for reusable agents.
NewsData + Semantic Context = AI Ready | How TK Elevator Built It on Databricks
TK Elevator built an AI-ready data platform on Databricks Lakehouse, centralizing fragmented elevator data at scale. This platform integrates semantic context and expert knowledge, using Unity Catalog for governance and a medallion architecture to prepare data for AI applications.
EventsBuilding Trustworthy, High-Quality AI Agents with MLflow
MLflow provides a comprehensive platform for building, evaluating, and deploying high-quality AI agents, offering tools for observability, automated evaluation, prompt optimization, and production monitoring. It enables developers to streamline the agent development lifecycle, from prototyping and testing with human and AI judges to fixing issues and ensuring reliable, governed deployment.
NewsEvaluating AI in Production: A Practical Guide
The video provides a practical guide to evaluating AI in production, emphasizing that evaluation is a continuous process, not a one-time task. It details common evaluation processes, including developing hypotheses, gathering improvement signals, defining success criteria, and utilizing various scoring methods like code-based, LLM-as-judge, and human review.
Agentic Data Engineering with Genie Code and Lakeflow
NewsEnhancing your Skills with Databricks Genie Code
Databricks Genie Code is an agentic coding system that allows users to build custom "skills" using markdown files, enabling it to generate code and perform tasks according to specific in-house standards and conventions. These skills provide context-on-demand, ensuring repeatable and consistent output for various engineering tasks like schema documentation or metric view creation.
The next generation of Databricks Genie just launched. Here is what data engineers actually need to know.
I have been following Genie since it first launched with AI/BI last year. Back then, I honestly thought it was mostly for business users. A chatbot on top of your data that could answer basic questions in plain English. Useful, but not something I thought data engineers really needed to care much about. After seeing the new 2026 version, I completely changed my mind. Genie is no longer just a business chatbot. The biggest change is Genie Code, which is basically an AI agent designed for data professionals. It can generate pipelines, debug failures, create dashboards, monitor systems, and work directly with Lakeflow and Unity Catalog. That part caught my attention immediately because it moves beyond simple Q&A and starts touching actual engineering workflows. What surprised me most is how connected the whole system has become. It can pull context from dashboards, Genie Spaces, apps, metadata, documentation, and external systems like GitHub, Jira, and Confluence through MCP. Instead of only searching tables, it tries to understand relationships across the environment. That feels very different from the first version. The operational side is also interesting. Genie Code can monitor pipelines, investigate failures, help with DBR upgrades, and respond to issues before teams even notice them. The more I read about it, the more it felt less like a chatbot and more like an assistant sitting beside the engineering team. But honestly, the biggest takeaway for me is not the AI itself. It is what this means for data engineers. A lot of people immediately jump to “AI will replace data engineers,” but I think the opposite is happening. These systems are only as good as the data foundation underneath them. If metadata is incomplete, if tables are messy, if naming conventions are inconsistent, or if documentation is missing, the AI layer will give poor answers confidently. That means clean data modeling, governance, metadata, documentation, and data quality are becoming even more important than before. The engineers building those foundations become more valuable, not less. I think the role is slowly shifting away from spending hours writing repetitive boilerplate transformations and more toward building trustworthy, AI-ready data systems. One thing I keep noticing while learning Databricks through BricksNotes and the wider community is that the platform is moving very quickly toward AI-native data engineering. Features like Unity Catalog, Lakeflow, and now Genie all connect together. It feels like understanding metadata and governance is becoming just as important as understanding Spark itself. Also interesting that Genie now has a full mobile experience on iOS and Android. Business users can access dashboards, apps, and chat directly from their phones, which means the underlying data quality matters even more because people are going to depend on these systems everywhere, not only during work hours. Curious if anyone here is already using Genie or Genie Code in production. I would genuinely like to hear how the answer quality has been and whether your teams are changing how they approach metadata and documentation because of it.
News2026 & Beyond: Agentic Future in Finance
Databricks emphasizes that an "agentic future" in finance requires organizations to leverage their unique, proprietary data to provide context to AI models, which is the true competitive advantage. The video demonstrates how Databricks' platform centralizes and governs enterprise data, enabling AI agents to make informed, secure, and differentiated business decisions.
MCP Marketplace Brings Real-Time Intelligence to Agentic Applications
MCP Marketplace now provides real-time external intelligence for agentic applications, with partners like You.com and Moody's offering governed data. Lakebase and Genie enable end-to-end workflows, allowing agents to maintain context and surface decisions to business users for review.
NewsDatabricks Genie, Unity AI Gateway, Project Glasswing, and Model Mania | AI Newsround - April 2026
Databricks Genie is now the business user home screen for Databricks, offering a unified chat interface, external knowledge store connections, and a mobile app. The Unity AI Gateway, integrated with Unity Catalog, provides comprehensive governance for agentic AI, including permissions, auditing, and policy controls for models and tools.
NewsDatabricks in 3 minutes. The unified data and AI platform, explained.
Databricks unifies diverse data sources into a single data lake, providing a governed platform for analytics and AI. It offers capabilities like fine-grained access control, natural language querying with AI, and company-wide intelligent agents.
Show HN: Recursant – service mesh for governing AI agents
Hello, I have just released Recursant to the public. I have been working on it for a while. It is a control plane for governing AI agents across stacks. It provides full observability, guardrails, and control on the network level by routing all traffic through a side car. Problem statement: many large, regulated enterprises (think banks, telcos) have one engineering team on LangGraph, another on CrewAI, marketing on AgentForce, and data teams on Databricks Agent Bricks. They need their agents to talk to each other with consistent policy enforcement, one audit trail, and a single set of guardrails, yet allowing different functions to run on their own stacks. Recursant solves that problem using the service mesh pattern Recursant has two components: a registry and the mesh . The registry contains all live agents. The mesh uses sidecars to route traffic and enforce on the network layer. Aim is for Recursant to provide a real-time EU AI Act Annex IV compliance, so it is not generated from static documents. This saves time and effort for large enterprises subject to the requirement. Linmitations: - Recursant currently plugs in to CrewAI, Langgraph, and n8n . The aim is to support proprietary platforms such as ServiceNow and AgentForce as much as psosible. - The Recursant SDK still needs work to support as many agents as possible - I would also like to provide support for some of the 'personal agent' platforms such as OpenClaw, NanoClaw, and Hermes - Only tested on k8s, not public cloud - Documentation is sparse and needs to be developed. I hope this project is useful to some of you. --- top comments --- [goodra7174] clawdlinux.org building the Kubernetes-native runtime that AI agents call to provision their own secure execution environments — the missing infrastructure layer for every enterprise that can’t send agent data to the cloud #Ycombinator #buildinpublic #startups
Claude code + AI Dev kit vs Genie code
Hi everyone, i am testing the limits for agentic engineering for databricks. Has anyone tried using claude code with AI dev kit and comparing it against Genie code? Specifically around these tasks 1) Data engineering 2) ML prototyping 3) AI/BI Dashboard building So far, claude code + AI dev kit seems to work really well. But from a conversation with claude it seems genie code out of the box is better already. Anyone have any experience?
The Federal Data Paradox: Rich in Data, Poor in Access
Databricks Genie enables federal agencies to overcome siloed, legacy data infrastructure by providing a natural language interface for governed, real-time data access. This empowers mission experts to make faster, evidence-based decisions without requiring a technologist for every routine task.
AI Applications: Tools, Use Cases, and Platforms
AI applications span four capability tiers, each with distinct data requirements and evaluation frameworks, and enterprise deployments often stall due to inadequate data infrastructure. Production-grade model development, from prompt engineering to pretraining, is increasingly accessible with open-source LLMs, but requires pre-built governance and monitoring infrastructure for successful deployment at scale.
Alert Fatigue Is a Business Risk
Lakewatch and Databricks Genie unify data for agentic, machine-speed threat detection, triage, and response, directly addressing the business risk of alert fatigue in SOCs. This new approach helps overcome fragmented telemetry and legacy SIEM architectures that create signal-to-noise challenges and limit effective threat detection.
Possible degradation of Genie agentic capabilities in Databricks Free Edition this week?
NewsTalkdesk Powers AI-Driven CX with Databricks on AWS
Talkdesk uses Databricks on AWS as a unified data platform to power its AI-driven customer experience (CX) platform, which automates and accelerates customer interactions. Databricks centralizes data storage, provides consistent data modeling, and unifies data processing pipelines, enabling Talkdesk to manage both unstructured and structured data in Iceberg format and leverage generative AI capabilities.
Companies Winning with AI Built the Data Layer First
Companies winning with AI, like Trinity improving delivery by 15% and ETA models by 50%, built a unified, governed, and accessible data layer first. Consolidating fragmented systems into a single architecture enables real-time AI, faster decisions, and lower costs.
Agents are ready but your architecture probably isn't
Agentic systems are failing in production due to siloed data, poor governance, and analytics-focused infrastructure. CDOs and CTOs need a transactional database built for agents and a clear vision for success.
MLflow 3.12.0rc0 introduces enhanced AI agent development features, including automatic tracing for more AI coding assistants and OpenClaw, along with new AI Gateway Guardrails for safety checks. It also adds multimodal trace attachments for images and audio in the UI, and a new mlflow.diffusers flavor for saving and serving diffusion models.
[FREE WEBINAR] Running Supply Chain Operations on Databricks: From Dashboards to Agents (BrickTalk)
Hey r/Databricks! We're hosting a free community sponsored BrickTalk this Thursday, May 7th, focusing on modern Supply Chain Management using Databricks. BrickTalks is a community event series where Databricks experts share real-world use cases, demos, and practical insights for building with data and AI, giving customers a direct line to the people behind the products. Discover how to build a unified Control Tower that delivers real-time inventory visibility, AI-powered demand forecasting, and autonomous planning. We'll demo an end-to-end operational platform featuring Databricks AI/BI Genie Rooms and multi-agent Supervisor workflows. You'll see: * Dashboards surfacing key insights. * Genie answering natural language queries grounded in live data. * Agentic systems autonomously processing inbound requests to generate fulfillment plans. This is a great chance to see real-world use cases and get practical insights directly from Databricks experts. **When:** Thursday, May 7 * 9:00 am PT * 12:00 pm ET * 5:00 pm London * 9:30 pm IST [Register here](https://usergroups.databricks.com/events/details/databricks-user-groups-bricktalks-presents-supply-chain-management-bricktalk-running-supply-chain-operations-on-databricks-from-dashboards-to-agents/) Drop any questions below! 👇
Agentic Data Engineering with Genie Code and Lakeflow
Genie Code, an autonomous AI partner for data engineers, is now integrated directly into Lakeflow. Data engineers can leverage Genie Code within Lakeflow's Pipeline Editor and Jobs for the full data engineering lifecycle, from development and orchestration to monitoring and debugging.
ai_classify() - Am I going insane?
I need someone to double check me here because I'm hoping I'm confused. So Databricks has it's nice no-code data classification tool in the UI to let users orchestrate AI functionality. Very cool so far. However, one small problem - there's no option in the UI to select a model. It just runs the default offering from the model reg. So I think maybe that's just not implemented yet and go the SQL route to use ai\_classify() directly. Lo and behold you can't choose the model here either, and the built in model choice is still opaque! What? Databricks doesn't seem to do any analysis on my workflow to automatically route to an appropriate model. Is it really just using the same model regardless of prompt complexity? If this is the case, what is the actual usecase for this functionality? I have been cheerleading Databricks at work for a while now and have onboarded a ton of users, but how can I endorse using ai\_classify() in workflows when the spend has the potential to be so wildly mismatched to the task? I'm currently advising people who want low code agentic solutions to use n8n, which is a shame because pretty much all the other resources they might want to include are in Databricks. Of course technical users can use ai\_query(), but the overlap of people who want to code their own agentic workflows and people who want to use Databricks built-ins is pretty small.
Inside one of the first production deployments of Lakebase: LangGuard's agentic workflow governance engine
LangGuard's agentic workflow governance engine, one of the first production deployments of Lakebase, extends Unity Catalog and AI Gateway with runtime enforcement for autonomous AI agents. Lakebase provides the elastic, low-latency operational data layer for LangGuard's GRAIL™ data fabric, enabling real-time policy evaluation without impacting agent performance.
AI observability for production: Seeing Inside Your Multi-Agent System with MLflow
MLflow now offers enhanced AI observability for multi-agent systems, providing crucial visibility into their internal workings. This helps practitioners prevent unintended actions like data purges or sensitive information leaks in production.
The SDK now automatically detects AI coding agents and appends agent information to HTTP request headers, while also removing the unused `experimentalIsUnifiedHost` field from `DatabricksConfig`. A bug fix addresses `X-Databricks-Org-Id` header issues for `SharesExtImpl.list()` on SPOG hosts, and several API changes introduce new services like `secretsUc()` and `supervisorAgents()`, along with breaking changes to method paths for various update operations.
WorkspaceExt upload/download and SharesExt list now include the X-Databricks-Org-Id header for SPOG host compatibility. WorkspaceClient.get_workspace_id avoids an API call when the workspace ID is already known, fixing a failure on SPOG hosts.
NewsGenAI - For Data Engineers Agenda & Introduction | LLM & Agentic AI | LangChain & LangGraph | Claude
This video introduces a new course, "GenAI for Data Engineers," designed to teach data engineers how to leverage generative AI, LLMs, and agentic AI. The course covers basics of LLMs, building agents with LangChain and LangGraph, using Cloud Code, and applying agentic AI within Databricks and data engineering workflows.
This release adds new workspace-level services for supervisor agents and Unity Catalog secrets, along with an update method for tokens. Several API methods for data classification, environments, knowledge assistants, Postgres, and warehouses have breaking changes due to path modifications.
Structuring AI Evaluation and Observability with MLflow: From Development to Production
MLflow now offers enhanced tools for structuring AI evaluation and observability, including new APIs and UI features for logging LLM calls, prompts, responses, and metrics. This enables practitioners to systematically track, compare, and analyze model performance and behavior across development and production, facilitating iterative improvement and robust monitoring.
Why metric definitions matter for reliable AI agents
Learn how dbt's semantic foundation enables reliable, governed agentic analytics.
This release adds Azure MSI authentication and improves `.databrickscfg` profile resolution. It also fixes issues with non-JSON error responses and Databricks CLI token scope mismatches.
NewsHow Agentic AI is Rewriting Healthcare | NVIDIA x Databricks
Agentic AI is profoundly changing healthcare by automating administrative tasks for professionals and accelerating scientific research, such as drug discovery. Databricks and NVIDIA are collaborating to build an AI-ready data layer and open-source platforms to unlock insights from digitized medical data, enabling these agentic systems.
Meet Antigravity: Google’s agentic IDE enters the dbt orbit
Antigravity, Google's new agentic IDE, now integrates with dbt. This pairing promises to significantly improve developer productivity, potentially giving you your weekends back.
Exploring dbt and Google with AI agents
Learn how to build your first ddbt agent by plugging AI into a dbt project. This practical guide explores what happens when AI agents interact with dbt and Google.
NewsDatabricks Apps vs Model Serving: Authentication, Cost, and Performance Compared
Databricks Apps are now the recommended first choice for deploying agents due to their flexibility in handling full-stack applications with multiple components, offering faster iteration and local testing compared to Model Serving. Model Serving remains suitable for use cases prioritizing high QPS, governance features like AI Gateway, inference tables, and guardrails, or when scaling to zero is acceptable for cost optimization.
EventsStrategic App Expansion and the Power of Proprietary Data | Ali Ghodsi at HumanX
Databricks plans to strategically expand its SaaS application offerings, focusing on areas where proprietary data, security, and governance create a strong competitive moat. The company will prioritize applications that leverage its expertise in massive data processing.
EventsHow Databricks Manages Enterprise Data and AI | Ali Ghodsi at HumanX
Databricks centralizes an organization's data from various systems into a Lakehouse, securing it and setting access rules. This consolidated and secured data then feeds into AI agents, models, and analytics for business forecasting and insights.
EventsSolving the AI Reliability Gap | Ali Ghodsi at HumanX
AI agents currently struggle with end-to-end tasks due to a lack of context, not intelligence. Addressing this reliability gap requires capturing context and changing organizational processes, a multi-year effort that Databricks is focused on.
EventsThree Things Required for Deeper Insights from AI | Ali Ghodsi at HumanX
Databricks enables deeper AI insights by combining agents and AI with a robust database and an analytics platform. This approach allows enterprises to leverage their proprietary data for predictive analytics beyond what traditional SaaS applications offer.
EventsHow Novo Nordisk Uses Databricks Genie for Research | Ali Ghodsi at HumanX
Novo Nordisk utilizes Databricks Genie to enable its scientists to query data warehouses and databases. This allows researchers to ask complex questions about studies, such as adverse effects, and receive accurate, statistically referenced answers.
EventsThe Limits of Human-Led Security Operations | Ali Ghodsi at RSAC
Current Security Information and Event Management (SIM) systems are limited by data ingestion pricing models, leading to incomplete data capture and a lack of long-term historical analysis. Furthermore, detection, investigation, and threat hunting processes within these systems are largely manual, resulting in security operations teams being overwhelmed and detecting only a fraction of potential threats.
Tired of Reviewing Traces? Meet Automatic Issue Detection for Your Agent
Automatic issue detection for your AI agent is now available, eliminating the need for manual trace reviews. This new feature helps you act on your observability data, improving the user experience beyond just recording logs, metrics, and traces.
MLflow 3.11.1 introduces AI-powered issue detection for agent traces, budget alerts and limits for AI Gateway spending, and a new interactive graph view for visualizing trace hierarchies. It also enhances security with pickle-free model serialization and improves dependency management with native UV support.
TutorialsFrom Excel to AI Agents: The Evolution of BI Explained
The video explains the evolution of Business Intelligence (BI) through four phases, from IT-centric to analyst-driven, then semantic layers, and finally to a future where AI agents are primary BI users. It demonstrates how Databricks' BI stack, including Dashboards, Genie (natural language interface), Metric Views (semantic layer), and Databricks One (serving layer), addresses these evolving needs by providing a unified, open, and AI-ready platform.
NewsLakebase: Postgres That Actually Likes Your Lakehouse
Lakebase is a new Databricks offering that provides a fully managed, autoscaling PostgreSQL database designed to bridge the gap between analytical and transactional workloads in a lakehouse architecture. It features bidirectional data streaming between Delta tables and PostgreSQL, database branching for isolated development, and Unity Catalog governance.
NewsSee Databricks Assistant Build a Metric View in 90 Seconds
The video demonstrates how Databricks Assistant can build a metric view in 90 seconds by generating YAML code for joins, dimensions, and measures from a natural language prompt. This metric view, a miniature semantic model, centralizes business logic and is queryable via SQL by various tools and agents.
ReleasesIntroducing Pantheon - Agentic Engineering At Scale
Pantheon is a Databricks application that uses a multi-agent system to generate Lake Flow pipelines for data engineering, allowing users to define data ingestion and transformation rules through a conversational interface. It automates the design, validation, and code generation for lakehouse pipelines, enabling citizen engineers to build robust data solutions without deep PySpark knowledge.
The SDK now automatically detects AI coding agents and appends `agent/<name>` to HTTP request headers. New `DisableGovTagCreation` fields were added to `settings.RestrictWorkspaceAdminsMessage` and `settingsv2.RestrictWorkspaceAdminsMessage`.
The SDK now automatically detects AI coding agents and appends agent information to HTTP request headers. New fields were added to restrict workspace admin tag creation in settings messages.
Your Agents Need an AI Platform
MLflow 2.12 ships with new features for building and managing AI agents, including enhanced logging for agent traces, evaluation tools, and versioning capabilities. Leverage MLflow as your unified platform for developing, deploying, and governing reliable AI agents in production.
NewsDatabricks Lakebase - Instant OLTP for Apps & Agents
Databricks Lakebase provides an OLTP-style database within the Databricks Lakehouse ecosystem, enabling rapid, scalable transactional processing for applications and AI agents. It allows users to quickly provision autoscaling databases that can spin up and down in milliseconds, offering a cost-effective solution for operational data storage.
NewsOpenClaw, Databricks Agentic Data Monitoring & more! | AI Newsround - February 2026 | Advancing AI
The video discusses OpenClaw, an open-source framework for AI agents, and Databricks' new agentic data quality monitoring solution. It also introduces Advancing Analytics' Lake Forge and Pantheon, a framework and AI layer for developing scalable Lake Flow pipelines, and highlights new model releases from Anthropic, Google, and OpenAI.
NewsDatabricks Breaking News: 2026 Week 6: 2 February 2026 to 8 February 2026
Databricks introduces agentic data quality monitoring with anomaly detection, LLM judge UI builder for MLflow, and new SQL warehouse features including a default option and activity details. The platform also enhances its assistant to connect with MCP servers, improves Google Sheets integration with pivot table functionality, and adds direct Git deployment and tagging for Databricks apps.