Anthropic
Recent items mentioning Anthropic across the Databricks ecosystem — releases, news, videos, and community Q&A. Updated hourly.
Databricks users are actively discussing Anthropic's Claude Code in conjunction with the Databricks AI Dev Kit 18 and Databricks Genie Code 478. There's also a reported limitation where the Databricks FMAPI Anthropic endpoint rejects requests with a trailing assistant message 5.
Generated daily from the 10 most recent items mentioning Anthropic. Click any [N] to jump to the source.
Claude Code + Databricks AI Dev Kit
Finding Databricks cached token usage (count and/or cost)
I was asked to do a comparison of token usage between Databricks and Azure AI Services. With Microsoft you get your list price (and discount price if you have an EA) and they break the costs into input, output, input cache, output cache, and sometimes many other SKUs. Databricks charges us what looks like a single price for a token. We can find the input and output count, but the only place we've found any cached token counts is in the gateway system table system.ai\_gateway.usage. (which accounts for only about 4% of our overall token volume) Why do we need this? Without it, we're betting on 40B Claude Opus 4-6 (for example) being at full input and output costs. That artificially makes Azure look more expensive, and we have Azure often between 75-95% cached usage. Currently our model gives us a $2.16/1M token cost for Databricks, and its $5.20/1M for Azure marketplace. At around 70% cache usage, the table will turn to Azure being cheaper. https://preview.redd.it/qx0t0xg15d1h1.png?width=1534&format=png&auto=webp&s=362fd144e41bb2170bc682d9f9441d1db5292606 I need to know at least the number count so I can get a percentage then do a fair comparison. Databricks needs a parts catalog like Azure. It seems they bury it on purpose. That or I just haven't found the right place to look. Thanks. BTW: of course Microsoft isn't perfect either. Claude models are marketplace driven, not in the FOCUS cost table, so we have to look in two separate places for model pricing. At least its there.
Modular structure for Databricks Apps (Streamlit)
Hey, I wanted to share something that's been bugging me for a while and get your take. The official Databricks Streamlit tutorial puts everything into a single **app.py.** Fine for a demo. But the moment a real internal app grows past \~500–600 lines, it stops being fun: * Two people on the team touch the same file → merge conflicts every PR. * Hard to write unit tests when UI, data access, and business logic live in one module. * Git diffs become unreadable, and code review suffers. * When I point Cursor/Claude at the repo, it has to re-read the whole monolith on every prompt. Context window and cost both balloon. So I refactored our internal template into something more boring and modular: app. py # entry point only, routing pages/ ├── home. py ├── analytics. py └── settings. py components/ # reusable UI bits services/ # SQL warehouse / UC / SDK calls assets/ ├── styles.css └── logo.png tests/ *This is my own repo, not a product. Sharing because the single-file pattern bit us hard, and I figured others might find it useful -* [*https://github.com/protmaks/databricks\_apps\_streamlit\_mod\_template*](https://github.com/protmaks/databricks_apps_streamlit_mod_template)
NewsEnhancing your Skills with Databricks Genie Code
Databricks Genie Code is an agentic coding system that allows users to build custom "skills" using markdown files, enabling it to generate code and perform tasks according to specific in-house standards and conventions. These skills provide context-on-demand, ensuring repeatable and consistent output for various engineering tasks like schema documentation or metric view creation.
FMAPI Anthropic endpoint rejects requests with trailing assistant message — known limitation?
Ali Ghodsi from Databricks: AGI Is Here, Enterprise AI Is Not
Ali was a guest lecturer at Stanford's MS&E Economics of the AI Supercycle class and he shared some really interesting insights on how he sees the AI adoption trends and why he thinks that stock market is overreacting to Claude's capabilities in eating away top SaaS companies.
NewsDatabricks Genie, Unity AI Gateway, Project Glasswing, and Model Mania | AI Newsround - April 2026
Databricks Genie is now the business user home screen for Databricks, offering a unified chat interface, external knowledge store connections, and a mobile app. The Unity AI Gateway, integrated with Unity Catalog, provides comprehensive governance for agentic AI, including permissions, auditing, and policy controls for models and tools.
Claude code + AI Dev kit vs Genie code
Hi everyone, i am testing the limits for agentic engineering for databricks. Has anyone tried using claude code with AI dev kit and comparing it against Genie code? Specifically around these tasks 1) Data engineering 2) ML prototyping 3) AI/BI Dashboard building So far, claude code + AI dev kit seems to work really well. But from a conversation with claude it seems genie code out of the box is better already. Anyone have any experience?
MLflow 3.12.0 introduces multimodal tracing, allowing storage and rich rendering of PDFs, audio, and images as artifact attachments in tracing spans. It also adds AI Gateway guardrails to prevent unsafe model inputs/outputs and extends coding agent tracing support to Codex, Gemini, and Qwen.
Resident Solution Architect - Anxiety
Hi all, I was approached by a Databricks Recruiter this week regarding a RSA role in Europe. Now I’m proceeding to the Hiring Manager interview next week. I was told that the following steps in the process are: \- HM interview \- Technical Interview (Spark deep dive) \- Technical Interview (live coding Python & SQL) \- Architecture Interview \- Project Delivery Interview Honestly, I’m a bit flabbergasted looking at the process. It seems like a huge effort for both parties and honestly I have a hard time seeing how someone should prep themselves while being in my current job and not abandoning my family for a month. 😄 Furthermore, I get anxiety about having to do live coding. Like I literally have not written a single line of code this year due to AI. Like yeah I review, check, orchestrate, give instructions etc. but being tested on writing live code gives me the chills! I’m a Solution Architect at a consultancy, work with Databricks almost daily and have multiple certs. So the role matches my profile pretty well BUT I really feel intimidated. Is the process really as tough as it sounds? Does nobody use Claude at Databricks? How are the technical deep dives? Any advice on prep or in general? I would love to hear from you and your experience! Thanks!
Chat layer architecture: what should I use for external customers?
May be wrong but as far as I know, we can use multi tenant schema or RLS to do customer isolation. Then give specific read access to tables using customer specific service principles. But after this what should I use? What APIs of databricks? Claude mcp or genie mcp or what would be accessible so that customers can do natural language query on top of data. Is genie internally using claude and adding databricks context so that we can reliably skip claude mcp inside databricks engine?
Show HN: Mljar Studio – local AI data analyst that saves analysis as notebooks
Hi HN, I’ve been working on mljar-supervised (open-source AutoML for tabular data) for a few years. Recently I built a desktop app around it called MLJAR Studio. The idea is simple: you talk to your data in natural language, the AI generates Python code, executes it locally, and the whole conversation becomes a reproducible notebook (*.ipynb file). So instead of just chatting with data, you end up with something you can inspect, modify, and rerun. What MLJAR Studio does: - Sets up a local Python environment automatically, runs on Mac, Windows, and Linux - Installs missing packages during the conversation - Built-in AutoML for tabular data (classification, regression, multiclass) - Works with standard Python libraries (pandas, matplotlib, etc.) - Works with any data file: CSV, Excel, Stata, Parquet ... - Connects to PostgreSQL, MySQL, SQL Server, Snowflake, Databricks, and Supabase. For AI: use Ollama locally (zero data egress), bring your own OpenAI key, or use MLJAR AI add-on. I built this because I wanted something between Jupyter Notebook (flexible but manual) and AI tools that generate code but don’t preserve the workflow. Most tools I tried either hide too much or don’t give reproducible results and are cloud based Demos: - 60-second demo: https://youtu.be/BjxpZYRiY4c - Full 3-minute analysis: https://youtu.be/1DHMMxaNJxI Pricing is $199 one-time, with a 7-day trial. Curious if this is useful for others doing real data work, or if I’m solving my own problem here. Happy to answer questions. --- top comments --- [MSaiRam10] Notebooks as the output format is funny because notebooks are famously bad for reproducibility. Out of order execution, hidden state, etc. You're solving "chat isn't reproducible" with a format that also isn't really [hasyimibhar] How does this compare to open source Deepnote[0]? We use the cloud version (BYOC) at my previous company to replace self-hosted Jupyter notebooks, and it's pretty great. [0] https://github.com/deepnote/deepnote [2ndorderthought] This is one of those product areas I would call high-risk without a human in the loop. So I am glad you kept a person in the loop. It's really easy to lose tons of money making decisions based on bad statistics or models. Anyone remember how much money zillow lost because of automatic time series models? I do have concerns about the workflow. Data people aren't usually the best programmers. Models hallucinate and make mistakes sometimes subtle sometimes not. Can you think of a way to prevent data scientists from having to be expert code reviewers? I feel like taking away the code gives them the chance to find and fix mistakes in their reasoning but I have no evidence for that. [amirathi] Really cool. If somebody doesn't want to adopt a new platform, take a look at open source Jupyter MCP Server[1]. Once integrated with Claude, it can execute code on the live notebook kernel. I just let Claude write notebooks, run top to bottom, debug & fix errors & only ping me when everything is working. [1] https://github.com/datalayer/jupyter-mcp-server [trymamboapp] "AI saves analysis as notebooks" is fighting the wrong fight ig. The reproducibility issue with notebooks isn't the format. it's out-of-order cell execution and silent kernel state llm generation makes that worse: the model has no memory of what state existed when it wrote cell 7, and neither does the user.
Vibe coding on the Databricks free addition
Has anyone used Genie code on Databricks free addition? Have you faced any issues. Is it better to use something like Claude/ Cursor ( I have a subscription already) in combination with AI Dev Kit on the free addition to not hit the rate limits?
Genie Spaces - What do you think?
I've been having a lot of success using Claude Cowork with the Databricks AI Dev Kit to create Genie Spaces that have fully developed Knowledge Stores set up. It saves so much time! The industry I'm in means I need lots of the same space except for a specific facility with all of its own permissions and slightly different context. They are early days so we will see how the users respond, but so far they love them. With the new push to Genie as a whole, I can see us getting a lot of use out of the mobile app paired with Genie's capabilities. What are the thoughts here in the community on Genie Spaces and what do you use them for?
MLflow 3.12.0rc0 introduces enhanced AI agent development features, including automatic tracing for more AI coding assistants and OpenClaw, along with new AI Gateway Guardrails for safety checks. It also adds multimodal trace attachments for images and audio in the UI, and a new mlflow.diffusers flavor for saving and serving diffusion models.
The SDK now automatically detects AI coding agents and appends agent information to HTTP request headers, while also removing the unused `experimentalIsUnifiedHost` field from `DatabricksConfig`. A bug fix addresses `X-Databricks-Org-Id` header issues for `SharesExtImpl.list()` on SPOG hosts, and several API changes introduce new services like `secretsUc()` and `supervisorAgents()`, along with breaking changes to method paths for various update operations.
WorkspaceExt upload/download and SharesExt list now include the X-Databricks-Org-Id header for SPOG host compatibility. WorkspaceClient.get_workspace_id avoids an API call when the workspace ID is already known, fixing a failure on SPOG hosts.
NewsGenAI - For Data Engineers Agenda & Introduction | LLM & Agentic AI | LangChain & LangGraph | Claude
This video introduces a new course, "GenAI for Data Engineers," designed to teach data engineers how to leverage generative AI, LLMs, and agentic AI. The course covers basics of LLMs, building agents with LangChain and LangGraph, using Cloud Code, and applying agentic AI within Databricks and data engineering workflows.
This release adds new workspace-level services for supervisor agents and Unity Catalog secrets, along with an update method for tokens. Several API methods for data classification, environments, knowledge assistants, Postgres, and warehouses have breaking changes due to path modifications.
This release adds Azure MSI authentication and improves `.databrickscfg` profile resolution. It also fixes issues with non-JSON error responses and Databricks CLI token scope mismatches.
MLflow 3.11.1 introduces AI-powered issue detection for agent traces, budget alerts and limits for AI Gateway spending, and a new interactive graph view for visualizing trace hierarchies. It also enhances security with pickle-free model serialization and improves dependency management with native UV support.
Testing and Refining Claude Code Skills with MLflow
MLflow tracing and LLM judges can now test Claude Code skills. This enables a self-improvement loop where Claude Code refines its own abilities.
TutorialsDatabricks AI Dev Kit Demo - Install, DataGen, SDP, Dashboard
The video demonstrates installing the Databricks AI Dev Kit on a Mac, then uses it to generate synthetic data, create serverless Spark declarative pipelines for a medallion architecture, and build a Databricks dashboard based on the generated data. It highlights how the AI Dev Kit leverages skills and an MCP server to automate these development tasks.
The SDK now automatically detects AI coding agents and appends `agent/<name>` to HTTP request headers. New `DisableGovTagCreation` fields were added to `settings.RestrictWorkspaceAdminsMessage` and `settingsv2.RestrictWorkspaceAdminsMessage`.
The SDK now automatically detects AI coding agents and appends agent information to HTTP request headers. New fields were added to restrict workspace admin tag creation in settings messages.
This release introduces AI-powered issue identification for agent traces, budget alerts for AI Gateway spending, and a new interactive graph view for visualizing trace hierarchies. It also includes pickle-free model serialization for enhanced security and native OpenTelemetry GenAI convention support for trace export.
NewsOpenClaw, Databricks Agentic Data Monitoring & more! | AI Newsround - February 2026 | Advancing AI
The video discusses OpenClaw, an open-source framework for AI agents, and Databricks' new agentic data quality monitoring solution. It also introduces Advancing Analytics' Lake Forge and Pantheon, a framework and AI layer for developing scalable Lake Flow pipelines, and highlights new model releases from Anthropic, Google, and OpenAI.
MLflow 3.10.0 introduces multi-workspace support for organizing experiments and models, alongside new GenAI features like multi-turn evaluation, LLM cost tracking, and AI Gateway usage analytics. The UI has been redesigned for improved navigation, and in-UI trace evaluation is now supported.
ReleasesIntroducing Databricks AI Dev Kit - Skills, MCP server, Builder App
The Databricks AI Dev Kit provides agent skills, an MCP server, and a Builder App to enhance AI-driven development on Databricks. It allows users to integrate AI coding tools with Databricks best practices, extending LLM capabilities through specialized functions and offering a chat-based interface for building applications.
5 Tips to Get More Out of Your Claude Code with MLflow
MLflow now offers an MCP server, CLIs, and Skills to extend Claude Code, enabling you to trace tokens and monitor tool usage. These five tips will help you transform your Claude coding agent into a transparent and controllable workflow.
v.3.9.0
MLflow 3.9.0 introduces an in-product MLflow Assistant chatbot and a Trace Overview Dashboard for GenAI experiments, enhancing debugging and performance insights. The AI Gateway is revamped for direct tracking server integration, alongside new LLM judge features for online monitoring and custom prompt building.
MLflow 3.9.0rc0 introduces an in-product AI Assistant for debugging and a new Trace Overview Dashboard for GenAI experiments. The AI Gateway is now integrated into the tracking server, and users can configure LLM judges for online monitoring and build custom judges directly in the UI.
NewsClaude Code: 5 Essentials for Data Engineering
The video introduces five essential concepts for using Claude Code in data engineering: the cloud.mmd file for core project information, skills for packaging expertise, commands for predefined prompts, sub-agents for focused tasks, and Model Context Protocol (MCP) for standardized tool interaction. These components help manage context and memory for effective AI-enhanced development.
TutorialsDatabricks + Cursor IDE: Step-by-Step AI Coding Tutorial
The video demonstrates using Cursor IDE for AI-enhanced Databricks development, focusing on setting up Databricks Connect and leveraging Cursor rules and context for efficient code generation and testing. It shows how to structure projects, write Python and PySpark code, and create unit tests, highlighting the importance of providing clear instructions to the AI agent.
EventsThe Future of AI Agents with Dario Amodei, Co-founder and CEO, Anthropic at Data + AI Summit
Unity Catalog AI 0.2.0
This release introduces new integrations for Gemini and LiteLLM, enabling Unity Catalog functions as tools for these models. The Databricks client now exclusively supports serverless endpoints, adds new APIs for function wrapping, and includes support for `requirements`, `environment_version`, and `Variant` types.
Unity Catalog AI 0.1.0
This initial release introduces Unity Catalog AI, providing a core client for managing and executing Unity Catalog functions as GenAI tools. It includes integration packages for popular AI frameworks like LangChain, LlamaIndex, OpenAI, Anthropic, CrewAI, and AutoGen, enabling seamless use of UC functions within these applications.
