OpenAI
Recent items mentioning OpenAI across the Databricks ecosystem — releases, news, videos, and community Q&A. Updated hourly.
Databricks has significantly deepened its integration with OpenAI, bringing GPT-5.5 and Codex natively into the platform, fully governed by Unity AI Gateway for enterprise agent workflows and secure data querying via Genie 1345. This partnership focuses on enabling enterprises to deploy and adopt AI with enhanced planning capabilities and leading performance in office knowledge work benchmarks 45. The integration provides robust governance for permissions, cost controls, guardrails, and observability for these advanced models 3.
Generated daily from the 5 most recent items mentioning OpenAI. Click any [N] to jump to the source.
Databricks brings OpenAI GPT‑5.5 to enterprise agent workflows
MLflow 3.12.0 introduces multimodal tracing, allowing storage and rich rendering of PDFs, audio, and images as artifact attachments in tracing spans. It also adds AI Gateway guardrails to prevent unsafe model inputs/outputs and extends coding agent tracing support to Codex, Gemini, and Qwen.
Show HN: Mljar Studio – local AI data analyst that saves analysis as notebooks
Hi HN, I’ve been working on mljar-supervised (open-source AutoML for tabular data) for a few years. Recently I built a desktop app around it called MLJAR Studio. The idea is simple: you talk to your data in natural language, the AI generates Python code, executes it locally, and the whole conversation becomes a reproducible notebook (*.ipynb file). So instead of just chatting with data, you end up with something you can inspect, modify, and rerun. What MLJAR Studio does: - Sets up a local Python environment automatically, runs on Mac, Windows, and Linux - Installs missing packages during the conversation - Built-in AutoML for tabular data (classification, regression, multiclass) - Works with standard Python libraries (pandas, matplotlib, etc.) - Works with any data file: CSV, Excel, Stata, Parquet ... - Connects to PostgreSQL, MySQL, SQL Server, Snowflake, Databricks, and Supabase. For AI: use Ollama locally (zero data egress), bring your own OpenAI key, or use MLJAR AI add-on. I built this because I wanted something between Jupyter Notebook (flexible but manual) and AI tools that generate code but don’t preserve the workflow. Most tools I tried either hide too much or don’t give reproducible results and are cloud based Demos: - 60-second demo: https://youtu.be/BjxpZYRiY4c - Full 3-minute analysis: https://youtu.be/1DHMMxaNJxI Pricing is $199 one-time, with a 7-day trial. Curious if this is useful for others doing real data work, or if I’m solving my own problem here. Happy to answer questions. --- top comments --- [MSaiRam10] Notebooks as the output format is funny because notebooks are famously bad for reproducibility. Out of order execution, hidden state, etc. You're solving "chat isn't reproducible" with a format that also isn't really [hasyimibhar] How does this compare to open source Deepnote[0]? We use the cloud version (BYOC) at my previous company to replace self-hosted Jupyter notebooks, and it's pretty great. [0] https://github.com/deepnote/deepnote [2ndorderthought] This is one of those product areas I would call high-risk without a human in the loop. So I am glad you kept a person in the loop. It's really easy to lose tons of money making decisions based on bad statistics or models. Anyone remember how much money zillow lost because of automatic time series models? I do have concerns about the workflow. Data people aren't usually the best programmers. Models hallucinate and make mistakes sometimes subtle sometimes not. Can you think of a way to prevent data scientists from having to be expert code reviewers? I feel like taking away the code gives them the chance to find and fix mistakes in their reasoning but I have no evidence for that. [amirathi] Really cool. If somebody doesn't want to adopt a new platform, take a look at open source Jupyter MCP Server[1]. Once integrated with Claude, it can execute code on the live notebook kernel. I just let Claude write notebooks, run top to bottom, debug & fix errors & only ping me when everything is working. [1] https://github.com/datalayer/jupyter-mcp-server [trymamboapp] "AI saves analysis as notebooks" is fighting the wrong fight ig. The reproducibility issue with notebooks isn't the format. it's out-of-order cell execution and silent kernel state llm generation makes that worse: the model has no memory of what state existed when it wrote cell 7, and neither does the user.
MLflow 3.12.0rc0 introduces enhanced AI agent development features, including automatic tracing for more AI coding assistants and OpenClaw, along with new AI Gateway Guardrails for safety checks. It also adds multimodal trace attachments for images and audio in the UI, and a new mlflow.diffusers flavor for saving and serving diffusion models.
Serverless Notebooks and Jobs Environment Variables [let's design this together]
**Quick scenario**: you spend time getting your notebook working. `OPENAI_API_KEY` is set, `PIP_EXTRA_INDEX_URL` is pointing to your private registry, everything runs. You click Schedule. The job fails. Env var not found. Should these be set at the workspace, folder, or user? Sound familiar? If so, we should be friends. It's Justin Breese (PM at Databricks) and I am back to chat about dependency management - researching how to make environment variables work seamlessly across serverless notebooks and jobs - so clicking Schedule just works, no extra config, no surprises. Want them to work cross workspace, project, etc.? **I want to talk to you if:** * 🔁 You re-set env vars every session because they don't persist * 💥 You've had a notebook-to-job failure caused by a missing env var * 🔐 Managing API keys or credentials in notebooks feels more manual than it should * 🏢 You're a workspace admin who wants to set shared config (pip registry, endpoints) once for everyone Options: 1. 30 minutes, no prep needed. Grab time here: [https://calendar.app.google/CxxpHKBWvxRVQM7i9](https://calendar.app.google/CxxpHKBWvxRVQM7i9) 2. Email me direct feedback and tons of context: [j@databricks.com](mailto:j@databricks.com) 3. Messenger pigeon: Send one to me? 4. Or just drop a comment - even a "yes this is a pain" tells me something useful. Thanks!
OpenAI GPT-5.5 + Codex, now available and fully-governed in Databricks
OpenAI GPT-5.5 + Codex, now available and fully-governed in Databricks
GPT-5.5 and Codex are now natively available in Databricks, fully governed by Unity AI Gateway for permissions, cost controls, guardrails, and observability. This enables agent building with GPT-5.5 and natural language querying of enterprise data via Genie.
ReleasesHow OpenAI and Databricks are working together
Databricks and OpenAI are partnering to help enterprises deploy and adopt AI, with Databricks focusing on secure data access and management for AI applications through products like Genie and AI Gateway. The video highlights GPT 5.5's enhanced planning capabilities and its leading performance in office knowledge work benchmarks, demonstrating its impact beyond coding to automate internal business processes.
Unity Catalog AI 0.4.0
DatabricksFunctionClient now supports an optional warehouse_id for function execution, enabling use in workspaces without serverless compute. Python 3.10+ is now required, and several bug fixes address Gemini toolkit, LangGraph, and OSS client function creation issues.
Databricks partners with OpenAI on GPT-5.5
GPT-5.5 and Codex are coming soon to Databricks, governed by Unity AI Gateway, and cut OfficeQA Pro errors nearly in half. This partnership with OpenAI brings advanced models directly to Databricks users.
MLflow 3.11.1 introduces AI-powered issue detection for agent traces, budget alerts and limits for AI Gateway spending, and a new interactive graph view for visualizing trace hierarchies. It also enhances security with pickle-free model serialization and improves dependency management with native UV support.
ReleasesDatabricks Genie Code, Carl, Bull**** Bench & more! | AI Newsround - March '26 | Advancing Analytics
The video discusses Databricks' new AI tools, Genie Code for autonomous data work and Carl for faster, cost-efficient enterprise knowledge agents using custom reinforcement learning. It also covers the Bench V2 for evaluating AI models' ability to detect and push back on nonsense, along with updates to various models like Qwen 3.5, Gemini 3.1 Flashlight, and OpenAI's GPT-5.3 Instant, 5.4, Mini, and Nano, highlighting their focus on agent capabilities and cost-efficiency.
This release introduces AI-powered issue identification for agent traces, budget alerts for AI Gateway spending, and a new interactive graph view for visualizing trace hierarchies. It also includes pickle-free model serialization for enhanced security and native OpenTelemetry GenAI convention support for trace export.
NewsOpenClaw, Databricks Agentic Data Monitoring & more! | AI Newsround - February 2026 | Advancing AI
The video discusses OpenClaw, an open-source framework for AI agents, and Databricks' new agentic data quality monitoring solution. It also introduces Advancing Analytics' Lake Forge and Pantheon, a framework and AI layer for developing scalable Lake Flow pipelines, and highlights new model releases from Anthropic, Google, and OpenAI.
Delta Lake 4.1.0
Delta Lake 4.1.0 enhances Unity Catalog integration with improved support for catalog-managed tables, including atomic CTAS and conflict-free feature enablement for Deletion Vectors and Column Mapping. It also introduces a new Spark V2 connector based on Delta Kernel API for streaming reads and server-side planning capabilities.
v.3.9.0
MLflow 3.9.0 introduces an in-product MLflow Assistant chatbot and a Trace Overview Dashboard for GenAI experiments, enhancing debugging and performance insights. The AI Gateway is revamped for direct tracking server integration, alongside new LLM judge features for online monitoring and custom prompt building.
NewsDatabricks: What’s new in October 2025 #databricks news
Databricks introduces Databricks One, a new business-focused experience with consumer access for dashboards and Genie, alongside updates to Genie for defining relations and extended API endpoints. The platform also adds features like easy conversion of external to managed tables, enhanced Databricks Asset Bundles with policy integration and script execution, and new system tables for MLflow tracking and data classification results.
Unity Catalog AI 0.3.0
Functions now execute in a safer sandboxed process by default, with a new local development mode for easier debugging. You can now retrieve UC-registered Python functions as direct Python callables or their source code, and DatabricksFunctionClient connection reliability has improved.
Unity Catalog AI 0.1.0
This initial release introduces Unity Catalog AI, providing a core client for managing and executing Unity Catalog functions as GenAI tools. It includes integration packages for popular AI frameworks like LangChain, LlamaIndex, OpenAI, Anthropic, CrewAI, and AutoGen, enabling seamless use of UC functions within these applications.




