Vector Search
Recent items mentioning Vector Search across the Databricks ecosystem — releases, news, videos, and community Q&A. Updated hourly.
Databricks has significantly enhanced its Vector Search capabilities, now offering a comprehensive five-stage RAG workflow that includes careful selection of embedding models, vector database indexing, and hybrid search for connecting LLMs to external knowledge bases 2. The platform's AI Search further improves relevance with reranking, metadata filtering, and automated index updates 3. Recent SDK updates across Python and Java have introduced new fields and services for managing Vector Search indexes and AI Search, streamlining development for practitioners 4689.
Generated daily from the 10 most recent items mentioning Vector Search. Click any [N] to jump to the source.
End-to-End RAG Workflow: How Retrieval Augmented Generation Works
Databricks now offers a five-stage RAG workflow for connecting LLMs to external knowledge bases, enabling accurate, domain-specific answers without model retraining. Production RAG requires careful selection of embedding models, vector database indexing, chunking strategies, and hybrid search, with independent evaluation of retrieval precision and generation faithfulness.
What is Vector Search?
Vector search retrieves information based on meaning and context using embeddings, solving keyword-only search limitations for use cases like RAG and recommendations. Databricks AI Search adds reranking, metadata filtering, and automated index updates to improve relevance and simplify operations for production systems.
This release adds new methods for managing PostgreSQL data APIs at the workspace level. It also introduces `serverless_compute_id` fields for various Delta Live Tables pipeline operations and `endpoint_id` for Vector Search indexes.
EventsRecap of product announcements from Data + AI Summit 2026 | Day 1
Databricks announced several new products and features at the Data + AI Summit 2026, Day 1, including the Genetic Data Foundation, Lakehouse RT, Lake Base with disaster recovery, Lake Flow, Genie Ontology, Unity AI Gateway, Omnigent, and various Genie agents (Genie 1, Genie Code, Genie Agents). They also introduced new applications like Lake Watch for SIM and Customer Lake for CP.
SSH connection error messages are improved with server logs, and the SSH server startup timeout for GPU accelerators is increased. Authentication fallback to default profiles is fixed, and bundle variable references now support Unicode letters.
The Databricks SDK for Java now supports CRUD operations for Postgres data APIs and includes new fields for Azure compute attributes, synced table specifications, and vector search indexes. A breaking change makes the `resourceId` field optional for bundle deployment operations.
HealthCare Prior Authorizations with Databricks Lakebase Vector Search
The direct deployment engine is now Generally Available and the default for new Databricks deployments, with options to opt out or migrate existing deployments. New CLI commands include `databricks quickstart` for an introduction and `databricks version --check` to report available updates.
This release adds new services for AI Search and Bundle Deployments, along with numerous new fields across various existing services like Catalog, ML, and Vector Search. It also includes a breaking change by removing the `bundle` package and its associated service.
This release introduces new services for AI Search and Bundle Deployments, along with numerous new fields across existing services like Catalog, ML, and Vector Search. The `bundle` package and its associated workspace service have been removed.
Bring Databricks into Kiro IDE with the AI Dev Kit Power
The Databricks AI Dev Kit Power now offers a one-click setup to integrate Kiro IDE with the full Databricks platform, providing AI-assisted development grounded in your workspace's Unity Catalog metadata. This new path, alongside a lighter PAT-based option, ensures your AI assistant writes SQL with actual columns and respects all row, column, and tag-based grants.
You can now manage Git credentials for service principals and permissions for Agent Bricks resources. Key fixes include proper updates for metastore external access, reliable destruction of UC objects, and an increased timeout for vector search index creation.
New Databricks Certificate Announcement
Databricks has officially launched a brand-new **Databricks Context Engineer Associate** currently available in **Beta** This exam focuses on designing and managing the context layer for AI agents, including: • Prompt & instruction engineering • Retrieval systems & Vector Search • Agent memory architectures • Tool integration using MCP • Governance, PII handling & policy enforcement • Multi-agent workflows and context optimization A very interesting move as Context Engineering becomes a critical skill in building reliable enterprise AI systems. The beta exam will be delivered live at DAIS 2026.
Bundle users now receive a suggestion to set `bundle.engine: direct` in `databricks.yml` when using direct-only resources with the Terraform engine. Vector search indexes are now supported as a bundle resource (direct engine only), including UC grants and destructive operation confirmations.
Document Intelligence on Databricks
80% of enterprise data is locked inside PDFs, scans, emails and contracts and most teams still treat it as someone else's problem. Document Intelligence on Databricks changes that. One SQL function (ai\_parse\_document), governed by Unity Catalog, integrated with Lakeflow for ingestion, Agent Bricks for structured extraction, and Vector Search for RAG. No stitched-together OCR vendors, no brittle Python glue, no separate platform to govern. I put together with [Archika Dogra](https://www.linkedin.com/in/archikadogra/) a walkthrough showing how it actually works end-to-end from a folder of raw PDFs to queryable Delta tables and downstream agents. ▶️ [https://youtu.be/sdG73gI143c](https://youtu.be/sdG73gI143c) Curious to hear what use cases you're tackling invoices, contracts, claims, technical docs? Drop them in the comments.
Context Engineer Associate Beta Ex︁am + free attempt at DAIS
Context engineering is quickly becoming one of the key skills for building reliable AI agent systems. Databricks has just introduced the **Databricks Context Engineer Associate** **Ex︁am**, focused on designing, assembling, and governing the information AI agents receive at inference time - including prompts, retrieval systems, memory, tools, governance, and evaluation. The ex︁am is currently available as a **live beta at Data + AI Summit 2026**, and Databricks states that **one free onsite exam attempt will be offered during Summit**. Walk-ins only, one per attendee. Great opportunity for anyone working with GenAI, AI agents, Vector Search, Unity Catalog, MLflow, MCP, or Lakebase. [https://www.databricks.com/learn/certification/context-engineer-associate](https://www.databricks.com/learn/certification/context-engineer-associate)
NewsMay 2026 Databricks Updates: No Code ETL, New GPUs and Death of the Dashboard
Databricks announced several updates including AI Prep Search for document chunking and vector database preparation, SQL vector functions for embedding mathematics, and the general availability of multi-table transactions. They also introduced Lakeflow Designer for visual, no-code data pipeline creation and updated their serverless GPU offerings to include H100s.
This release fixes an issue where the Databricks CLI `--profile` fallback was broken. It also introduces new API methods for `workspaceClient.supervisorAgents()` and `workspaceClient.vectorSearchEndpoints()`, along with several breaking changes related to `Example` and `Tool` fields, and `minQps` in Vector Search endpoints.
This release introduces new methods for managing supervisor agents and vector search endpoints, along with several new fields for connector options, ingestion sources, and customer-managed keys. Breaking changes include the removal of `min_qps` fields from vector search endpoint configurations and making `guidelines` and `description` fields optional in certain services.
NewsDatabricks News: watermark-based incremental ingestion, MCP in AI gateway, Genie, Vector Search
Databricks now offers watermark-based incremental ingestion from SQL databases without change data feed, allowing for efficient data updates and soft deletion handling. The AI Gateway supports custom MCP servers, enabling integration with external APIs like GitHub for enhanced AI application development.
TutorialsLakebase and PG Vector: Vector Search of the Future?
The video demonstrates how to implement vector search using Lakebase and PG Vector within Databricks, focusing on two patterns: Lakebase native and reverse ETL from the lakehouse. It walks through setting up a maintenance co-pilot application that leverages PG Vector for semantic search, joins, and filtering on maintenance logs, showcasing the process from data embedding to app deployment and job scheduling for continuous updates.
The CLI now supports a --limit flag for paginated list commands and caches host metadata lookups for faster repeated invocations. Bundles gain support for Vector Search Endpoints and prompt before destroying Lakebase resources.
Benchmark Your Way to Better RAG and Agents:Tuning Vector Search with MLflow
High-level summary: problems, approaches, and takeways for better RAG with MLflow
NewsTurbo-Charge your Agents with instant MCP in Databricks
The video demonstrates how to use Model Context Protocol (MCP) in Databricks to give AI agents "superpowers" by enabling them to interact with various tools and data sources. It shows how to easily set up MCP servers within Databricks to connect agents to Unity Catalog functions, vector search, external APIs, and even marketplace MCP services, all without extensive coding.
NewsDatabricks Breaking News: Week 50: 8 December 2025 to 14 December 2025 #databricks news
Databricks now supports native reading and writing of Excel files in PySpark, SQL, and Autoloader, including features like sheet listing and range targeting. Additionally, Databricks Runtime 18 is available in beta, introducing improvements for streaming queries and new system columns for job tables, alongside a new Legase experience with project and branching capabilities for transactional databases.
