StreamingSee on /pulse →

Structured Streaming

Recent items mentioning Structured Streaming across the Databricks ecosystem — releases, news, videos, and community Q&A. Updated hourly.

28 recent items1 release26 videos1 community thread

What's happening in Structured StreamingAI synthesis · updated May 2026

Recent developments highlight the growing maturity of Spark's Structured Streaming Real-Time Mode (RTM), demonstrating significant latency reductions. RTM achieves P50 and P95 latencies of 26ms and 50ms respectively in simplified setups 2, and enables sub-second end-to-end latency for complex applications like real-time air traffic control 3. The community is actively exploring RTM for millisecond streaming pipelines with Kafka 1.

Generated daily from the 3 most recent items mentioning Structured Streaming. Click any [N] to jump to the source.

delta-io/delta

v4.3.0

Delta Lake 4.3.0

Databricks practitioners can now integrate Spark with the Unity Catalog Delta REST API for managed Delta tables and selectively replace data using new `replaceOn` and `replaceUsing` DataFrame APIs. UniForm for Iceberg conversion is now atomic and incremental, and Delta Sharing supports streaming and Change Data Feed for shared tables.

2w ago

RedditTutorial

Building a Spark Streaming Real-Time Mode (RTM) Pipeline — Millisecond Streaming with Kafka

I recently built a fully working real-time transaction enrichment pipeline using PySpark RTM paired with Kafka, achieving end-to-end latency in the milliseconds. The article covers: \- Real-Time Mode (RTM) fundamentals \- Kafka integration with Spark Structured Streaming \- Millisecond-latency pipeline architecture \- Real-time transaction enrichment patterns Blog: https://blog.devgenius.io/building-a-spark-streaming-real-time-mode-rtm-pipeline-millisecond-streaming-with-kafka-dda74e9ef284

72databuff_161mo ago

Tutorials

Apache Spark Streaming Real-Time Mode - Latency Demo

The video demonstrates how to deploy and run Apache Spark Streaming in Real-Time Mode (RTM) using a declarative automation bundle. It shows that RTM significantly reduces P50 and P95 latencies compared to microbatch mode, achieving 26ms and 50ms respectively in a simplified setup without an external messaging bus.

Databricks2mo ago

Tutorials

Air Traffic Control with Apache Spark Structured Streaming Real-Time Mode

The video demonstrates building a real-time air traffic control application using Apache Spark Structured Streaming Real-Time Mode, Lakehouse, and Databricks Apps. This system processes live flight telemetry, detects congestion, and generates alerts with sub-second end-to-end latency, all within a single Databricks platform.

Structured Streaming

Building a Spark Streaming Real-Time Mode (RTM) Pipeline — Millisecond Streaming with Kafka

Apache Spark Streaming Real-Time Mode - Latency Demo

Air Traffic Control with Apache Spark Structured Streaming Real-Time Mode

Building Real-Time Sport Model Insights with Spark Structured Streaming

Unlock Your Use Cases: A Deep Dive on Structured Streaming’s New TransformWithState API

A Comprehensive Guide to Streaming on the Data Intelligence Platform

Crypto at Scale: Building a High-Performance Platform for Real-Time Blockchain Data

Supercharging Sales Intelligence: Processing Billions of Events via Structured Streaming

Real-Time Mode Technical Deep Dive: How We Built Sub-300 Millisecond Streaming Into Apache Spark™

PDF Document Ingestion Accelerator for GenAI Applications

Delivering Sub-Second Latency for Operational Workloads on Databricks

Introducing Simplified State Tracking in Apache Spark™ Structured Streaming

Race to Real-Time: Low-Latency Streaming ETL With Next-Gen OLTP-DB

Nebula: The Journey of Scaling Instacart’s Data Pipelines with Apache Spark™ and Lakehouse

The Future is Open: Data Streaming in an Omni-Cloud Reality

Event Driven Real-Time Supply Chain Ecosystem Powered by Lakehouse

Streaming Data Analytics with Power BI and Databricks

US Army Corp of Engineers Enhanced Commerce & National Sec Through Data-Driven Geospatial Insight

High Volume Intelligent Streaming with Sub-Minute SLA for Near Real-Time Data Replication

How We Made a Unified Talent Solution Using Databricks Machine Learning, Fine-Tuned LLM & Dolly 2.0

Structured Streaming: Demystifying Arbitrary Stateful Operations

Sponsored by: Avanade | Enabling Real-Time Analytics with Structured Streaming and Delta Live Tables

Top Mistakes to Avoid in Streaming Applications

Improving Apache Spark Application Processing Time by Configurations, Code Optimizations, etc.

Streaming Data into Delta Lake with Rust and Kafka

Streaming ML Enrichment Framework Using Advanced Delta Table Features

Spark Inception: Exploiting the Apache Spark REPL to Build Streaming Notebooks