Skip to content
brickster.ai
All videos
newsDatabricks·July 27, 2023

The Future is Open: Data Streaming in an Omni-Cloud Reality

Description

This session begins with data warehouse trivia and lessons learned from production implementations of multicloud data architecture. You will learn to design future-proof low latency data systems that focus on openness and interoperability. You will also gain a gentle introduction to Cloud FinOps principles that can help your organization reduce compute spend and increase efficiency.  Most enterprises today are multicloud. While an assortment of low-code connectors boasts the ability to make data available for analytics in real time, they post long-lasting challenges: - Inefficient EDW targets - Inability to evolve schema - Forbiddingly expensive data exports due to cloud and vendor lock-in The alternative is an open data lake that unifies batch and streaming workloads. Bronze landing zones in open format eliminate the data extraction costs required by proprietary EDW. Apache Spark™ Structured Streaming provides a unified ingestion interface. Streaming triggers allow us to switch back and forth between batch and stream with one-line code changes. Streaming aggregation enables us to incrementally compute on data that arrives near each other. Specific examples are given on how to

Description from YouTube. Full content on the video page.

More from Databricks