How To Make Apache Spark on Kubernetes Run Reliably on Spot Instances
Description
Since the general availability of Apache Spark’s native support for running on Kubernetes with Spark 3.1 in March 2021, the Spark community is increasingly choosing to run on k8s to benefit of containerization, efficient resource-sharing, and the tools from the cloud-native ecosystem. Data teams are faced with complexities in this transition, including how to leverage spot VMs. These instances enable up to 90% cost savings but are not guaranteed to be available and face the risk of termination. This session will cover concrete guidelines on how to make Spark run reliably on spot instances, with code examples from real-world use cases. Main topics: • Using spot nodes for Spark executors • Mixing instance types & sizes to reduce risk of spot interruptions - cluster autoscaling • Spark 3.0: Graceful Decommissioning - preserve shuffle files on executor shutdown • Spark 3.1: PVC reuse on executor restart - disaggregate compute & shuffle storage • What to look for in future Spark releases Connect with us: Website: https://databricks.com Facebook: https://www.facebook.com/databricksinc Twitter: https://twitter.com/databricks LinkedIn: https://www.linkedin.com/company/data... Instagram:…
Description from YouTube. Full content on the video page.
More from Databricks
ReleasesDatabricks launches across the Data + AI stack in 90 seconds
Databricks announced LTAP to unify lakebased and lakehouse data, eliminating ETL and enabling a single copy of data for analytical and operational needs. They also introduced Unity AI Gateway for governance, Genie Ontology for enterprise knowledge graphs, and open-sourced Omniant for managing multiple coding agents.
ReleasesIntroducing Omnigent: The Ultimate Meta-Harness for AI Agents
Omnigent is a new open-source meta-harness for AI agents that provides a unified interface for composition, control, and collaboration across multiple models and agent workflows. It enables stateful, data-centric policies for guardrails and allows real-time sharing and steering of live agent sessions with teammates.
NewsHow DEFRA and Natural England Accelerate Peatland Restoration with AI and Databricks
DEFRA and Natural England utilize AI and Databricks to accelerate peatland restoration by automating the mapping of peatland features and peat dams across England. This technology significantly reduces the time required for mapping, enabling faster identification and restoration of these crucial carbon-storing habitats.
NewsAI Stack Explained in 3 Layers (LLM, Agent Harness, Omnigent)
The AI stack now includes a third layer, the meta harness, which sits above individual agent harnesses. This meta harness, exemplified by Databricks' open-sourced Omnigent, allows for routing queries to appropriate agents and orchestrating tasks across multiple agents, enabling seamless interaction and context sharing between them.
NewsWhat’s coming next to Free Edition
Databricks announces the availability of Genie, GPUs, Agent Hooks, Lakehouse, and Lake Flow Designer on its Free Edition. This update provides virtually all of Databricks' production platform features for free, enabling users to learn and build data and AI projects.
