Securing Apache Spark Big Data Operations

Presented by

Rob Gibbon – Product Manager, and Massimiliano Gori – Senior Information Security Lead

About this talk

A holistic approach to securing Spark-based data engineering: Apache Spark is an open source toolkit that helps users develop parallel, distributed data engineering and machine learning applications and run them at scale. In this webinar, Rob Gibbon – product manager, and Massimiliano Gori – senior information security lead, will survey the state of big data security best practices and outline both high level architectures and pragmatic steps that you can take to secure your Spark applications – wherever they may be running. We will cover: An introduction to Apache Spark - what it is and how it works Motives and techniques of bad actors How to identify and prioritise security requirements. Pragmatic steps to secure Spark based on Kubernetes and object storage.
Related topics:

More from this channel

Upcoming talks (8)
On-demand talks (411)
Subscribers (165704)
Get the most in depth information about the Ubuntu technology and services from Canonical. Learn why Ubuntu is the preferred Linux platform and how Canonical can help you make the most out of your Ubuntu environment.