Intended for software engineers, developers, and technical leads who develop Spark applications, this webinar discusses the results of analyzing many Spark jobs on many multi-tenant production clusters, the common issues seen, the symptoms of those issues, and how developers can address them. Pepperdata has gathered trillions of performance data points on production clusters running Spark, covering a variety of industries, applications, and workload types.
Presenter Vinod Nair will talks about key performance insights — best and worst practices, gotchas, and tuning recommendations — based on analyzing the behavior and performance of millions of Spark applications. In addition, Vinod will describe how we are turning these learnings into heuristics leveraged from the open source Dr. Elephant project.
This webinar is followed by a live Q & A. A replay of this webinar will be available within 24 hours at https://www.pepperdata.com/resources/webinars/.