What's New in the Upcoming Apache Spark 2.3 Release?

Presented by

Reynold Xin, Chief Architect at Databricks, and Jules Damji, Spark Community and Developer Advocate

About this talk

The upcoming Spark 2.3 release marks a big step forward in speed, unification, and API support. Reynold Xin and Jules Damji from Databricks will walk through how you can benefit from the upcoming improvements:

- New DataSource APIs that enable developers to more easily read and write data for Continuous Processing in Structured Streaming.
- PySpark support for vectorization, giving Python developers the ability to run native Python code fast.
- Improved performance by taking advantage of NVMe SSDs.
- Native Kubernetes support, marrying the best of container orchestration and distributed data processing.
