Reynold Xin, Chief Architect at Databricks, and Jules Damji, Spark Community and Developer Advocate
The upcoming Spark 2.3 release marks a big step forward in speed, unification, and API support.
Reynold Xin and Jules Damji will walk through how you can benefit from the upcoming improvements:
- New DataSource APIs that enable developers to more easily read and write data for Continuous Processing in Structured Streaming.
- PySpark support for vectorized UDFs (Pandas UDFs, built on Apache Arrow), which let Python developers run native Python code on batches of rows instead of one row at a time.
- Improved performance by taking advantage of NVMe SSDs.
- Native Kubernetes support, marrying the best of container orchestration and distributed data processing.