Mixing Analytic Workloads with Greenplum and Apache Spark

Presented by

Kong Yew Chan, Product Manager, Pivotal

About this talk

Apache Spark is a popular in-memory data analytics engine because of its speed, scalability, and ease of use. It also fits well with DevOps practices and cloud-native software platforms. It’s good for data exploration, interactive analytics, and streaming use cases. However, Spark, like other data-processing platforms, is not one size fits all. Different versions of Spark support different feature sets, and Spark’s machine-learning libraries can also vary in important ways between versions, or may lack the right algorithm. In this webinar, you’ll learn: - How to integrate data warehouse workloads with Spark - Which workloads are better for Greenplum and for Spark - How to use the Greenplum-Spark connector We look forward to you joining the webinar.

Related topics:

More from this channel

Upcoming talks (0)
On-demand talks (29)
Subscribers (2738)
Showing customers how to manage data and deploy advanced analytics with diverse data locality and data types. We demonstrate new features, functionality, and product updates.