Architectural Comparison of Apache Apex and Spark Streaming

Presented by

Thomas Weise, Co-Founder & Architect, PMC Member, Apache Apex.

About this talk

Apache Apex is a native Hadoop data-in-motion platform. In this presentation, we will discuss architectural differences between Apache Apex features with Spark Streaming. We will discuss how these differences effect use cases like ingestion, fast real-time analytics, data movement, ETL, fast batch, very low latency SLA, high throughput and large scale ingestion. We will cover fault tolerance, low latency, connectors to sources/destinations, smart partitioning, processing guarantees, computation and scheduling model, state management and dynamic changes. We will also discuss how these features affect time to market and total cost of ownership.

Related topics:

More from this channel

Upcoming talks (0)
On-demand talks (13)
Subscribers (2085)
DataTorrent, powered by Apache Apex, is the industry’s only open source enterprise-grade unified stream and batch platform.