ETL and big data: Building simpler data pipelines

Presented by

Paul Scott-Murphy

About this talk

In the traditional world of EDW, ETL pipelines are a troublesome bottleneck when preparing data for use in the data warehouse. ETL pipelines are notoriously expensive and brittle, so as companies move to Hadoop they look forward to getting rid of the ETL infrastructure. But is it that simple? Some companies are finding that in order to move data between clusters for backup or aggregation purposes, whether on-premises or to the cloud, they are building systems that look an awful lot like ETL.

Related topics:

More from this channel

Upcoming talks (0)
On-demand talks (39)
Subscribers (5606)
WANdisco is shaping the future of data infrastructure with its groundbreaking LIVE DATA platform, enabling companies to finally put all their data to work for the business - all the time, at any scale. Only WANdisco makes data always available, always accurate, and always protected, delivering hyperscale economics to support exponential data growth with the same IT budget.