Big Insights from Your Big Data using Data Virtualization

Presented by

Pablo Alvarez, Director of Product Management, Denodo

About this talk

Data lakes have become a popular architecture for modern analytics and data science. However, replicating all corporate data into giant data lakes is not feasible. Data volumes are too high, and replication to multiple systems creates brittle point-to-point connections. Out-of-sync data and uncontrolled replication lead to “data swamp” scenarios.

A logical approach on top of the physical data lake is more feasible: a logical layer that connects different systems (the data lake among them) and exposes them as one. The complexity of the back-end systems is hidden from the end user, and security, governance and auditing are centralized again. As data volumes have grown, optimization techniques have evolved to keep pace. Techniques like complex query rewriting, on-the-fly data movement between sources, and MPP capabilities provide the processing muscle to run these queries efficiently.

Attend this session to learn:

* How a logical data lake can overcome some of the issues of data lakes
* How MPP acceleration and other optimization techniques designed for large data volumes work
* How Denodo customers have implemented logical data lakes to improve the value of their Hadoop investments
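To make the logical-layer idea concrete, here is a minimal Python sketch. It is not Denodo's implementation or API: the `LogicalLayer` class, the `make_source` helper, the `sales` table and the sample rows are all hypothetical, and two in-memory SQLite databases stand in for a data lake and a second back-end system. It illustrates one simple form of query rewriting, aggregation pushdown, where each source computes its own partial sums and only those small results reach the logical layer.

```python
import sqlite3


def make_source(rows):
    """Create an in-memory SQLite database standing in for one back-end system."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)
    return conn


class LogicalLayer:
    """Hypothetical facade that exposes several sources as one 'sales' view."""

    def __init__(self, sources):
        self.sources = sources

    def total_by_region(self):
        # Push the GROUP BY down to each source so that only partial
        # aggregates, not raw rows, are moved to the logical layer.
        query = "SELECT region, SUM(amount) FROM sales GROUP BY region"
        totals = {}
        for conn in self.sources:
            for region, partial in conn.execute(query):
                totals[region] = totals.get(region, 0.0) + partial
        return totals


# Illustrative data only: two systems holding overlapping regions.
data_lake = make_source([("EMEA", 100.0), ("APAC", 50.0), ("EMEA", 25.0)])
warehouse = make_source([("EMEA", 10.0), ("AMER", 40.0)])

layer = LogicalLayer([data_lake, warehouse])
result = layer.total_by_region()
print(result)
```

The caller sees a single `total_by_region()` call and never learns how many systems sit behind it, which is the sense in which back-end complexity is hidden. A real data virtualization engine would choose between strategies like this one using cost estimates and source capabilities.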
More from this channel

For IT professionals who focus on data integration and enterprise data management and are overwhelmed by growing data volumes and a growing number of data types, data virtualization provides agile, real-time access to disparate sources and integrates them with ease. For business professionals, data virtualization brings agile information access that in turn drives business agility. The webcasts in this channel, provided by Denodo, the leader in data virtualization, cover the latest common usage patterns, use cases, best practices and strategies for driving business value with data virtualization.