Data Visualization & Exploration

A key task to create appropriate analytic models in machine learning or deep learning is the integration and preparation of data sets from various sources like files, databases, big data storages, sensors or social networks. This step can take up to 50% of the whole project. This session compares different alternative techniques to prepare data, including extract-transform-load (ETL) batch processing, streaming analytics ingestion, and data wrangling within visual analytics. Various options and their trade-offs are shown in live demos using different advanced analytics technologies and open source frameworks such as R, Python, Apache Spark, Talend or KNIME. The session also discusses how this is related to visual analytics, and best practices for how the data scientist and business user should work together to build good analytic models. Key takeaways for the audience: - Learn various option for preparing data sets to build analytic models - Understand the pros and cons and the targeted persona for each option - See different technologies and open source frameworks for data preparation - Understand the relation to visual analytics and streaming analytics, and how these concepts are actually leveraged to build the analytic model after data preparation

February 15, 2017


Related Webinars

February 16, 2017 4:00pm EST

Predictive Analysis has become a game-changing component for today’s forward-looking organizations seeking to gain a competitive advantage. This functionality has…

February 16, 2017 1:00pm EST

If a volcano erupts in Iceland, why is Hong Kong your first supply chain casualty? And how do you figure…

February 16, 2017 11:00am EST

Continuous streams of data are generated in every industry from sensors, IoT devices, business transactions, social media, network devices, clickstream…