Hi [[ session.user.profile.firstName ]]

Why Visualization is Critical for Data Wrangling

The big data era has lead to the proliferation of new technologies that enable a wider set of users to more effectively analyze and consume data to improve operations and decision making. We’ve moved beyond traditional dashboard visualizations as the sole method of consuming data, and instead are now incorporating data visualizations into nearly every aspect of the analysis process. The discussion will explore the considerations of designing data visualizations to enable fast exploration and profiling of data.
Recorded Jul 16 2015 54 mins
Your place is confirmed,
we'll send you email reminders
Presented by
Phil Vander Broek, User Experience Manager, Trifacta; Will Davis, Director of Product Marketing, Trifacta
Presentation preview: Why Visualization is Critical for Data Wrangling

Network with like-minded attendees

  • [[ session.user.profile.displayName ]]
    Add a photo
    • [[ session.user.profile.displayName ]]
    • [[ session.user.profile.jobTitle ]]
    • [[ session.user.profile.companyName ]]
    • [[ userProfileTemplateHelper.getLocation(session.user.profile) ]]
  • [[ card.displayName ]]
    • [[ card.displayName ]]
    • [[ card.jobTitle ]]
    • [[ card.companyName ]]
    • [[ userProfileTemplateHelper.getLocation(card) ]]
  • Channel
  • Channel profile
  • Unsupervised learning to uncover advanced cyber attacks Aug 22 2017 10:00 am UTC 45 mins
    Rafael San Miguel Carrasco, Senior Specialist, British Telecom EMEA
    This case study is framed in a multinational company with 300k+ employees, present in 100+ countries, that is adding one extra layer of security based on big data analytics capabilities, in order to provide net-new value to their ongoing SOC-related investments.

    Having billions of events being generated on a weekly basis, real-time monitoring must be complemented with deep analysis to hunt targeted and advanced attacks.

    By leveraging a cloud-based Spark cluster, ElasticSearch, R, Scala and PowerBI, a security analytics platform based on anomaly detection is being progressively implemented.

    Anomalies are spotted by applying well-known analytics techniques, from data transformation and mining to clustering, graph analysis, topic modeling, classification and dimensionality reduction.
  • Different Strategies of Scaling H2O Machine Learning on Apache Spark Apr 13 2017 3:00 pm UTC 45 mins
    Jakub Hava, Software Engineer at H2O.ai
    Sparkling Water integrates H2O, open source distributed machine learning platform, with the capabilities of Apache Spark. It allows users to leverage H2O’s machine learning algorithms with Apache Spark applications via Scala, Python, R or H2O’s Flow GUI which makes Sparkling Water a great enterprise solution.

    Sparkling Water 2.0 was built to coincide with the release of Apache Spark 2.0 and introduces several new features. One of the latest and largest features is the ability to configure Sparkling Water for different workloads, scale and optimize the platform according to your data and needs.

    In this talk we will introduce the basic architecture of Sparkling Water, go over different scaling strategies and explain the pros and cons of each solution. We will also compare the approaches with regards to the specific use cases and provide the rationale why or why not each solution may be a good fit for the desired use case.

    This talk will finish with a live demo demonstrating the mentioned approaches and should give you a real time experience of configuring and running Sparkling Water for your use case(s)!
  • Applied Data Science Apr 13 2017 10:00 am UTC 45 mins
    Giovanni Lanzani, Chief Science Officer at GoDataDriven
    Now that the Data Science hype is levelling out, many companies are wondering what went wrong as they could not extract values from their data science efforts.

    In this webinar we will explore what does it take to apply data science and machine learning in the real world.

    Key takeaways include:
    - How can you go beyond the traditional data warehouse when doing machine learning
    - How should you adapt your processes to keep monetizing on your data
    - How to close the feedback loop between your customers and your machine learning models
    - What kind of profiles are essential to successfully become a data driven organization
  • Tensorflow: Architecture and use case Apr 11 2017 8:00 am UTC 45 mins
    Gema Parreño Piqueras. AI product developer
    The webinar drives into the introduction of the architecture of Tensorflow and the designing of use case.

    You will learn:
    -What is an artificial neuron?
    -What is Tensorflow? What are its advantages? What's it used for?
    -Designing graphs in Tensorflow
    -Tips & tricks for designing neural nets
    -Use case
  • Analyse, Visualize, Share Social Network Interactions w Apache Spark & Zeppelin Mar 13 2017 3:00 pm UTC 45 mins
    Eric Charles, Founder at Datalayer
    Apache Spark for Big Data Analysis combined with Apache Zeppelin for Visualization is a powerful tandem that eases the day to day job of Data Scientists.

    In this webinar, you will learn how to:

    + Collect streaming data from the Twitter API and store it in a efficient way
    + Analyse and Display the user interactions with graph-based algorithms wi.
    + Share and collaborate on the same note with peers and business stakeholders to get their buy-in.
  • IT Relevance in the Self-Service Analytics Era Mar 9 2017 5:00 pm UTC 45 mins
    Kevin McFaul and Roberta Wakerell (IBM Cognos Analytics)
    There’s no denying the impact of self-service. IT professionals must cope with the explosive demand for analytics while ensuring a trusted data foundation for their organization. Business users want freedom to blend data, and create their own dashboards and stories with complete confidence. Join IBM in this session and see how IT can lead the creation of an analytics environment where everyone is empowered and equipped to use data more effectively.

    Join this webinar to learn how to:


    · Support the analytic requirements of all types of users from casual users to power users
    · Deliver visual data discovery and managed reporting in one unified environment
    · Operationalize insights and share them instantly across your team, department or entire organization
    · Ensure the delivery of insights that are based on trusted data
    · Provide a range of deployment options on cloud or on premises while maintaining data security
  • IT Analytics: How a Data Company uses Analytics to Improve IT Feb 24 2017 10:00 am UTC 45 mins
    Brian Smith - VP, Technical Operations, Tableau
    Today’s IT departments can’t simply provide IT solutions to other departments. Passively processing other departments’ requests is no longer sufficient to meet modern business needs, power company growth, and excel in a constantly changing marketplace. Instead, IT must strive to be the leading force and early adopters for information technology themselves.

    Join this live webinar to see how Tableau’s IT department uses analytics on a daily basis to analyse their own performance and improve their own efficiency.
  • Systems of Insight - The Next Generation of Business Intelligence Feb 21 2017 3:00 pm UTC 60 mins
    Boris Evelson - Forrester Principal Analyst
    Business intelligence has gone through multiple iterations in the past few decades. While BI's evolution has addressed some of the technology and process shortcomings of the earlier management information systems, BI teams still have a ways to go.

    From the laggards failing to transform enough of their structured and data into information and business insights, to the old guard of spreadsheet-based applications for key business decisions, companies have hit a rut. How do we solve this? With Systems of Insight.


    In this webinar, guest speaker, Boris Evelson, Vice President and Principal Analyst at Forrester, will show you how to get with the BI program by creating systems of insight. Doing so will allow you to:


    - Merge existing IT capabilities with demanding business requirements

    - Harness valuable customer and company data

    - Make decisions faster, and more pointed, to serve customers in the Age of the Customer
  • How to apply real-time Cloud analytics in just an hour Recorded: Feb 17 2017 60 mins
    Iver van de Zand, SAP Analytics Leader
    Cloud analytics has great momentum and that is for a reason: it allows for real-time and live analytics without needing to prepare an environment. In this webinar you will learn how to apply SAP Cloud analytics using BusinessObjects Cloud and the Digital Boardroom. Be amazed by the easiness’ of use and the great visualization capabilities.

    Iver van de Zand – SAP Analytics Leader – will provide a deep dive session on the modeling and visualization capabilities of this stunning product
  • Logistics Analytics: Predicting Supply-Chain Disruptions Recorded: Feb 16 2017 47 mins
    Dmitri Adler, Chief Data Scientist, Data Society
    If a volcano erupts in Iceland, why is Hong Kong your first supply chain casualty? And how do you figure out the most efficient route for bike share replacements?

    In this presentation, Chief Data Scientist Dmitri Adler will walk you through some of the most successful use cases of supply-chain management, the best practices for evaluating your supply chain, and how you can implement these strategies in your business.
  • Unlock real-time predictive insights from the Internet of Things Recorded: Feb 16 2017 60 mins
    Sam Chandrashekar, Program Manager, Microsoft
    Continuous streams of data are generated in every industry from sensors, IoT devices, business transactions, social media, network devices, clickstream logs etc. Within these streams of data lie insights that are waiting to be unlocked.

    This session with several live demonstrations will detail the build out of an end-to-end solution for the Internet of Things to transform data into insight, prediction, and action using cloud services. These cloud services enable you to quickly and easily build solutions to unlock insights, predict future trends, and take actions in near real-time.

    Samartha (Sam) Chandrashekar is a Program Manager at Microsoft. He works on cloud services to enable machine learning and advanced analytics on streaming data.
  • Machine Learning towards Precision Medicine Recorded: Feb 16 2017 47 mins
    Paul Hellwig Director, Research & Development, at Elsevier Health Analytics
    Medicine is complex. Correlations between diseases, medications, symptoms, lab data and genomics are of a complexity that cannot be fully comprehended by humans anymore. Machine learning methods are required that help mining these correlations. But a pure technological or algorithm-driven approach will not suffice. We need to get physicians and other domain experts on board, we need to gain their trust in the predictive models we develop.

    Elsevier Health Analytics has developed a first version of the Medical Knowledge Graph, which identifies correlations (ideally: causations) between diseases, and between diseases and treatments. On a dataset comprising 6 million patient lives we have calculated 2000+ models predicting the development of diseases. Every model adjusts for ~3000 covariates. Models are based on linear algorithms. This allows a graphical visualization of correlations that medical personnel can work with.
  • Bridging the Data Silos Recorded: Feb 15 2017 48 mins
    Merav Yuravlivker, Chief Executive Officer, Data Society
    If a database is filled automatically, but it's not analyzed, can it make an impact? And how do you combine disparate data sources to give you a real-time look at your environment?

    Chief Executive Officer Merav Yuravlivker discusses how companies are missing out on some of their biggest profits (and how some companies are making billions) by aggregating disparate data sources. You'll learn about data sources available to you, how you can start automating this data collection, and the many insights that are at your fingertips.
  • Data Visualization IS NOT Self Service BI - The Case for IoT in BI Recorded: Feb 15 2017 27 mins
    Lee Hermon, Sisense Engagement Manager and Adi Azaria, Sisense Chief Evangelist
    Businesses today already know that visualization in business intelligence is an essential part of competitive success. Yet, too many organizations are falling behind because of the inability to keep up with demand for information. One mistake is thinking that self-serve data visualization is all they need when setting up a self-service BI environment.

    Debunking the common myth, we will explore why data visualization IS NOT self-service BI. The only way for Information workers to become more self-sufficient is by having a BI environment that is more usable but also more consumable. It is these two themes—usability and consumability - that play crucial roles in a fully functioning self-service BI environment. Using modern IoT technologies, the modern business can expand access and consumability of data by engaging the human senses of sight, sound, and touch.

    Join Lee Hermon, Sisense Engagement Manager, as he explores the limitations of current Self Service Visualization models and Adi Azaria, Sisense co-founder & Chief Evangelist as he introduces how IoT in Business Intelligence is changing the game.
  • Hype vs. reality: the truth of self-service BI Recorded: Feb 15 2017 41 mins
    Glen Rabie, CEO, Yellowfin
    The desire to balance the needs of business users with the governance requirements of IT will kill the current concept of self-service BI.

    Discover how to drive success with analytics and better separate the self-service hype from reality with Yellowfin's CEO, Glen Rabie.

    Glen will discuss:
    - The impact of the self-service pipe dream
    - Why changing BI tools won't help
    - What self-service BI should actually look like
  • Comparison of ETL v Streaming Ingestion,Data Wrangling in Machine/Deep Learning Recorded: Feb 15 2017 45 mins
    Kai Waehner, Technology Evangelist, TIBCO
    A key task to create appropriate analytic models in machine learning or deep learning is the integration and preparation of data sets from various sources like files, databases, big data storages, sensors or social networks. This step can take up to 50% of the whole project.

    This session compares different alternative techniques to prepare data, including extract-transform-load (ETL) batch processing, streaming analytics ingestion, and data wrangling within visual analytics. Various options and their trade-offs are shown in live demos using different advanced analytics technologies and open source frameworks such as R, Python, Apache Spark, Talend or KNIME. The session also discusses how this is related to visual analytics, and best practices for how the data scientist and business user should work together to build good analytic models.

    Key takeaways for the audience:
    - Learn various option for preparing data sets to build analytic models
    - Understand the pros and cons and the targeted persona for each option
    - See different technologies and open source frameworks for data preparation
    - Understand the relation to visual analytics and streaming analytics, and how these concepts are actually leveraged to build the analytic model after data preparation
  • Strategies for Successful Data Preparation Recorded: Feb 14 2017 33 mins
    Raymond Rashid, Senior Consultant Business Intelligence, Unilytics Corporation
    Data scientists know, the visualization of data doesn't materialize out of thin air, unfortunately. One of the most vital preparation tactics and dangerous moments happens in the ETL process.

    Join Ray to learn the best strategies that lead to successful ETL and data visualization. He'll cover the following and what it means for visualization:

    1. Data at Different Levels of Detail
    2. Dirty Data
    3. Restartability
    4. Processing Considerations
    5. Incremental Loading

    Ray Rashid is a Senior Business Intelligence Consultant at Unilytics, specializing in ETL, data warehousing, data optimization, and data visualization. He has expertise in the financial, manufacturing and pharmaceutical industries.
  • Data Science Apps: Beyond Notebooks with Apache Toree, Spark and Jupyter Gateway Recorded: Feb 14 2017 48 mins
    Natalino Busa, Head of Applied Data Science, Teradata
    Jupyter notebooks are transforming the way we look at computing, coding and problem solving. But is this the only “data scientist experience” that this technology can provide?

    In this webinar, Natalino will sketch how you could use Jupyter to create interactive and compelling data science web applications and provide new ways of data exploration and analysis. In the background, these apps are still powered by well understood and documented Jupyter notebooks.

    They will present an architecture which is composed of four parts: a jupyter server-only gateway, a Scala/Spark Jupyter kernel, a Spark cluster and a angular/bootstrap web application.
  • Visualization: A tool for knowledge Recorded: Feb 14 2017 49 mins
    Luis Melgar, Visual Reporter at Univision News
    During the last decades, concepts such as Big Data and Data Visualization have become more popular and present in our daily lives. But what is visualization?

    Visualization is an intellectual discipline that allows to generate knowledge through visual forms. And as in every other field, there are good and bad practices that can help consumers or mislead them.

    In this webinar, we will address:

    -What it’s Data Visualization and why it’s important
    -How to choose the right graphic forms in order to represent complex information
    -Interactivity and new narratives
    -What tools can be used
  • How to Setup and Manage a Corporate Self Service Analytics Environment Recorded: Feb 14 2017 48 mins
    Ronald van Loon, Top Big Data and IoT influencer and Ian Macdonald, Principal Technologist (Pyramid Analytics)
    As companies face the challenges arising from a surge in the number of customer interactions and data, it can be difficult to successfully manage the vast quantities of information and still provide a positive customer experience. It is incumbent upon businesses to create a consumer-centric experience that is powered by (predictive) analytics.

    Adopting a data-driven approach through a corporate self-service analytics (SSA) environment is integral to strengthening your data and analytics strategy.


    During the webinar, speakers Ronald van Loon & Ian Macdonald will:

    •Expand upon on the benefits of a corporate SSA environment
    •Define how your business can successfully manage a corporate SSA environment
    •Present supportive case studies
    •Demonstrate practical examples of analytic governance in an SSA environment using BI Office from Pyramid Analytics.
    •Discuss practical tips on how to get started
    •Cover how to avoid common pitfalls associated with a SSA environment

    Stay tuned for a Q&A with speaker Ronald van Loon and domain expert Ian Macdonald, Principal Technologist, Pyramid Analytics.
Managing and analyzing data to inform business decisions
Data is the foundation of any organization and therefore, it is paramount that it is managed and maintained as a valuable resource.

Subscribe to this channel to learn best practices and emerging trends in a variety of topics including data governance, analysis, quality management, warehousing, business intelligence, ERP, CRM, big data and more.

Embed in website or blog

Successfully added emails: 0
Remove all
  • Title: Why Visualization is Critical for Data Wrangling
  • Live at: Jul 16 2015 3:00 pm
  • Presented by: Phil Vander Broek, User Experience Manager, Trifacta; Will Davis, Director of Product Marketing, Trifacta
  • From:
Your email has been sent.
or close