Hi [[ session.user.profile.firstName ]]

Scalable Cross-Platform R-Based Predictive Analytics

In this webinar we will take a quick tour through an end-to-end predictive analytics session. We will start by exploring our data with summaries and histograms.

Using the knowledge gleaned from data exploration, we will create transformations to clean our data and prepare it for model building. Next, we will establish a prediction baseline by performing linear regression.

Then we will apply a state-of-the-art black box algorithm, Ensembles of Decision Trees, to push prediction to the limit. Finally, we will use this high quality ensemble model to score new data, completing the prediction workflow.

We will discover how to perform these steps scalably using an R-based tool across a wide range of platforms: Windows and Linux laptops and workstations, multicore servers, Hadoop and MPI clusters, and massively parallel databases.
Recorded Dec 12 2013 46 mins
Your place is confirmed,
we'll send you email reminders
Presented by
Mario Inchiosa, US Chief Scientist at Revolution Analytics
Presentation preview: Scalable Cross-Platform R-Based Predictive Analytics

Network with like-minded attendees

  • [[ session.user.profile.displayName ]]
    Add a photo
    • [[ session.user.profile.displayName ]]
    • [[ session.user.profile.jobTitle ]]
    • [[ session.user.profile.companyName ]]
    • [[ userProfileTemplateHelper.getLocation(session.user.profile) ]]
  • [[ card.displayName ]]
    • [[ card.displayName ]]
    • [[ card.jobTitle ]]
    • [[ card.companyName ]]
    • [[ userProfileTemplateHelper.getLocation(card) ]]
  • Channel
  • Channel profile
  • Separating the Myths from the Truths in Data Visualisation Aug 10 2016 11:00 am UTC 45 mins
    Andy Kirk, Data Visualization specialist and Editor, VisualisingData.com
    In this talk Andy Kirk will shine a light on some of the most discussed and debated aspects of data visualisation design. The aim of the talk is to expose some of the myths about data visualisation and reinforce some of the truths in order to offer practitioners, professionals and part-time enthusiasts alike greater clarity about this increasingly popular discipline.

    Viewers will come away with a greater understanding of the rights and the wrongs in data visualisation as well as an awareness of the aspects of this activity that must remain tagged with the elusive notion of ‘it depends’. Along the way Andy will exhibit some of the best examples and techniques from across the field.
  • Predictive and Prescriptive Power Discovery from Fast, Wide, Deep Big Data Jul 22 2016 3:00 pm UTC 45 mins
    Kirk Borne, Principal Data Scientist, Booz Allen Hamilton
    I will summarize the stages of analytics maturity that lead an organization from traditional reporting (descriptive analytics: hindsight), through predictive analytics (foresight), and into prescriptive analytics (insight). The benefits of big data (especially high-variety data) will be demonstrated with simple examples that can be applied to significant use cases.

    The goal of data science in this case is to discover predictive power and prescriptive power from your data collections, in order to achieve optimal decisions and outcomes.
  • Big Data Analytics for Risk Management Jul 21 2016 4:00 pm UTC 60 mins
    Graham Seel (BankTech Consulting), Shirish Netke (Amberoon), Bob Mark (Black Diamond Risk)
    When it comes to tracking the flow of money, it’s no doubt that studying patterns and analytics behind the transactions are important in fighting financial crime.

    Join this session where we'll discuss:

    -The application of machine learning and big data in AML monitoring
    -How to implement proper Know-Your-Customer (KYC) processes
    -Challenges around automation and using predictive analytics to prevent future issues
  • Open APIs and the Future of Banking Jul 21 2016 3:01 pm UTC 60 mins
    Annine Nordestgaard Bentzen (Hufsy), Jeremy Light (Accenture), Stefan Weiß (Fidor), Jan Sirich (Nordea)
    A successful Application Programming Interface (API) strategy relies heavily on concepts of open infrastructure and open data. The adoption of Open APIs in banking is thus an idea that has been met with excitement and, understandably, concern as well.

    Attend this summit where our experts will discuss:

    -What’s in it for banks/fintechs?
    -What are the pitfalls when it comes to opening up APIs for banks and integrating into open APIs for fintechs?
    -PSD2 - will you be ready (mostly a consideration for banks)?
    -How should we (fintechs and banks) operate until the PSD2 is rolled out?
  • How to build a geolocated recommender using Python, Spark ML, Cassandra and Akka Jul 13 2016 3:00 pm UTC 45 mins
    Natalino Busa, Head of Applied Data Science at Teradata
    Natalino introduces a collection of machine learning techniques to extract insights from location-based social networks such as Facebook, demonstrating how to combine a dataset of venues’ check-ins with the user social graph using Spark and how to use Cassandra as a storage layer for both events and models before sketching how to operationalize such predictive models and embed them as microservices. In terms of data architecture this processing follows closely the SMACK stack.

    The proposed data-pipeline is effective at detecting patterns in the sequences of visited venues and recommend relevant venues to visit next, based on the user, and friends location's history as well as the venue popularity graph. Natalino Busa explains how these predictive analytics tasks can be accomplished by using Spark SQL, Spark ML, and just a few lines of Scala and Python code.
  • Predictive analytics: How to use today’s insights to create tomorrow’s business Jul 13 2016 2:00 pm UTC 45 mins
    Ronald van Loon, Director Business Development, Adversitement
    Companies today are focusing on creating a 360-degree customer view. To do so, the first step is to have your data collection up and running, making sure that you can deliver data to a centralized environment, from which it can be used for further processing. If you manage this, where do you start if you want to find patterns and insights to outperform the competition? In other words: how can you discover the predictors in your customer data that lead to churn, sales & up & cross sales?

    In this webinar Ronald van Loon, Director at Advertisement, will:
    •Discuss several case studies
    •Elaborate on the challenges
    •Define the impact for organizations and professionals responsible for online sales and customer retention
    •Show how a new approach and technology can solve these challenges
    •Discuss the result for organisations
  • The Role of FPGAs in SparK Accelerators Jul 12 2016 4:00 pm UTC 60 mins
    Shreyas Shah, Principal Data center Architect, Xilinx
    In the cloud computing era, data growth is exponential. Every day billions of photos are shared and large amount of new data created in multiple formats. Within this cloud of data, the relevant data with real monetary value is small. To extract the valuable data, big data analytics frame works like SparK is used. This can run on top of a variety of file systems and data bases. To accelerate the SparK by 10-1000x, customers are creating solutions like log file accelerators, storage layer accelerators, MLLIB (One of the SparK library) accelerators, and SQL accelerators etc.

    FPGAs (Field Programmable Gate Arrays) are the ideal fit for these type of accelerators where the workloads are constantly changing. For example, they can accelerate different algorithms on different data based on end users and the time of the day, but keep the same hardware.

    This webinar will describe the role of FPGAs in SparK accelerators and give SparK accelerator use cases.
  • Time for Financial Institutions to Get Serious about Predictive Analytics Jul 11 2016 2:00 pm UTC 60 mins
    Kasper Sylvest (Danske Bank), Amir Tabakovic (BigML), Nick Jetten (VODW)
    This first white paper of the new series discusses the value of predictive analytics for the financial industry and answers the
    question why this is the right time to start with predictive analytics and how to empower entire organisations to use it.

    As mobile technology evolves and everything around us – not just our mobile devices– is becoming connected we are
    entering a new era of connected experiences. The customer journey in the financial industry is completely digitized. This
    exponentially increases the number of interactions between a financial service company and its customers.
    Customers expect banks to understand their context and the challenge for financial industry is to be relevant at all these interactions.

    In this webinar, we will discuss:

    -How predictive analytics will lead to vast improvements of existing static business rules and achieve progress like reducing cost, increasing revenues and improving customer experience

    -Why Mobey Forum expects that predictive analytics skills will soon be essential for banks to keep their position in the market
    against non-banks but also other banks that will be using predictive analytics as a competitive weapon

    -Why we should not just focus on a "rear view mirror" approach, but also identify and address questions concerned with the future

    -Areas of application for predictive analytics in financial institutions

    -Case studies of card-linked offers, next best action, pricing, claim handling, risk assessment
  • Big-Data-as-a-Service: On-Demand Elastic Infrastructure for Hadoop and Spark Recorded: Jun 22 2016 56 mins
    Kris Applegate, Big Data Solution Architect, Dell; Tom Phelan, Chief Architect, BlueData
    Watch this webinar to learn about Big-Data-as-a-Service from experts at Dell and BlueData.

    Enterprises have been using both Big Data and Cloud Computing technologies for years. Until recently, the two have not been combined.

    Now the agility and efficiency benefits of self-service elastic infrastructure are being extended to big data initiatives – whether on-premises or in the public cloud.

    In this webinar, you’ll learn about:

    - The benefits of Big-Data-as-a-Service – including agility, cost-savings, and separation of compute from storage
    - Innovations that enable an on-demand cloud operating model for on-premises Hadoop and Spark deployments
    - The use of container technology to deliver equivalent performance to bare-metal for Big Data workloads
    - Tradeoffs, requirements, and key considerations for Big-Data-as-a-Service in the enterprise
  • Windows 10: Digital Transformation through Affordable Innovation Recorded: Jun 16 2016 17 mins
    Peter Gossin, Digital Transformation Manager, Microsoft
    Digital transformation is the process of using today’s technology to modernize outdated processes and meet the most pressing needs of your business.

    Thanks to recent advances in lower cost tablet technology and Microsoft’s suite of cloud and productivity services, complete digital transformation is more accessible now than ever before. A new class of affordable devices is revolutionizing the way businesses and their employees work and interact with customers.

    Sign up now to:
    •Engage your customers
    •Empower your employees
    •Optimize your operations
    •Transform your products
  • Data Modeling in Hadoop Recorded: Jun 16 2016 49 mins
    Maloy Manna, Engineering, AXA Data Innovation Lab
    The USP of Hadoop over traditional RDBMS is "Schema on Read".

    While the flexibility of choices in data organization, storage, compression and formats in Hadoop makes it easy to process data, understanding the impact of these choices on search, performance and usability allows better design patterns.

    Learning when and how to use schemas and data model evolution due to required changes is key to building data-driven applications.

    This webinar will explore the various options available and their impact to allow better design choices for data processing and metadata management in Hadoop.
  • Storage Architectures for Next Generation Cognitive Analytics Recorded: Jun 13 2016 36 mins
    Balint Fleischer, Chief Research Officer, Huawei
    In the era of data explosion in Cloud-Mobile convergence and Internet of Things, traditional architectures and storage systems will not be sufficient to support the transition of enterprises to cognitive analytics. The ever increasing data rates and the demand to reduce time to insights will require an integrated approach to data ingest, processing and storage to reduce end-to-end latency, much higher throughput, much better resource utilization, simplified manageability, and considerably lower energy usage to handle highly diversified analytics. Yet next-generation storage systems must also be smart about data content and application context in order to further improve application performance and user experience. A new software-defined storage system architecture offers the ability to tackle such challenges. It features seamless end-to-end data service of scalable performance, intelligent manageability, high energy efficiency, and enhanced user experience.
  • Solving the "Middle Data" problem Recorded: Jun 9 2016 34 mins
    Natalino Busa, Head of Applied Data Science at Teradata
    We are very well aware that companies like Facebook, Twitter, Whatsapp deal with datasets in the range of 100's of Petabytes and more. However not all datasets are that big. Did you know that all english pages of Wikipedia amount to just 49 GB uncompressed text data? Likewise, there are a large amount of datasets ranging from customers data to events and transactions which do not exceed the low Terabyte range.

    In this webinar we will discuss how to process data in this range both for interactive queries as well for batch processing. We will look at what tradeoffs can be made by tuning the architecture with SSD and RAM. And which distributed computing paradigm work best for this datasets and their typical workloads. We will revision the concepts of data locality, data replication and parallel computing for this specific class of datasets.
  • The Big Data decision path incorporating SAP landscapes Recorded: Jun 8 2016 49 mins
    Swen Conrad, CEO, Ocean9
    Leading companies derive big data technology choices from business needs instead of technology merits. With the variety of possible use cases, either Hadoop, Spark or SAP HANA may provide the best fit to solve business challenges and create value.

    Sounds easy, but managing a variety of big data solutions within a single company puts a skills and cost premium on the organization.

    This session will guide you to the right big data technology according to business needs and highlights the fastest path to adoption.
  • Demystifying Spark and Hadoop for Big Data Analytics to gain business insight. Recorded: Jun 8 2016 37 mins
    Adrian Whitehead, Specialist Systems Engineer, Isilon Storage Division, EMC ETD
    Organisations are spoilt for choice when it comes to Big Data tools with current trends promoting Hadoop as a method of analysing vast amounts of stored unstructured data. Organisations are also increasingly looking towards tools which can monitor live feeds - e.g. Twitter - to perform actions in real time based on keywords. To perform this valuable analysis Spark has become the ecosystem of choice.

    Join this session to uncover which tool to choose to improve the performance of your business.
  • Marketing Data Governance: Making Sense of the Current Legal Landscape Recorded: May 27 2016 31 mins
    Chris Slovak and Maltie Maraj, Tealium
    In today’s digital world, with the rich streams of customer data now available come important responsibilities in data governance. From the vendors we choose to work with, to the policies and practices we have in place, today’s marketers are increasingly responsible for ensuring customer data is handled with the utmost concern for security and privacy. In this session at Digital Velocity 2016, Chris Slovak, VP of Global Sales Solutions at Tealium, and Maltie Maraj, Senior Counsel at Tealium, detail the current legal landscape (in marketers’ terms!), and provide guidelines for a more comprehensive approach to data governance and informed technology decisions.
  • Delivering Better Online Experiences with Offline Data Recorded: May 27 2016 31 mins
    Monique Trulson, Director of E-Commerce at Dover Saddlery
    There’s no denying that we now live in an omnichannel world – and that organizations across industries are grappling with how to provide a consistent customer experience across channels. Connecting data across touch points, both online and offline, is not an easy feat - but is the key to delivering the highly relevant, personalized interactions that result in conversions. In this session at Digital Velocity 2016, Monique Trulson shares her deep experience in marrying online data with previously siloed offline data to build a unified customer view, and leveraging that unified view to transform customer engagement.
  • Machine Learning in Financial Services: Generating insights and staying ahead Recorded: May 26 2016 61 mins
    Jay van Zyl (Innosect), Pedro Bizarro (Feedzai), Natalino Busa (Teradata), Matt Mills (Featurespace)
    One of the main benefits of Machine Learning is being able to analyse a large amount of data at the speed and efficiency that would require a huge team of humans. This is something that has proven to be very necessary in the Financial Services industry, where insurance companies, banks, and lenders need actionable insights quickly.

    Join this panel where we will discuss:
    -Why is Machine Learning such a hot topic? What are the benefits/challenges?
    -What is needed to do Machine Learning right?
    -Case studies of how Machine Learning is helping financial institutions — better customer experience, faster actionable insights
    -How ML is able to spot trends and patterns to mitigate risk
  • How Data Analytics is Driving the Future of Financial Services Recorded: May 25 2016 64 mins
    Ina Yulo (BrightTALK), Vamsi Chemitiganti (Hortonworks), Bob Savino (Moven), Jamie Donald (Moneyhub), Pedro Arellano (Birst)
    Businesses around the world have recognised “data management and analytics” as one of the key areas where they are investing time and money. The demand for this push is largely due to new regulations as well as pressure from customers and investors.

    From digital banks which visualise your spending habits, to predictive analytics helping understand consumers’ financial habits, and even to how Big Data can be used to fight fraud and reduce risk, join this panel where industry luminaries will tackle the different opportunities that analytics can unlock.
  • Complex Data Analysis with Microsoft Recorded: May 19 2016 19 mins
    Discover how to zero in on the data and insights you need to analyze cost, inventory, and production data to drive supply chain efficiency.

    Analyzing and interpreting complex data sets can be very time consuming. Using specific tools can help team members collaborate and gain insights on important projects.

    supply chain, logistics, sales, marketing, data, team, HR, CRM
Managing and analyzing data to inform business decisions
Data is the foundation of any organization and therefore, it is paramount that it is managed and maintained as a valuable resource.

Subscribe to this channel to learn best practices and emerging trends in a variety of topics including data governance, analysis, quality management, warehousing, business intelligence, ERP, CRM, big data and more.

Embed in website or blog

Successfully added emails: 0
Remove all
  • Title: Scalable Cross-Platform R-Based Predictive Analytics
  • Live at: Dec 12 2013 7:00 pm
  • Presented by: Mario Inchiosa, US Chief Scientist at Revolution Analytics
  • From:
Your email has been sent.
or close