Hi [[ session.user.profile.firstName ]]

Machine Learning Loves Hadoop

During this live webinar you will hear from Cloudera’s Director of Data Science, Sean Owen, as he discusses how Cloudera’s enterprise data hub allows data scientist to leverage libraries full of machine learning algorithms to analyze high dimensional, high volume data. Sean will also speak to common machine learning challenges and how Cloudera’s enterprise data hub can help eliminate these issues.

Topics we will cover during the 1 hour session will include:
What is machine learning?
Why should I use machine learning algorithms?
What are the common challenges of machine learning?
How does Cloudera’s enterprise data hub support machine learning?

Following the presentation, Sean will be holding a live Q&A to address any outstanding questions of yours.
Recorded Jun 18 2014 46 mins
Your place is confirmed,
we'll send you email reminders
Presented by
Sean Owen, Director of Data Science at Cloudera
Presentation preview: Machine Learning Loves Hadoop

Network with like-minded attendees

  • [[ session.user.profile.displayName ]]
    Add a photo
    • [[ session.user.profile.displayName ]]
    • [[ session.user.profile.jobTitle ]]
    • [[ session.user.profile.companyName ]]
    • [[ userProfileTemplateHelper.getLocation(session.user.profile) ]]
  • [[ card.displayName ]]
    • [[ card.displayName ]]
    • [[ card.jobTitle ]]
    • [[ card.companyName ]]
    • [[ userProfileTemplateHelper.getLocation(card) ]]
  • Channel
  • Channel profile
  • Enterprise Analytics Journey, the IBM point of view for IBM Z customers Dec 14 2017 1:00 pm UTC 45 mins
    Hélène Lyon, IBM, Distinguished Engineer, IBM Z Solutions Architect
    IT is a key player in the digital and cognitive transformation of business processes delivering solutions for improved business value with analytics. This session will step by step explain the journey to secure production while adopting new analytics technologies leveraging mainframe core business assets
  • Mixed Reality - Market Potential and Use Cases Dec 14 2017 9:00 am UTC 60 mins
    Adnyesh Dalpati, Solutions Architect
    Mixed reality is the result of blending the physical world with the digital world. Though it is relatively new technology and its adoption is still in initial stages. Mixed Reality devices and applications are projected to be the next technological era after smart phones.

    The webinar will give a brief on Mixed Reality Potential Usecases those provide an immersive experience but also revenues streams to the creators.
  • Data Fabric: A New Paradigm For Self-Service Data & Data Scientists Dec 12 2017 5:00 pm UTC 45 mins
    Kelly Stirman, VP Strategy, Dremio
    Data Scientists are rare and highly valued individuals, and for good reason: making sense of data, and using the machine learning libraries requires an unusual blend of advanced skills. Why is it then that Data Scientists spend the majority of their time getting data ready for models, and a fraction actually doing the high value work?

    In this talk we introduce the concept of Data Fabric, a new way to provide a self-service model for data, where data scientists can easily discover, curate, share, and accelerate data analysis using Python, R, and visualization tools, no matter where the data is managed, no matter the structure, and no matter the size.

    We will talk through the role of Apache Arrow, the in-memory columnar data standard that is accelerating analytics for GPU-based processing, as well as the role of Pandas and Arrow in providing unprecedented speed in accessing datasets from Python.
  • Applying Machine Learning to unstructured files and data for research Dec 12 2017 11:00 am UTC 45 mins
    Dr Tom Parsons, Research Director and Mitchell Murphy, Data Scientist from Spotlight Data
    Researchers generate huge amounts of valuable unstructured data and articles from research every day. The potential for this information is huge: cancer and pharmaceutical breakthroughs, advances in technology and cultural research that can improve the world we live in.

    This webinar discusses how text mining and Machine Learning can be used to make connections across this broad range of files and help drive innovation and research. We discuss using Kubernetes microservices to analyse the data and then applying Machine Learning and graph databases to simplify the reuse of the data.
  • IOT Future Roadmap - Adoption & Standardization Nov 10 2017 9:00 am UTC 60 mins
    Adnyesh Dalpati, Solutions Architect
    Internet of Things (IoT) envisions that everything in the physical world is connected seamlessly and is securely integrated through Internet. New products are innovated under the umbrella of IOT and opening up different opportunities. This webinar will discuss the future potential of IOT and the trend in which it is moving in adoption and standardisation.
  • Setting up BI in FinTech companies: challenges and opportunities Nov 2 2017 4:00 pm UTC 45 mins
    Mari Hermanns, Head of Business Intelligence at Solaris Bank
    Today most companies collect more data than ever and as we all know: data is the new oil. However gaining insights and turning them into action is easier said than done. In my experience this is a challenge for many companies, including innovative FinTechs.

    In order to create a data driven business and organisational culture it is important to integrate data collection and an appreciation for data driven truth from the starting of a venture. This webinar is a brief overview of the hurdles and challenges BI faces in growing FinTech companies and how they can be overcome. Furthermore this webinar will briefly mention new BI trends and tools and how they could impact businesses.
  • Three Ways To Accelerate Your Data Lake Migration To Cloud Oct 25 2017 4:00 pm UTC 45 mins
    Kelly Stirman, VP Strategy, Dremio
    Public cloud deployments have become irresistible in terms of flexibility, low barriers to entry, security, and developer friendliness. But the sheer inertia of traditional data lakes make them difficult to transition to cloud. In this talk we'll look at examples of how leading companies have made the transition using open source technologies and hybrid strategies.

    Instead of following a "lift and shift" strategy for moving data lake workloads to the cloud, there are new considerations unique to cloud that should be considered alongside traditional approaches related to compute (eg, GPU, FPGA), storage (object store vs. file store), integrations, and security.

    Viewers will take away techniques they can immediately apply to their own projects.
  • Designing Data Lakes: Architecture options with open source tools Oct 25 2017 12:00 pm UTC 60 mins
    Maloy Manna, PM Engineering, AXA Data Innovation Lab, Paris
    The concept of Data lakes evolved to address challenges and opportunities in managing big data.

    Organizations are investing massive amounts of time and money to upgrade existing data infrastructures and build data lakes whether on-premises or in the cloud.

    This talk will discuss architectures and design options to implement data lakes with open source tools. Also covered are challenges of upgrade & migration from existing data warehouses, metadata management, supporting self-service and managing production deployments.
  • Virtual Data Lake: A Reality Oct 25 2017 10:00 am UTC 45 mins
    Hélène Lyon, IBM, Distinguished Engineer, IBM Z Solutions Architect
    As an Enterprise customer, you are potentially using IBM Z in a hybrid cloud implementation. Let's understand how to benefit from cloud access to mainframe data without moving it outside z; thereby improving security, reducing integration challenges and answering your GDPR auditor's needs.
  • Bias in the Web Oct 24 2017 5:00 pm UTC 60 mins
    Ricardo Baeza-Yates, CTO, NTENT; ACM Fellow; IEEE Fellow
    The Web is the most powerful communication medium and the largest public data repository that humankind has created. Its content ranges from great reference sources such as Wikipedia to ugly fake news. Indeed, social (digital) media is just an amplifying mirror of ourselves. Hence, the main challenge of search engines and other websites that rely on web data is to assess the quality of such data. However, as all people have their own biases, web content, as well as our web interactions, are tainted with many biases.

    Data bias includes redundancy and spam, while interaction bias includes activity and presentation bias. In addition, sometimes algorithms add bias, particularly in the context of search and recommendation systems. As bias generates bias, we stress the importance of de-biasing data as well as using the context and other techniques such as explore & exploit, to break the filter bubble.

    The main goal of this talk is to make people aware of the different biases that affect all of us on the Web. Awareness is the first step to be able to fight and reduce the vicious cycle of bias.

    Ricardo Baeza-Yates areas of expertise are web search and data mining, information retrieval, data science, and algorithms. He is CTO of NTENT, a semantic search technology company based in California, USA since 2016. Before, he was VP of Research at Yahoo Labs, based first in Barcelona, Spain, and later in Sunnyvale, California, from January 2006 to February 2016. He also is part time Professor at DTIC of the Universitat Pompeu Fabra, in Barcelona, Spain, as well as at DCC of Universidad de Chile in Santiago.
  • Using Docker to realise a modular Big Data Platform & Leveraging SANSA Stack Oct 24 2017 12:00 pm UTC 45 mins
    Dr. Hajira Jabeen, Senior Researcher at the University of Bonn
    Join this webinar where senior researches will present:

    1) Big Data Integrator Platform
    - Use of Docker and Docker swarm to realize a modular Big Data Platform

    2) Semantic Analytics Stack
    - Use of Big data distributed processing engines to leverage Scalable
    processing for the Semantic Web (RDF data representation, Querying,
    Inference, and Machine Learning)

    3) Seven societal challenges in Big Data Europe
    -Combination of different Big Data tools to create Big Data Value Chain
    (Pipeline) for different Use cases representing the societal challenges
  • Advanced Analytics in the Cloud - the latest cloud analytics innovations Oct 24 2017 8:00 am UTC 60 mins
    Iver van de Zand
    Iver van de Zand will talk and demo on the latest SAP innovations for analytics in the cloud. Keywords are live connectivity and the closed loop of combined business intelligence, planning and predictive analytics all in one environment. Fully ready and prepared for big data.
  • Blockchain: Making It More Than Just a Trend Oct 12 2017 12:00 pm UTC 60 mins
    Travin Keith (Agavon), Sergey Ivliev (Lykke), Vlad Dramaliev (aeternity), Makoto Takemiya (Soramitsu)
    Perhaps one of the buzziest of buzzwords for the past few years has been "Blockchain". But for those who are well-versed in the technology and the goings on of the industry, it is definitely more than just hype.

    Join this session where a panel of experienced Blockchain specialists will discuss:

    -Viability of Use Cases - Just because a centralized database can be more efficient doesn't mean it's the most viable

    -Preventing Blockchain from Being Just a Trend - if blockchain isn't viable as the solution, it shouldn't be used

    -Public vs Private Blockchain Use

    -State of the Industry - Understanding that a lot of the platforms and frameworks are in their early stage of development and that support may not be readily available
  • Image recognition with deep learning Oct 11 2017 5:00 pm UTC 60 mins
    Layla Tadjpour, Data Science Consultant, Ph.D. in Electrical Engineering from University of Southern California.
    In this webinar, we will learn about image recognition with deep learning. After a brief overview of what deep learning is, and why it matters, we will learn how to classify dogs from cats. That is, how to train a model to recognize dog images from cat images.

    We use Keras, an easy to use python deep learning library that sits on top of Tensorflow, and “fine-tuning”, a very important skill for any deep learning practitioner, to train a model to classify the images.

    Once we trained our model to classify dogs from cats images with high accuracy, we dig into the details of the trained model and look at its building blocks, i.e., Convolutional Neural Networks (CNN), Fully Connected Block and activation functions to develop an understanding of how the deep learning model works.
  • Introduction to Blockchain: Bitcoin, Ethereum, Ledgers, and more Oct 9 2017 2:00 pm UTC 60 mins
    David Siegel, Blockchain, decentralization and business agility expert
    Still confused about this whole Blockchain thing? Interested in investing in digital currencies, but not sure where to start? Want to get a better idea of the threats and opportunities?

    David Siegel is a Blockchain, decentralization and business agility expert who has been a high-level management & strategy consultant to companies like Sony, Hewlett Packard, Amazon, NASA, Intel, and many start-ups. David has been praised for being able to explain Blockchain in the most simple and interesting way.

    What you will learn:
    -What is Bitcoin?
    -What is the blockchain?
    -What is Ethereum? What is Ether?
    -What is a distributed application?
    -What is a smart contract?
    -What is a triple ledger?
    -What about identity and security?
    -What business models are at risk?
    -What are the opportunities?
    -What should we do?
  • The Cognitive Bank: Leveraging Advanced Analytics and Artificial Intelligence Recorded: Sep 21 2017 63 mins
    Vivek Bajaj, Global VP of Solutions for IBM Financial Services
    Today the payments industry faces a rebirth by necessity. Financial institutions process massive volumes of customer and payments transaction data, much of it unstructured and untapped.

    Cognitive Systems have the ability to understand, reason and learn. In Financial Services applying cognitive capabilities to real world payments issues like safer and faster payments is yielding significant results. Furthermore Risk and Compliance and segment of one engagement are areas where ROI is tremendous when leveraging advanced analytics and artificial intelligence in cohesion.

    Learn from real world use cases of how financial institutions globally have gained significant competitive advantage by becoming a truly Cognitive Bank.
  • HDFS on Kubernetes: Lessons Learned Recorded: Sep 19 2017 46 mins
    Kimoon Kim, Pepperdata Software Engineer
    HDFS on Kubernetes: Lessons Learned is a webinar presentation intended for software engineers, developers, and technical leads who develop Spark applications and are interested in running Spark on Kubernetes. Pepperdata has been exploring Kubernetes as potential Big Data platform with several other companies as part of a joint open source project.

    In this webinar, Kimoon Kim will show you how to: 

    –Run Spark application natively on Kubernetes
    –Enable Spark on Kubernetes read and write data securely on HDFS protected by Kerberos
  • Using Machine Learning in Software License Discovery for Greater SAM Maturity Recorded: Sep 13 2017 56 mins
    Bill Dyck, Nick Rendall
    Accurate software asset management (SAM) is critical to a company’s IT operations in order to optimize usage and cost, and ensure that license compliance is achieved to avoid true-ups and penalties. Keeping a company’s software application library up-to-date for SAM has been a continuous struggle for asset managers given the time it takes for manual entry of new titles, the changing information in software titles, and the sheer number of titles across an enterprise organization. Every software title not recognized in their library causes potential license compliance breaches (leading to true-ups and potential fines) and reduces an organization’s ability to optimize their software license usage for cost optimization.

    Micro Focus (HPE Software) has added the industry leading IDOL unstructured search engine to add machine learning to Universal Discovery, drastically improving in the process of software license recognition and teaching for quicker software license recognition and a consistently up-to-date SAI over time. Join us on September 13th for an introduction to this exciting new capability and to understand the business value it can bring to your SAM program.

    • Introduction to machine learning and the IDOL engine
    • Machine learning in Universal Discovery
    • Current SAM with Machine Learning Success StoriesLive
    • Q&A about this new technology
  • Visualizing Big Data Recorded: Sep 7 2017 64 mins
    Eva Murray, Exasol & Andy Kriebel, The Data School
    Learn about analyzing and visualizing big data in Tableau. Hear tips & tricks, see examples and ask your burning questions during this live webinar.

    We will give you a live demo on how to connect Tableau to Exasol's dataviz environment where you can explore large public datasets.

    We'll show you how to tackle these data sets when starting your analysis and take you through some example visualizations to inspire your own work.

    We'll be working with data in Exasol for an upcoming Makeover Monday challenge later in September, so use this opportunity to ask us any questions you have ahead of time so you can make the most out of playing with big data.
  • Follow up Q&A: A Guide to Machine Learning Patterns and Data Visualizations Recorded: Sep 6 2017 19 mins
    Dr. Umesh Hodeghatta Rao, CTO, Nu-Sigma Analytics Labs
    Catch this follow up Q&A from the A Guide to Machine Learning Patterns and Data Visualizations presentation. Umesh answers remaining and follow up questions from the audience.
Managing and analyzing data to inform business decisions
Data is the foundation of any organization and therefore, it is paramount that it is managed and maintained as a valuable resource.

Subscribe to this channel to learn best practices and emerging trends in a variety of topics including data governance, analysis, quality management, warehousing, business intelligence, ERP, CRM, big data and more.

Embed in website or blog

Successfully added emails: 0
Remove all
  • Title: Machine Learning Loves Hadoop
  • Live at: Jun 18 2014 4:00 pm
  • Presented by: Sean Owen, Director of Data Science at Cloudera
  • From:
Your email has been sent.
or close