How to build a geolocated recommender using Spark ML, Cassandra and Akka
Natalino introduces a collection of machine learning techniques to extract insights from location-based social networks such as Facebook, demonstrating how to combine a dataset of venues’ check-ins with the user social graph using Spark and how to use Cassandra as a storage layer for both events and models before sketching how to operationalize such predictive models and embed them as microservices. In terms of data architecture this processing follows closely the SMACK stack.
The proposed data-pipeline is effective at detecting patterns in the sequences of visited venues and recommend relevant venues to visit next, based on the user, and friends location's history as well as the venue popularity graph. Natalino Busa explains how these predictive analytics tasks can be accomplished by using Spark SQL, Spark ML, and just a few lines of Scala and Python code.
RecordedJul 14 201646 mins
Your place is confirmed, we'll send you email reminders
Pete Hannam, 6point6 | Tom Mack, Qubole | Raghu Ramakrishnan, Microsoft | Further panelists to be announced
As data analytics becomes an engrained feature of many successful businesses, it's important to ensure that data-driven insights are made accessible to decision makers throughout the organization.
Deep-dive into the topic of how self-service analytics tools are democratizing data and bringing business users deeper into the equation with subject matter expert panelists.
- The landscape of self-service analytics tools available
- Techniques and strategies to ensure your self-service analytics solution is working for your business
- How self-service analytics is enabling business users and all members of an organization to make data-driven decisions
- and more!
Pete Hannam, Head of Data Engineering, 6point6
Tom Mack, VP EMEA, Qubole
Raghu Ramakrishnan, CTO for Data, Microsoft
Tim Carmichael, Director & Chief Data Officer, Ensifera Limited
Richard Corderoy, Oakland Data & Analytics | Andy Mott, Arcadia Data
With just a few weeks to the UK's largest data & analytics event, we've gathered some of the elite speakers who will be taking the stage to debate the latest trends, hottest solutions and the biggest opportunities (and challenges) for businesses in a data-driven world.
* Fast Data & DataOps
* Self-Service Analytics
* Artificial Intelligence
* Customer Experience
* Data Governance
What will they be talking about at The Olympia, London, on the 13-14 November 2018, what do they want to hear about, what are they looking forward to?
Join this panel discussion and arm yourself for excellence in this brave new data-driven world.
Richard Corderoy, Chief Data Officer, Oakland Data and Analytics
Andy Mott, Senior Consultant, Arcadia Data
Your instincts are good — but AI can make them flawless. With AI, every time a customer engages with your brand, you’re making that moment really count.
Add AI powered predictive analytics to ensure the right message at the right time, targeting is accurate and high-value, and customer support is proactive. Automate chat-based customer service, and give your agents more info off the top. Level up the shopping experience by offering the right products at the right time — and add contextual conversation to narrow their choices down.
Customers want this kind of service and attention. And if you can’t deliver, they’re going to find the company that can. So don’t miss this VB Live event, where you’ll learn more about being relevant to your customers, making their engagement meaningful, and more!
Register now for free!
Attend this webinar and learn:
* How AI levels up personalization and customer engagement
* How to use AI-fueled data analytics to create tactical marketing plans.
* How to create personalized moments without being creepy
* How to increase real and effective relevance to customers across channels
* Grant Langston, CEO, Eharmony
* Dave Gerhardt, VP Marketing, Drift
* Brian Witlin, CEO, Yummly
* Moira Dorsey, Founder, Dorsey Experience
Isabelle Nuage (Talend), Brajesh Goyal (Cavirin), Vincent Lam (Talend), Erin Junio (BrightTALK)
Whether you're just starting your cloud analytics journey, or you're a seasoned veteran, the question remains: Is your cloud analytics strategy delivering the insights you need? This panel of experts will help you discover the right way to harness your cloud data to give your big data team the resources they need to make informed business decisions. From preventing your cloud data lakes from becoming data swamps, to scaling your cloud environment to accommodate all your data sources, learn how to activate your cloud BI strategy & achieve powerful analytics results
About the Presenters
Brajesh Goyal is an industry veteran in the hybrid cloud space, bringing to Cavirin more than 20 years of high tech engineering experience. He was the founder & CEO for ITAPP, a disruptive cloud management company that was purchased by ServiceNow in 2016. At Oracle, he defined the term “enterprise grid computing” (former name for cloud) and wrote books on the topic. He continued to lead initiatives for enterprise grid computing, virtualization, and cloud at NetApp. BG holds a Master's degree, in Computer Science and Engineering from University of Minnesota, and a Bachelor of Technology Degree in CS from the Indian Institute of Technology
Isabelle Nuage is Director of Product Marketing at Talend. Her field of expertise include Data Integration, Big Data and Analytics. Isabelle brings 20+ years of experience in the software industry holding various leadership positions in product marketing at SAP & Business Objects. She holds a post graduate degree in computer science applied to GIS from the Pierre & Marie University in Paris, France
Vincent Lam is Head of Cloud Product Marketing at Talend. Throughout his career he has held leadership roles in marketing, product management & product development involving innovative technology solutions to complex problems. Mr. Lam is author of several patents and his background includes innovation across technology firms, Wall St, & entrepreneurship
Enterprise preparation for AI has centered almost exclusively on data prep and data science talent. While without data there would be no AI, enterprises that fail to ready the broader organization, chiefly people, process, and principles, don’t just stunt their capacity for good AI, they risk sunk investment, jeopardize employee trust, brand backlash, or worse.
Ensuring sustainable deployment starts with assessing enterprise data strategy, aligning myriad stakeholders, technological feasibility assessment, and a coordinated approach to ethics.
Join VentureBeat and industry analyst and founding partner of Kaleido Insights, Jessica Groopman for discussion on the five fundamentals of AI readiness at our upcoming VB Live event!
Attend this webinar and learn:
* What you need to do to prepare for AI-- beyond the data science team
* Real-world examples and research findings
* Top 5 best practices for strategic AI implementation
* Nathan Decker, Director of eCommerce, evo
* Ken Natori, President, Natori Company
* Jessica Groopman, Industry analyst and founding partner of Kaleido Insights
* Rachael Brownell, Moderator, VentureBeat
Harry Glaser, Co-founder and CEO of Periscope Data and Erin Junio, Content Manager at BrightTALK
This webinar is part of BrightTALK's What's Big in BI series.
What is the moral responsibility of a data team today? As artificial intelligence and machine learning technologies become part of our everyday life and as data and big data insights become accessible to everyone, CDOs and data teams are taking on a very important moral role as the conscience of the corporation.
In this episode of What's Big in BI, Harry Glaser highlights the risks companies will face if they don't empower data teams to lead the way for ethical data use across a variety of functions including business intelligence, analytics, big data initiatives and more.
Harry founded Periscope Data in 2012 with co-founder Tom O’Neill. The two have grown Periscope Data to serve more than 1000 customers. Glaser was previously at Google, and graduated from the University of Rochester with a bachelor’s degree in computer science.
Ash Seddeek, Founder of bestcash and Erin Junio, Content Manager at BrightTALK
This webinar is part of BrightTALK's Founders Spotlight Series, where each week we feature inspiring founders and entrepreneurs from across industries.
In this episode, Ash Seddeek will share how he channeled his entrepreneurial spirit into the launch of bestcash - a Fintech start up bringing a new point of view to financial services servicing 70-140k income earners.
Ash will answer questions about:
- What inspired him to start bestcash.
- What problem or pain points he aims to solve with bestcash.
- How to close the gap between financial services companies and their target users.
- How he plans to sustain bestcash's differentiation in order to appeal to customers and keep them coming back in the long term.
Join us for this live session where we encourage the audience to participate by asking Ash any live questions they may have!
Natalino Busa, Head of Applied Data Science, Teradata
Jupyter notebooks are transforming the way we look at computing, coding and problem solving. But is this the only “data scientist experience” that this technology can provide?
In this webinar, Natalino will sketch how you could use Jupyter to create interactive and compelling data science web applications and provide new ways of data exploration and analysis. In the background, these apps are still powered by well understood and documented Jupyter notebooks.
They will present an architecture which is composed of four parts: a jupyter server-only gateway, a Scala/Spark Jupyter kernel, a Spark cluster and a angular/bootstrap web application.
Sparkling Water integrates H2O, open source distributed machine learning platform, with the capabilities of Apache Spark. It allows users to leverage H2O’s machine learning algorithms with Apache Spark applications via Scala, Python, R or H2O’s Flow GUI which makes Sparkling Water a great enterprise solution.
Sparkling Water 2.0 was built to coincide with the release of Apache Spark 2.0 and introduces several new features. One of the latest and largest features is the ability to configure Sparkling Water for different workloads, scale and optimize the platform according to your data and needs.
In this talk we will introduce the basic architecture of Sparkling Water, go over different scaling strategies and explain the pros and cons of each solution. We will also compare the approaches with regards to the specific use cases and provide the rationale why or why not each solution may be a good fit for the desired use case.
This talk will finish with a live demo demonstrating the mentioned approaches and should give you a real time experience of configuring and running Sparkling Water for your use case(s)!
Where are your target customers going, and how are they spending their time and, more importantly, their dollars? Location data and intelligence – not just on how consumers are interacting with your brand but also with your competitors – is key to crafting a killer consumer experience and reaching them when and where their hearts and minds (and wallets) are ready to be captured.
From foot traffic patterns and location visits to frequency analysis, custom venue visit analysis offers powerful, actionable insights to companies looking for a competitive edge in a crowded field. Learn how to capture new customer interest, keep older customers coming back, and boost your market share when you join this VB Live event!
Register for free now!
During this webinar you’ll learn how to:
* Boost engagement with location-based consumer insights and competitive intelligence
* Gain insight into the behavioral patterns of customers and prospects
* Apply the best use of location data for your business
* David Bairstow, SVP Product Management, Skyhook
* Sheryl Jacobson, Principal Consulting Strategy and Analytics, Deloitte Consulting LLP
* Stewart Rogers, Analyst at Large, VentureBeat (Moderator)
Avatars, AI and Chatbots: Learn how virtual humans, immersive technology, and AI chatbots are being used across multiple industries. Retail, hospitality, real estate, training, customer service, professional sports, health and wellness, and celebrities are now being driven by human realistic avatars and AI. Learn how Quantum Capture, Portico, and other industry leaders are helping big brands increase the bottom line, drive sales, and enhance productivity. Virtual humans can convey trust, empathy, and evoke an emotional connection that increases guest satisfaction, increases learning and retention, and overall happiness.
Apache Spark for Big Data Analysis combined with Apache Zeppelin for Visualization is a powerful tandem that eases the day to day job of Data Scientists.
In this webinar, you will learn how to:
+ Collect streaming data from the Twitter API and store it in a efficient way
+ Analyse and Display the user interactions with graph-based algorithms wi.
+ Share and collaborate on the same note with peers and business stakeholders to get their buy-in.
SIG Talk: Quality & Testing - LeanFT: How to Combine with Existing Selenium & How It Enables Intelligent Automation (Complete Edition)
Another exciting Quality and Testing SIG Talk focused around Micro Focus LeanFT will feature speakers from Germany and the United Kingdom. Both of these experts will share their knowledge and experience with Micro Focus LeanFT and how it can be combined with your existing Selenium test automation as well as how it can enable Intelligent Automation (IA).
Speaker: Daniel Horn
Combining Micro Focus LeanFT with your Existing Selenium Test Automation: Based on current market disruptors, we want to give an Introduction into Micro Focus LeanFT. Starting with a short Comparison of LeanFT and Selenium, the main topic will be the integration of LeanFT into existing Selenium solutions. At the end, we will give a few guidelines and ideas for using LeanFT in your projects.
Speaker: Jonathon Wright
Enterprise AIOps – Augmented Intelligence – Leveraging Micro Focus LeanFT to enable Intelligent Automation (IA): The dawn of Artificial Intelligence (AI) is upon us. Is your favorite test harness up to the job of testing AI platforms like Graph-based ML or Computer Vision? How can you practically start your journey towards Enterprise AIOps? Leveraging Micro Focus LeanFT to enable Intelligent Automation across Functional API, along with Security and Performance testing utilizing microcontainerization technologies such as Docker and Kubernetes, will help you achieve enterprise grade cognitive adaptive testing.
Adnyesh Dalpati, Solutions Architect, Iotians Group
Marketers have inevitably grown more and more reliance on data, with the general view being held that collecting more data means knowing more about your audience. But having more data is not the only prerogative but monetizing it by using right set of tools and technology is the way forward.
This webinar will focus in interesting ways marketing technology can enable businesses to monetize their data.
AI Machine Learning model accuracy depends on the quality of data. In data science, when we say quality of data, it means data consistency, data completeness and data correctness which are all part of data integrity. In this session we will talk about how machine learning models can be adopted for data integration. Also, in case of some of the machine learning models, we assume data is normally distributed or data elements are appropriately scaled. However, it is not always true. Hence, data has to be transformed by normalizing data without losing its integrity. This is a big challenge in data science. Data integrity is maintained with the help of integrity constraints or the rules that are designed to keep data consistent and correct. In this session we will discuss some of the techniques and methods used for data integration, data transformation and normalization while ensuring data integrity. We will walk you through the steps involved with the help of examples.
Akmal Chaudhri, Technology Evangelist, GridGain Systems
Attend this session to learn how to easily share state in-memory across multiple Spark jobs, either within the same application or between different Spark applications using an implementation of the Spark RDD abstraction provided in Apache Ignite. During the talk, attendees will learn in detail how IgniteRDD – an implementation of native Spark RDD and DataFrame APIs – shares the state of the RDD across other Spark jobs, applications and workers. Examples will show how IgniteRDD, with its advanced in-memory indexing capabilities, allows execution of SQL queries many times faster than native Spark RDDs or Data Frames.
Akmal Chaudhri has over 25 years experience in IT and has previously held roles as a developer, consultant, product strategist and technical trainer. He has worked for several blue-chip companies such as Reuters and IBM, and also the Big Data startups Hortonworks (Hadoop) and DataStax (Cassandra NoSQL Database). He holds a BSc (1st Class Hons.) in Computing and Information Systems, MSc in Business Systems Analysis and Design and a PhD in Computer Science. He is a Member of the British Computer Society (MBCS) and a Chartered IT Professional (CITP).
Robert Cruz, Senior Director, Information Governance Practice, Smarsh
During this presentation, Robert Cruz, Senior Director/Information Governance Practice at Smarsh will discuss the governance challenges of today’s data – namely, the growth of social, mobile, and rich, dynamic content. Cruz will also look at the process of governing your data in the cloud and provide tips and best practices for qualifying cloud services providers for data availability, performance and extraction.
David Ginsburg, Cavirin; Matt Walmsley, Vectra; Joseph Carson, Thycotic; Rishi Bhargava, Demisto
Two months after the implementation of GDPR and many companies are still struggling to address the challenges of stricter compliance requirements and regulations. With data-generated revenue on the line, ensuring a firm handle on your businesses' approach to data governance and compliance will help ease the transition into the new GDPR era and keep your data flowing.
Tune in as this panel of experts take a deep-dive into data governance best practices, the challenges businesses are facing with GDPR, how to navigate common compliance hurdles and more.
Managing and analyzing data to inform business decisions
Data is the foundation of any organization and therefore, it is paramount that it is managed and maintained as a valuable resource.
Subscribe to this channel to learn best practices and emerging trends in a variety of topics including data governance, analysis, quality management, warehousing, business intelligence, ERP, CRM, big data and more.