Hi [[ session.user.profile.firstName ]]

How to Optimize and Tune your Spark Data Pipelines

The first step to understanding and maintaining optimal application performance is to create a holistic, end-to-end perspective on your Spark data pipelines and platform integrations. With modern data pipelines composed of numerous processing stages, data engineers and data scientists can lose time focusing on part of the ecosystem as they do not have access to the end to end flow. Developing an end-to-end view requires collecting and correlating application metadata and identify poor performance failures at the application and operational level.

Join Unravel expert Aengus Rooney to develop an understanding of the performance dynamics of modern data pipelines and applications. In this session, you will learn about uncovering and understanding the key datasets, metrics, and best practices needed to develop mastery with Spark performance management.
Recorded Jun 19 2019 25 mins
Your place is confirmed,
we'll send you email reminders
Presented by
Aengus Rooney, Head of Solution Engineering - International, Unravel Data
Presentation preview: How to Optimize and Tune your Spark Data Pipelines

Network with like-minded attendees

  • [[ session.user.profile.displayName ]]
    Add a photo
    • [[ session.user.profile.displayName ]]
    • [[ session.user.profile.jobTitle ]]
    • [[ session.user.profile.companyName ]]
    • [[ userProfileTemplateHelper.getLocation(session.user.profile) ]]
  • [[ card.displayName ]]
    • [[ card.displayName ]]
    • [[ card.jobTitle ]]
    • [[ card.companyName ]]
    • [[ userProfileTemplateHelper.getLocation(card) ]]
  • Channel
  • Channel profile
  • Effective Cost Management for Azure Databricks Feb 20 2020 6:00 pm UTC 60 mins
    Abha Jain, Senior Director of Products, Unravel Data
    Azure Databricks has become very popular as a computing framework for big data. However, customers are finding unexpected costs eating into their cloud budget. Furthermore, lack of visibility to root cause and general inefficiency is costing organizations thousands, if not millions in operating their Azure Databricks environment.

    Join Unravel to discuss new features to effectively help manage costs on Azure Databricks:

    Cost analytics to provide assurance and forecasting for optimizing Databricks workloads as they scale.

    Accurate, detailed chargeback reporting of the cost of running data apps on Azure Databricks.

    Right-sizing recommendations to reveal the best virtual machine or workload types that will provide same performance on cheaper clusters.
  • Best Practices: Troubleshoot and Optimize Spark Data Pipelines with Unravel Feb 20 2020 4:00 pm UTC 60 mins
    Muji Qadri, Senior Solution Engineer, Unravel Data
    Join Unravel to develop an understanding of the performance dynamics of modern data pipelines and applications. In this session, you will learn about uncovering and understanding the key datasets, metrics, and best practices needed to develop mastery with Spark performance management on-premise and in the Cloud.
  • Best Practices: Monitoring End-to-End Hbase Performance with Unravel Recorded: Feb 6 2020 30 mins
    Chris Santiago, Solution Engineering Director, Unravel Data
    Running real-time data injection workloads on HBase clusters are always challenging. Timely, up-to-date, detailed data is crucial to locating and fixing issues to maintain a cluster's health and performance. Join us to learn how Unravel provides detailed data and metrics to help you identify the root causes of cluster and performance issues in Hbase.
  • 5 Ways to Slash your On-Premise Hadoop Platform Costs Recorded: Jan 23 2020 59 mins
    Chris Santiago, Solution Engineering Director, Unravel Data
    Make your on-premise Hadoop platform faster, better & cheaper with Unravel by joining Chris Santiago, Solution Engineering Manager to learn how to reduce the time troubleshooting and the costs involved in operating your data platform. During this webinar we will demonstrate how Unravel complements and extends your existing on-premise data platform to:

    Instantly understand why technologies such as Spark applications, Kafka jobs, and Impala underperform or even fail!
    Define and meet enterprise service levels through proactive reporting and alerting.
    Reduce the overall cost of Cloudera/MapR/Apache Hadoop/Spark through better cluster utilisation resulting to an immediate reduction in MTTI and MTTR
  • Look Before You Leap: Migrating On-Premises Hadoop to AWS Recorded: Jan 17 2020 55 mins
    Jason Baick, Senior Director of Product Marketing, Unravel; Javier Ramirez, Senior Developer Associate, AWS
    Lack of agility, excessive costs, and administrative overhead are convincing on-premises Spark and Hadoop customers to migrate to cloud native services on AWS. As you’re migrating these applications to the cloud, Unravel helps ensure you won’t be flying blind.

    Join AWS and Unravel as we discuss:

    Top reasons customers choose AWS for their cloud migration journey,
    Advantages of planning out your Hadoop migration to AWS,
    Demo: Migration assessment capabilities to ensure risk-free migration.
  • Unravel Demo - Big Data Application Performance Management Recorded: Jan 8 2020 36 mins
    Mick Nolen, Senior Solution Engineer, Unravel Data
    Enterprises across all sectors have invested heavily in big data infrastructure (Hadoop, Impala, Spark, Kafka, etc.) to turn data into insights into business value. It is increasingly challenging for Data Ops teams to operate and maintain these clusters to meet business requirements and performance SLAs. Unravel helps organizations optimize performance, automate troubleshooting and contain costs - on premises or in the cloud. Register for a demo of Unravel for big data application performance management.
  • Application Performance Management & Operational Intelligence for Amazon EMR Recorded: Dec 31 2019 44 mins
    Abha Jain, Director of Products, Unravel Data; Shashi Raina, Partner Solution Architect at Amazon Web Services
    According to Ovum research, over half of big data workloads will be running in the cloud by the end of this year (2019). Amazon EMR is an industry leading cloud-native big data platform that can easily run Apache Spark, Hadoop, Presto and Hive. Unravel for Amazon EMR provides a solution to deliver comprehensive monitoring, troubleshooting, and application performance management for Amazon EMR environments.

    In this webinar, we will discuss:
    Overview of Amazon EMR with common use cases;
    Application Performance Management for Amazon EMR;
    Comprehensive reporting, alerting, and recommendations for optimization
  • Azure Cloud Migration for your Modern Data Applications Recorded: Nov 29 2019 23 mins
    Chris Santiago, Solution Engineering Manager, Unravel Data
    Whether you are looking to establish a “cloud first” strategy for big data or are migrating from on-premises Cloudera, Hortonworks, and MapR, this session provides practical insights on how to make that journey simple and cost effective on Azure. Join Chris Santiago as he shares how a data driven approach can guide you in deciding which cloud technologies will best fit the needs unique to your organisation and budget.
  • Unravel for Amazon EMR via AWS Marketplace Recorded: Oct 29 2019 12 mins
    Abha Jain, Director of Products, Unravel Data
    Unravel for Amazon EMR via AWS Marketplace
  • Unravel for Azure HDInsights via Azure Marketplace Recorded: Oct 29 2019 7 mins
    Abha Jain, Director of Products, Unravel Data
    How to get started with Unravel for Azure HDInsight from the Azure Marketplace
  • APM & Operational Intelligence for Azure Databricks Recorded: Oct 3 2019 41 mins
    Abha Jain, Director of Products, Unravel Data; Ron Abellera, Microsoft Global Blackbelt Microsoft,
    According to Ovum research, over half of big data workloads will be running in the cloud by the end of this year (2019). Microsoft Azure provides a number of options for powering your modern data estate with the flexibility and scalability of the cloud. AI driven, intelligent DataOps is critical to gain visibility to modern data operations. In this webinar, we will focus on:

    Advantages of running modern data platforms in the cloud
    The importance of visibility into your cloud data infrastructure
    Demonstration of Unravel for Azure Databricks to manage DataOps on Azure

    Try Unravel risk free with a 60 day license and up to $15K Free Azure for starting a Proof of Concept. Contact: hello@unraveldata.com
  • Migrating Big Data Workloads to the Cloud with Unravel Recorded: Sep 25 2019 7 mins
    Abha Jain, Director of Products, Unravel Data
    Migrating Big Data Workloads to the Cloud with Unravel
  • Unravel for Azure Cloud Migration Recorded: Sep 10 2019 13 mins
    Abha Jain, Director of Products, Unravel Data
    As you’re migrating your Spark and Hadoop applications to Microsoft Azure, Unravel helps ensure you won’t be flying blind. With data-driven intelligence and recommendations for optimizing compute, memory, and storage resources, Unravel makes your transition a smooth one. Abha Jain, Director of Products at Unravel demonstrates how.
  • Unravel for Azure Databricks overview demo Recorded: Sep 10 2019 6 mins
    Abha Jain, Director of Products, Unravel Data
    Director of Products Abha Jain provides a demo of Unravel's support for Azure Databricks.
  • Unravel for Cloud Migration Recorded: Sep 5 2019 18 mins
    Abha Jain, Director of Products, Unravel Data
    As you’re migrating your Spark and Hadoop applications to the cloud, Unravel helps ensure you won’t be flying blind. With data-driven intelligence and recommendations for optimizing compute, memory, and storage resources, Unravel makes your transition a smooth one. Abha Jain, Director of Products at Unravel demonstrates how.
  • How to Optimize Spark Data Pipelines on Azure Databricks Recorded: Aug 14 2019 33 mins
    Aengus Rooney, Head of Solution Engineering - International, Unravel Data
    Join Unravel expert Aengus Rooney to develop an understanding of the performance dynamics of modern data pipelines and applications. In this session, you will learn about uncovering and understanding the key datasets, metrics, and best practices needed to develop mastery with Spark performance management on Azure Databricks.
  • Transforming the Business of Healthcare with Data Operations Recorded: Aug 13 2019 6 mins
    Charles Boicy, CIO, Clearsense & Kunal Argawhal
    Unravel and Clearsense chief executives discuss the potential life and death challenges of big data in healthcare.
  • Clearsense on keeping their big data promises in Healthcare Recorded: Aug 13 2019 2 mins
    Charles Boicy, CIO, Clearsense
    Clearsense CIO Charles Boicy explains why you'd be out of your mind to monitor your big data environment without Unravel.
  • DataOps Done Right: How to Optimize DataOps for the Cloud Recorded: Aug 6 2019 64 mins
    George Demarest, Senior Director of Product Marketing, Unravel; Wayne W. Eckerson; Eckerson Group
    Modern applications are powered by data that must first run through a gamut of software, systems, and technologies before being consumed by business users. DataOps represents an emerging discipline for designing, managing, and monitoring the flow of data from source to target. DataOps provides a level of rigor required to manage dozens or hundreds of data pipelines that potentially serve mission-critical applications with stringent service level agreements.

    Today, companies want to run some or all of their data pipelines in the cloud or spanning cloud and non-cloud platforms. But how does that work in theory and in practice? How does a DataOps team manage the processes, technologies, and data when pipelines cross multiple environments? What does a DataOps for the cloud look like? This webcast will define DataOps, explore best practices, and discuss how DataOps can build and manage data pipelines in the cloud.
  • Understanding DataOps and Its Impact on Application Quality Recorded: Jul 23 2019 59 mins
    George Demarest, Senior Director of Product Marketing, Unravel; Chris Riley, Editor, Sweetcode.io
    Modern day applications are data driven and data rich. The infrastructure your backends run on are a critical aspect of your environment, and require unique monitoring tools and techniques. In this webinar learn about what DataOps is, and how critical good data ops is to the integrity of your application. Intelligent APM for your data is critical to the success of modern applications. In this webinar you will learn:

    The power of APM tailored for Data Operations
    The importance of visibility into your data infrastructure
    How AIOps makes data ops actionable
AI-powered performance management for your modern data applications.
At Unravel, we see an urgent need to help every business understand and optimize the performance of their applications, while managing data operations with greater insight, intelligence, and automation.

For these businesses, Unravel is the AI-powered data operations company. We offer novel solutions that leverage AI, machine learning, and advanced analytics to help you fully operationalize the way you drive predictable performance in your modern data applications and pipelines.

Embed in website or blog

Successfully added emails: 0
Remove all
  • Title: How to Optimize and Tune your Spark Data Pipelines
  • Live at: Jun 19 2019 9:00 am
  • Presented by: Aengus Rooney, Head of Solution Engineering - International, Unravel Data
  • From:
Your email has been sent.
or close