Hi [[ session.user.profile.firstName ]]

Techniques to Establish Your Data Lake: How to Achieve Data Quality and Security

The growing volume and variety of data makes it imperative for organizations to manage and govern their data in a way that's scalable and cost-effective. The data lake – once considered just a relatively inexpensive storage solution – can now be a tool for deriving true business value. By implementing a set of best practices for establishing and managing your data lake, you can achieve 360-degree control and visibility of your data.

In this webcast, Ben Sharma, Zaloni's co-founder and CEO discusses techniques to balancing the flexibility a data lake can provide with the requirements for privacy and security that are critical for enterprise data.

Topics covered include:

- How to establish a managed data ingestion process - that includes metadata management - in order to create a solid foundation for your data lake

- Techniques for establishing data lineage and provenance

- Tips for achieving data quality

- Key considerations for data privacy and security

- Unique stages along the data lake lifecycle and management concepts for each stage

- Why a data catalog is important

- Considerations for self-service data preparation

About the speaker:

Ben Sharma is CEO and co-founder of Zaloni. He is a passionate technologist and thought leader in big data, analytics and enterprise infrastructure solutions. Having previously worked in technology leadership at NetApp, Fujitsu and others, Ben's expertise ranges from business development to production deployment in a wide array of technologies including Hadoop, HBase, databases, virtualization and storage. Ben is co-author of Architecting Data Lakes and Java in Telecommunications. He holds two patents.
Recorded Feb 16 2017 63 mins
Your place is confirmed,
we'll send you email reminders
Presented by
Ben Sharma, CEO and Co-Founder, Zaloni
Presentation preview: Techniques to Establish Your Data Lake: How to Achieve Data Quality and Security

Network with like-minded attendees

  • [[ session.user.profile.displayName ]]
    Add a photo
    • [[ session.user.profile.displayName ]]
    • [[ session.user.profile.jobTitle ]]
    • [[ session.user.profile.companyName ]]
    • [[ userProfileTemplateHelper.getLocation(session.user.profile) ]]
  • [[ card.displayName ]]
    • [[ card.displayName ]]
    • [[ card.jobTitle ]]
    • [[ card.companyName ]]
    • [[ userProfileTemplateHelper.getLocation(card) ]]
  • Channel
  • Channel profile
  • Customer Centric DataOps for Trusted Golden Records at Bremer Bank Apr 22 2020 5:00 pm UTC 60 mins
    Leilani Moll, VP Data & Analytics Services at Bremer Bank | Susan Cook, CEO at Zaloni
    A common obstacle for a successful customer 360 initiative is attributed to data sprawl and siloed data, which compromises data quality. Bremer Bank has addressed this problem by transforming their organization and data operations to be more customer centric. In this webinar, you will learn how Bremer Bank unified data across multiple business units and third party sources to build golden records in a governed and secure way. By first building a “nucleus” of customer data, Bremer Bank was able to both align with their data ethics mission and meet regulatory requirements in a cost-effective way.

    During the webinar Zaloni’s CEO, Susan Cook, and Bremer Bank’s VP of Analytics and Data Services, Leilani Moll will discuss common obstacles faced when pursuing customer 360 initiatives, building golden records from disparate sources, technology and architectural considerations, and finding success using a DataOps approach.

    Attend this webinar to learn how to:
    Create golden customer records from disparate sources
    Increase data operations efficiency with machine learning
    Find success using an ethical, customer centric approach
  • Seizing Your Data’s Unrealized Power: Talk with Zaloni’s CEO Recorded: Feb 26 2020 38 mins
    Susan Cook, CEO at Zaloni & Ben Sharma, CPO at Zaloni
    End-to-End, governed data operations is key to 2020 data success, speak with Susan Cook and Ben Sharma on why

    The multiplying of offerings in the data ecosystem, the move to cloud, and the increase in data sources available to enterprises, has led to new series of challenges for CIOs and CDOs - namely, how to accelerate the time to analytics value while maintaining the control needed to ensure data compliance, security, and quality.

    To succeed against the growing potential of data to increase performance, companies need true insight across the entire data journey, a view too often obscured by an uncoordinated array of data-related vendors.

    In this first of a new series, Susan Cook and Ben Sharma sit down to discuss the role data operations plays as the “air traffic control” of the data supply chain.

    Join them as they talk about:
    - The value of having control across the “day in the life of your data”
    - How to achieve greater control while broadening permissioned data access
    - Why Zaloni’s emphasis on extensibility + governance is key to analytics success
  • From Source to Value: Why Automation is Driving Data Catalog Success Recorded: Jan 30 2020 47 mins
    Matthew Monahan, Senior Product Manager
    Traditional data catalog solutions often require a conglomeration of separate tools (multiple catalogs, ETL, data governance, etc.) which are managed in silos by separate teams. When a data analyst needs to derive business value from this data, it requires communication across teams, integrations between products, and a high level of coordination to get them the data they need.

    A single platform, on the other hand, provides a single source of truth for analysts to quickly gain access to the data they need in a self-service manner. From source to provisioning, the automated data catalog keeps gears aligned and the train on the track. This reduces the burden on the IT staff, while ensuring the right level of governance over the whole process.

    An automated data catalog provides the workflow to take your data from source to value without manual intervention. Allowing a small team to accomplish the same tasks as one much larger. The catalog can automatically bring the data in from the systems of record, execute data quality rules, profile the data, prepare it for consumption, and provision it to the locations where the analysts can use it.

    During this webinar, Matthew Monahan, Senior Product Manager at Zaloni, will explore:
    - The benefits of a single application over connecting various point-solutions
    - How automation from source to destination reduces your workload
    - Real-world examples that you can leverage
  • More data, less work. It’s not a dream, it’s a complete data catalog Recorded: Dec 4 2019 58 mins
    Matthew Monahan, Senior Product Manager
    Time to insights. Time to deployment. Time to value. It seems that time is an important factor for many big data projects. The faster they can be completed, the faster the investment starts to pay off. However, today’s organizations face challenges with complex data sprawl, lack of control and security, and data quality issues. These challenges result in unreliable and stale data by the time it reaches a data analyst or data scientist.

    You need a solution that allows you to centrally manage and govern distributed data, ensure data quality and security, automate processes, and provide self-service access to quickly and effectively deliver trusted data to your end-users.

    In this first of 3 series, Matthew Monahan, Zaloni’s Senior Product Manager, will take you through cutting-edge data management techniques and show you how to leverage a single platform to manage the complete end-to-end data pipeline. Curious how Zaloni can save you time?

    Matthew will be discussing:
    - Connecting and cataloging distributed data
    - Harnessing the power of machine learning and AI to build an augmented data catalog
    - Leveraging autonomous data management for improved data quality
    - Applying right-sized data governance for security and control
    - Providing self-service provisioning for near-real-time access to data

    This webinar will provide you with the information you need to guide your users on their journey to an amazing self-service data experience.
  • Change your Company with a Data Platform Recorded: Oct 25 2019 4 mins
    Mike Brady, Industry Thought Leader
    Listen as Mike Brady discusses what's needed for a company to modernize its data environments with a self-service data platform. Learn how a move to the cloud and a data governance focus fosters innovation at companies across the world.
  • How to Build Customer Golden Records that Increase Customer Lifetime Value Recorded: Oct 10 2019 45 mins
    Tim Blackwell, Analytics Data Architect at Chalhoub, and Scott Gidley, VP Product at Zaloni
    How does your company build and maintain customer relationships? Today, companies own numerous data sources and store massive volumes of customer data. Extensive data collection poses the risks of data duplication, poor data quality, and a lack of transparent data access. Without accurate and reliable customer data, companies may face higher spending along with acquisition and reacquisition costs.

    Obstacles like these were apparent across Chalhoub's luxury brand and retail enterprise, a corporation managing over 650 stores throughout the gulf region. To maximize their customer data across multiple e-commerce and CRM systems, Chalhoub architected and deployed a cloud-based, centralized data hub on Microsoft Azure that provided data mastering capabilities to create customer golden records and enabled a 360-degree-view of the customer across all company brands. In this webinar, learn how Chalhoub was able to continue its "customer first" approach and improve their brand performance with Zaloni as a partner in the project.

    Join Tim Blackwell, Analytics Data Architect at Chalhoub, and Scott Gidley, VP Product at Zaloni, as they discuss how a holistic view of customer data allows for greater insights and improved targeted marketing to ultimately increase the lifetime value of their customers.

    Topics Include:
    - Utilizing the Customer 360 approach as a foundational data lake use case
    - Building customer golden records with a zone-based architecture
    - Value achieved through data acceleration
  • The Data Imperative Recorded: Sep 3 2019 7 mins
    Ben Sharma, CEO at Zaloni
    Watch as Ben Sharma, CEO at Zaloni, discusses the "data imperative." Which is to ensure that we guide the current digital transformation in ways that are not just smart, profitable and open, but also wise.

    Big data brings the promise of a new, enlightened era of cross-pollination and new ways of seeing information. It’s our responsibility to make data more available to achieve new possibilities while keeping it secure and private.
  • Governing your Cloud-Based Enterprise Data Lake Recorded: Aug 23 2019 40 mins
    Selwyn Collaco, CDO at TMX Group & Ben Sharma, CEO at Zaloni
    Watch as Selwyn Collaco, CDO at TMX Group, and Ben Sharma, CEO at Zaloni, discuss what it takes to properly architect a data lake to enable centralized data services and more.
  • Achieving Big Business Value from Big Data Initiatives Recorded: Aug 13 2019 34 mins
    Rick Karl, Vice President of Value Engineering, Zaloni
    ROI.
    Risk reduction.
    Cost savings.

    Ask 3 different people what their idea of business value means and you’ll often get three different answers. When you’re trying to implement an enterprise-wide data hub, it’s often necessary to show value early and often but how can that be easily done when there are multiple stakeholders?

    To justify the investment, it’s critical to show how your new data hub will meet the goals of IT, Finance, and any other departments involved in the initiative. By accurately measuring value, your project will be poised to cut costs, generate revenue, and radically transform the business.

    Join Rick Karl, Zaloni’s VP of Value Engineering, as he shows what’s needed when trying to highlight the value an enterprise data hub can provide to an organization.

    Topics include:
    - Challenges in demonstrating business value for complex data initiatives
    - How to perform a business value assessment for modern data efforts
    - Making the business case that aligns IT and Finance objectives
  • What's your Big Data Vision? Recorded: Jun 27 2019 15 mins
    Matthew Monahan, Senior Product Manager
    With the amount of data being created and collected by organizations, it’s imperative that senior executives at these data-driven companies have a solid vision of where they want to go, and how they’ll get there.

    To achieve this vision, the way we find, understand, and use data needs to shift.

    Join Matthew Monahan, Zaloni’s Senior Product Manager, and Eric Kavanagh, CEO of the Bloor Group, as they discuss big data vision in this excerpt from the DM Radio podcast. They’ll also address:
    - Data governance concerns
    - Self-service data
  • Evolving your Passive Data Catalog into an Active Data Hub Recorded: May 8 2019 31 mins
    Scott Gidley, Vice President of Product Management
    We all know that data is the lifeblood of a modern enterprise. But how can your business users action relevant, quality data into their applications for immediate value?

    The answer used to be “Build a data catalog”. Data catalogs have grown in popularity as an essential tool for understanding where your data exists and what it is. But, that’s only the first step – and an easy step at that! The harder part is giving your business users self-service access to understand THEIR catalog and enrich the data THEY need when THEY need it … and then allowing them to action it into their analytical or operational applications for rapid insights.

    Empowering the business: that’s where an “active data hub” differentiates from a “passive data catalog”.

    Join Scott Gidley, VP of Product Management from Zaloni, to learn how real world users are moving away from traditional data catalogs to embrace active data hubs, including:
    - How a unified data supply chain removes unwanted tooling and rapidly delivers real business value to departmental or line of business users
    - How business users can enrich their data themselves and action it into their business applications - with the right amount of governance and trust
    - Test yourself against the data hub maturity curve to determine where you are within your enterprise
  • A Governed Self-Service Data Platform Accelerates Insights Recorded: Mar 27 2019 58 mins
    Ryan Peterson, Global Technology Segment Lead at AWS & Scott Gidley, Vice President of Product at Zaloni
    Today's enterprises need a faster way to get to business insights. That means broader access to high-value analytics data to support a wide array of use cases. Moving data repositories to the cloud is a natural step. Companies need to create a modern, scalable infrastructure for that data. At the same time, controls must be in place to safeguard data privacy and comply with regulatory requirements.

    In this webinar, Zaloni will share its experience and best practices for creating flexible, responsive, and cost-effective data lakes for advanced analytics that leverage Amazon Web Services (AWS). Zaloni’s reference solution architecture for a data lake on AWS is governed, scalable, and incorporates the self-service Zaloni Data Platform (ZDP).

    Join our webinar to learn how to:
    - Create a flexible and responsive data platform at minimal operational cost.
    - Use a self-service data catalog to identify enterprise-wide actionable insights.
    - Empower your users to immediately discover and provision the data they need.
  • A Modern Digital Data Architecture: Best Practices for Adoption Recorded: Mar 13 2019 61 mins
    Alex Gurevich, DXC Technology’s Analytics CTO for the Americas & Clark Bradley, Solutions Engineer at Zaloni
    Organizations that put analytics and artificial intelligence (AI) at the core of their transformation strategy will survive and thrive in the age of digital disruption. To achieve this, a holistic, modern data architecture and a rock-solid information supply chain are critical for success.

    Organizations can deliver timely, self-service, democratized data access and analytical insights at enterprise scale by leveraging the innovation design principles of data lakes, scalable and elastic cloud infrastructures, and automated information pipelines. However, many find that these architectures are complex to create, deploy and operate — often resulting in poor performance, unnecessary expense and underutilized assets for the do-it-yourselfers. Transitioning to such architectures from legacy paradigms carries additional difficulty and risk, especially in hybrid environments that can span multiple design patterns and cloud providers.

    In this webinar, Clark Bradley, Zaloni solutions engineer, and Alex Gurevich, DXC Technology’s Analytics chief technology officer for the Americas, will present solution designs and representative field-use cases for simplifying and accelerating adoption of a modern, digital data architecture.

    Topics to be discussed will include:
    - Best practices for migrating from a legacy to a modern data architecture
    - Deploying a data catalog in support of data lake architectures
    - Data lake architectures for hybrid and cloud environments
    - Protecting data assets and privacy without obstructing access
  • So You’ve Got a Data Catalog...Now What? Recorded: Feb 27 2019 40 mins
    Scott Gidley, Vice President of Product
    Achieving actionable insights from data is the goal of any organization. To help in this regard, data catalogs are being deployed to build an inventory of data assets that provides both business and IT users a way to discover, organize and describe enterprise data assets. This is a good first step that helps all types of users easily find relevant data to extract insights from.

    Increasingly, end users want to take the next step in provisioning or procuring this data into a sandbox or analytics environment for further use. Attend this session to see how organizations are looking to build actionable data catalogs via a data marketplace, that allow self-service access to data without sacrificing data governance and security policies.

    Learn how to provide governed access and visibility to the data lake while still staying on track and within budget. Join Scott Gidley, Zaloni’s Vice President of Product, as he discusses:
    - Architecting your data lake to support next-gen data catalogs
    - Rightsizing governance for self-service data
    - Where a data catalog falls short and how to address
    - Success use cases
  • Empower your Enterprise with a Self-Service Data Marketplace Recorded: Feb 13 2019 38 mins
    Aashish Majethia, Senior Solutions Engineer
    Analysts need timely access to enterprise data in order to stay competitive in today’s rapidly changing environment. Typically, business users need to request access through the IT department, which can be a waiting game, either because of technological roadblocks, governance restrictions or both. This adds more work, more process, and more frustration on both sides. Having the ability to find data sets, examine, update, and provision the data themselves allows business users to move quickly and frees IT to work on higher priority items.

    A modern data platform should provide a self-service data marketplace that gives right-sized governed access to data. The security permissions allow IT to define who needs access to the correct data at the appropriate stage of the data pipeline. This becomes quite complicated in regulated environments. Users should be able to search for data they have access to, explore and potentially update the metadata associated, and provision it into a sandbox when ready.

    Join us as Aashish Majethia, a Senior Solutions Engineer, dives into the self-service data marketplace and what is required to make it successful. He will cover topics including:
    - Catalogs
    - Self-service data preparation
    - Governance considerations and how they can enable a more agile data-driven enterprise
  • The Top 3 Considerations for Modernizing Your Data Platform Recorded: Jan 16 2019 50 mins
    Clark Bradley, Solutions Engineer at Zaloni
    How does your organization collaborate with data? Aligning data management tasks across any size organization can be a challenge. This can be attributed to a lack of transparent data access, lack of big data skills, or antiquated toolsets that do not enable shared metadata for clear lineage of the data. Regardless of the reason, the results are slow, rigid decision-making processes.

    While modernizing your data architecture for more agility can seem overwhelming, with an integrated platform that enhances collaboration, organizations can reap the benefits of quality data that is well understood. The data platform should provide users with the ability to fully understand all aspects of the data with a simple, unified user interface where the business and IT can define, transform and provision the data. All while providing right-sized governance for access, security and auditability.

    Join Clark Bradley, Solutions Engineer with Zaloni, as he tackles modernizing your data platform and explains how your organization can expand collaborative practices with the Zaloni Data Platform.

    By the end of the presentation, you’ll be able to answer these questions:
    - Why is a data catalog important?
    - What do I need to know about data quality?
    - How does self-service play a role in the data strategy?
  • How to Achieve a 360° View of your Data Recorded: Jan 9 2019 58 mins
    Jatin Hansoty, Director of Solutions Architecture
    A majority of the data collected by organizations today is wasted. Whether through poor analytics, lack of resources, or because they have too much of it. So how can organizations turn this around and actually start utilizing their data for powerful results?

    More and more companies are taking their customer, product, patient, or other data and providing a 360-degree view using a governed and actionable data lake. By breaking down the silos associated with traditional data located in disjointed systems and databases, companies are finding new ways to improve loyalty programs, product development, marketing campaigns, and even find a new source of revenue from their data.

    Join Jatin Hansoty, Director of Solutions Architecture at Zaloni, as he dives into real-world use cases from several of the world’s top companies. Learn from their architecture and the results they achieved.

    Topics covered include:

    - Best practices
    - Common pitfalls to avoid
    - Real-world use cases
    - Future-proof architecture
  • How to Achieve a 360° View of your Data Recorded: Nov 28 2018 58 mins
    Jatin Hansoty, Director of Solutions Architecture
    A majority of the data collected by organizations today is wasted. Whether through poor analytics, lack of resources, or because they have too much of it. So how can organizations turn this around and actually start utilizing their data for powerful results?

    More and more companies are taking their customer, product, patient, or other data and providing a 360-degree view using a governed and actionable data lake. By breaking down the silos associated with traditional data located in disjointed systems and databases, companies are finding new ways to improve loyalty programs, product development, marketing campaigns, and even find a new source of revenue from their data.

    Join Jatin Hansoty, Director of Solutions Architecture at Zaloni, as he dives into real-world use cases from several of the world’s top companies. Learn from their architecture and the results they achieved.

    Topics covered include:

    - Best practices
    - Common pitfalls to avoid
    - Real-world use cases
    - Future-proof architecture
  • A Governed Self-Service Data Platform Accelerates Insights Recorded: Oct 30 2018 59 mins
    Ryan Peterson, Global Technology Segment Lead at AWS & Scott Gidley, Vice President of Product at Zaloni
    Today's enterprises need a faster way to get to business insights. That means broader access to high-value analytics data to support a wide array of use cases. Moving data repositories to the cloud is a natural step. Companies need to create a modern, scalable infrastructure for that data. At the same time, controls must be in place to safeguard data privacy and comply with regulatory requirements.

    In this webinar, Zaloni will share its experience and best practices for creating flexible, responsive, and cost-effective data lakes for advanced analytics that leverage Amazon Web Services (AWS). Zaloni’s reference solution architecture for a data lake on AWS is governed, scalable, and incorporates the self-service Zaloni Data Platform (ZDP).

    Join our webinar to learn how to:
    - Create a flexible and responsive data platform at minimal operational cost.
    - Use a self-service data catalog to identify enterprise-wide actionable insights.
    - Empower your users to immediately discover and provision the data they need.
  • So You’ve Got a Data Catalog...Now What? Recorded: Oct 3 2018 41 mins
    Scott Gidley, Vice President of Product
    Achieving actionable insights from data is the goal of any organization. To help in this regard, data catalogs are being deployed to build an inventory of data assets that provides both business and IT users a way to discover, organize and describe enterprise data assets. This is a good first step that helps all types of users easily find relevant data to extract insights from.

    Increasingly, end users want to take the next step in provisioning or procuring this data into a sandbox or analytics environment for further use. Attend this session to see how organizations are looking to build actionable data catalogs via a data marketplace, that allow self-service access to data without sacrificing data governance and security policies.

    Learn how to provide governed access and visibility to the data lake while still staying on track and within budget. Join Scott Gidley, Zaloni’s Vice President of Product, as he discusses:
    - Architecting your data lake to support next-gen data catalogs
    - Rightsizing governance for self-service data
    - Where a data catalog falls short and how to address
    - Success use cases
A data acceleration company
At Zaloni, we believe in the unrealized power of data. Our software platform, Arena, improves DataOps with an augmented catalog and controlled, self-service consumption. We work with the world's leading companies, delivering trusted data agility and cost savings while accelerating the time to analytics value. To find out more visit www.zaloni.com.

Embed in website or blog

Successfully added emails: 0
Remove all
  • Title: Techniques to Establish Your Data Lake: How to Achieve Data Quality and Security
  • Live at: Feb 16 2017 4:00 pm
  • Presented by: Ben Sharma, CEO and Co-Founder, Zaloni
  • From:
Your email has been sent.
or close