Hi [[ session.user.profile.firstName ]]

How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency

See how to consistently deliver accurate COUNT DISTINCT queries in under a second, even on petabyte-scale datasets. This presentation will share Apache Kylin’s approach to COUNT DISTINCT queries for user behavior analysis.
Recorded Jun 24 2021 60 mins
Your place is confirmed,
we'll send you email reminders
Presented by
Kaige Liu - Sr. Solutions Architect, Kyligence
Presentation preview: How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency

Network with like-minded attendees

  • [[ session.user.profile.displayName ]]
    Add a photo
    • [[ session.user.profile.displayName ]]
    • [[ session.user.profile.jobTitle ]]
    • [[ session.user.profile.companyName ]]
    • [[ userProfileTemplateHelper.getLocation(session.user.profile) ]]
  • [[ card.displayName ]]
    • [[ card.displayName ]]
    • [[ card.jobTitle ]]
    • [[ card.companyName ]]
    • [[ userProfileTemplateHelper.getLocation(card) ]]
  • Channel
  • Channel profile
  • Kyligence Pivot to Snowflake - A Solution for Excel Pivot Tables on Snowflake Aug 26 2021 5:00 pm UTC 16 mins
    George Demarest, Head of Marketing; Rachel Beddor, Solution Architect; Kyligence
    Kyligence has just introduced a new solution for Snowflake and Excel users called Kyligence Pivot to Snowflake. It provides support for Excel Pivot Tables against data in Snowflake Data Warehouses, seamlessly and transparently. This presentation explains the solution and provides a demonstration of the product.
  • Architecting Snowflake for High Concurrency and High Performance Aug 19 2021 5:00 pm UTC 62 mins
    Robert Hardaway, Senior Solutions Architect
    Cloud Data Warehousing juggernaut Snowflake has raced out ahead of the pack to deliver a data management platform from which a wealth of new analytics can be run. Using Snowflake as a traditional data warehouse has some obvious cost advantages over a hardware solution. But the real value of Snowflake as a data platform lies in its ability to support a high-concurrency analytics platform using Kyligence Cloud, powered by Apache Kylin.

    In this webinar, Senior Solutions Architect Robert Hardaway will describe a modern data service architecture using precomputation and distributed indexes to provide interactive analytics to hundreds or even thousands of users running against very large Snowflake datasets (TBs to PBs).
  • A High-Performance, High-Concurrency Architecture for Analytics on Azure Aug 12 2021 5:00 pm UTC 59 mins
    Mike Shen, Senior Solutions Architect
    There is good news for the thousands of Excel, Power BI, and SSAS users: a new distributed analytics platform from Kyligence - based on Apache Kylin - is breathing new life into these tools by providing a high-performance data aggregation mechanism.

    This session will explore how a Kylin-powered architecture can achieve sub-second response times for queries against terabytes or even petabytes of Azure data.

    The Kyligence platform provides:
    -An intelligent precomputation layer
    -AI-assisted data modeling and query optimization
    -Virtually limitless concurrency and scale for OLAP, SQL, and MDX analytics on Azure.
  • Addressing the Systemic Shortcomings of Cloud Analytics Aug 5 2021 5:00 pm UTC 83 mins
    Kaige Liu, Sr. Solutions Architect
    Learn how to increase the value of your analytics investment with existing open source technologies like Apache Kylin, Spark, and Mondrian.

    Talk #1: Addressing the Systemic Shortcomings of Cloud Analytics

    As we enter what some have called The Golden Age of Analytics, there are still some fundamental challenges that plague even the largest and most sophisticated cloud analytics adopters. Chief among these is the challenge of scale, often reflected in limitations of concurrency, multi-tenancy, distributed query performance, and all manner of latencies.

    Other less obvious, but equally crucial, challenges of scale and performance have to do with IT and end-user productivity. In other words, there have been few technological advances that enable the quick deployment of big data analytics and the rapid creation of business value from the data being analyzed.

    This presentation will consider a few of these systemic challenges and suggest some ways that they can be addressed with available open source technology such as Apache Kylin, Apache Spark, and Apache Mondrian.

    Talk #2: Accelerating Linux Workload Onboarding Experience on Azure

    Whether you run Linux or Windows, Azure has unlimited capacity to deliver tangible benefits with built in security, hybrid infrastructure, data analysis and intelligence to support your Linux and Open Source Software (OSS) workloads. Our partnership with companies like Kyligence is one of our key strengths.

    In this talk, we will talk about how Azure supports the OSS ecosystem, and how it empowers customers and partners to build their solutions on Azure.
  • Apache Kylin: Simplify Machine Learning on Big Data Jul 29 2021 5:00 pm UTC 39 mins
    Dong Li - Director of Product, Kyligence
    Apache Kylin is an open source analytical data warehouse that has made interactive big data analytics possible. It does so by combining data warehouse and big data technology and by providing a standard ANSI-SQL query interface and sub-second latency for petabyte-scale datasets. This solution has been widely adopted around the world.

    Kylin also enables self-service data analysis for machine learning applications. The integration with auto machine learning technology means that users don't need to be experts in big data and machine learning technology.

    Dong will introduce the architecture and demonstrate how Apache Kylin has simplified machine learning on big data and empowered each end-user to perform advanced self-service analytics.
  • Snowflake: The Good, the Bad, and the Ugly Recorded: Jul 22 2021 59 mins
    Kaige Liu - Sr. Solutions Architect, Kyligence
    Learn how to solve the top 3 challenges Snowflake customers face, and what you can do to ensure high-performance, intelligent analytics at any scale. Ideal for those currently using Snowflake and those considering it.
  • Apache Kylin on Parquet: An Introduction to Kylin’s New Storage Engine Recorded: Jul 15 2021 50 mins
    Kaige Liu - Sr. Solutions Architect, Kyligence
    Discover how Kylin's new Parquet-powered storage engine is delivering better performance than ever before to the world's leading open source query engine for big data. See what's improved and get benchmark comparisons to understand how Kylin's latest update can help your organization deliver faster insights on any size dataset.
  • Kyligence Partner Solutions Day: Accelerate Your Analysis on Microsoft Azure Recorded: Jul 8 2021 100 mins
    Saikat Basu - Sr. Solutions Architect, Kyligence | Matt Basile - Azure Data Program Manager, Microsoft
    Learn how Kyligence can be combined with Microsoft solutions like Azure Synapse, Power BI, and Excel to dramatically accelerate your analytics on any volume of data. Experts from Kyligence and Microsoft will provide a roadmap and tips for getting started optimizing your organization's analytics.
  • How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency Recorded: Jun 24 2021 60 mins
    Kaige Liu - Sr. Solutions Architect, Kyligence
    See how to consistently deliver accurate COUNT DISTINCT queries in under a second, even on petabyte-scale datasets. This presentation will share Apache Kylin’s approach to COUNT DISTINCT queries for user behavior analysis.
  • Providing Interactive Analytics on Excel with Billions of Rows Recorded: Jun 17 2021 53 mins
    Saswata Sengupta - Sr. Solutions Architect, Kyligence
    See how to get lightning-fast query performance on Microsoft Excel that scales into the petabytes. This presentation shares the top challenges Excel faces with big data and outlines strategies to keep Excel running smoothly.
  • Kyligence Cloud 4 - Feature Focus: Spark-Powered Cubing and Indexing Recorded: Jun 9 2021 38 mins
    Mike Shen, Senior Solution Architect
    You’ve moved your data to the cloud, awesome. Now you’re running into issues of concurrency, scale, and cost overruns. But there’s a better way to run your cloud analytics if you think of cloud resources as commodities to conserve and maximize. Sure, you could run the same query from start to finish every time, or you could speed up this process, and save some cash in the process, by precomputing those queries and storing the response for fast retrieval any time, by any number of analysts.

    Kyligence Cloud 4’s Spark-Powered Cubing and Indexing feature provides just that - intelligent precomputation, which fundamentally boils down to low-cost, high-performance analytics. Join us for the fourth part of this series exploring the key features of Kyligence Cloud 4.

    In this webinar you will learn:
    -About modern, cloud era OLAP and cubing theory
    -Performance gains you’ll get from intelligent precomputation
    -How to apply cloud computing and distributed processing
    -Precomputation strategies and tactics

    Register now!
  • Apache Kylin 101: How to Deliver Sub-Second Analytics on Massive Datasets Recorded: Jun 3 2021 60 mins
    Kaige Liu - Sr. Solutions Architect, Kyligence
    See how the world’s leading open source solution for query acceleration on massive datasets is revolutionizing analytics for enterprises across every industry, and how you can get started using it in your organization.
  • Smashing Through Big Data Barriers with Tableau and Snowflake Recorded: Jun 2 2021 56 mins
    Saikat Basu, Senior Solution Architect
    Your analysts are working with more data than ever before in Tableau. Chances are, as the data volumes grow, your teams are experiencing some slowdowns. While it may be tempting to blame Tableau, the most likely explanation for performance and scalability pains lies in your data service layer. What if you could transform the way you do analytics without having to retrain your Tableau users? What if you could get more critical business value out of Tableau, and your data, without disrupting the way your business operates?

    Join us for this session to learn how Tableau could be the ultimate window into ALL of your valuable data, no matter how large. Learn how precomputation technology and AI-augmented query optimization can help you break free of the downward performance spiral of legacy analytics approaches.

    In this webinar, you will learn:
    -How to get the fastest big data analytics experience on Tableau
    -How a unified semantic layer can ensure that your current Tableau users are not disrupted by big data
    -How to improve your analytics operations with automation and machine intelligence

    Plus, you’ll get to see this technology in action during the live Snowflake demo. Enter the onramp to unmatched performance with big data analytics on Tableau. Register now!
  • Kyligence Cloud 4 - Feature Focus: AI-Augmented Engine Recorded: May 26 2021 43 mins
    Robert Hardaway, Senior Solutions Architect
    If you have big data, more and more of your analytics stack needs to be intelligent. Your tools need to be able to anticipate the needs of your analysts, customers, and your business. With the AI-Augmented Engine, this learning process is automated and predictive. It intelligently adapts to user behavior and query patterns and learns to anticipate each users’ needs. Join us for the third installment of this series diving into the core features of Kyligence Cloud 4.

    In this webinar you will learn:
    -How the Kyligence Cloud 4 AI-Augmented Engine works
    -How the AI-Augmented Engine gives optimal efficiency for cube building
    -How the AI-Augmented Engine greatly simplifies data modeling

    See the Kyligence Cloud 4 AI-Augmented Engine live in action during the product demo! Register now.
  • AI-Powered Analytics: What It Is and How It’s Powering Self-Service Analytics Recorded: May 20 2021 53 mins
    Saswata Sengupta - Sr. Solutions Architect, Kyligence
    Empower your analysts with easier access to all the data they need, exactly when they need it - all while reducing workloads for IT and data engineering. This presentation explains why combining AI with self-service analytics can help.
  • Precomputation or Data Virtualization, which one is right for you? Recorded: May 12 2021 54 mins
    Li Kang - VP of North America
    In the world of cloud analytics, what role do precomputation and distributed OLAP play compared with a data virtualization approach? Which should you choose? Do they compete or complement each other? This webinar will address these questions and provide some guidance for how to choose the right approach for your circumstances.

    Both technologies are trying to address a similar challenge: make analytics easily accessible to a wider audience in a modern big data environment. Precomputation focuses on performance, response time, and concurrency in the production environment. Data Virtualization technologies focus on making analysis easily available to users by reducing or eliminating ETL and data warehouses.

    In this webinar we will cover:
    -The key differences between precomputation and data virtualization
    -How your choice between the two affects data quality, security, governance, and TCO
    -The financial impact each of these technologies have on your analytics program

    Register now!
  • How Analytics Teams Using SSAS Can Embrace Big Data and the Cloud Recorded: May 6 2021 46 mins
    Li Kang - Head of North America, Kyligence
    Unburden yourself from the limitations of SSAS, without losing the capabilities you rely on. If you’re ready to modernize your Big Data analytics, this 45-minute webinar delivers the tools and ideas you need to do so.
  • Excel, Data Discovery, and Snowflake: Unhappy Together Recorded: Apr 29 2021 62 mins
    Robert Hardaway, Senior Solutions Architect
    12 not so simple steps to enabling Excel for data discovery

    Microsoft is a recognized leader for BI tools and analytics platforms and there are hundreds of millions of Excel users. Snowflake has captured everyone’s imagination as the Cloud Data Warehouse juggernaut. So why are they not completely happy together?

    In this webinar you will learn:
    -The painful process of data discovery using Excel and Snowflake
    -How Kyligence Cloud enables Excel pivot tables against Snowflake environments
    -What MDX means to the future of cloud analytics

    We will discuss the 12 unhappy steps you must take to make Excel play nice with Snowflake in data discovery. We will also talk about how Apache Kylin and Kyligence Cloud can be used to make the misery go away, and how you can finally run Excel pivot tables directly against Snowflake data and make them so happy together!
  • Kyligence Cloud 4 - Feature Focus: Unified Semantic Layer Recorded: Apr 28 2021 45 mins
    Joanna He - Director, Product Management
    We are in the midst of an analytics explosion. Big data is shooting some to the stars, but most are losing their most valuable business insights to “cave-ins”, mostly due to an aging architecture and institutional sprawl. With different teams using different BI platforms, and the management and maintenance challenges that follow, it’s no surprise that many companies struggle to get the most out of their data.

    Better information comes from better governance.

    By establishing a Unified Semantic Layer that serves both BI teams and data engineers, you can create a common business data dialect that your entire analytics ecosystem can benefit from. It’s simpler than you might think. Kyligence Cloud 4 was released in January of this year and has been getting attention from industry experts for many features, including the Kyligence Cloud 4 Unified Semantic Layer.

    In this webinar you will learn:
    -What is the Kyligence Cloud 4 Unified Semantic Layer?
    -What problems does the Kyligence Cloud 4 Unified Semantic Layer solve?
    -What value does the Kyligence Cloud 4 Unified Semantic Layer bring to your business?

    You will also have the opportunity to see all of this in action during the live product demonstration. Register now!
  • Kyligence Cloud 4 - An Overview Recorded: Mar 24 2021 60 mins
    Li Kang - VP of North America
    In January of this year, Kyligence announced the immediate availability of Kyligence Cloud 4, the first fully cloud-native, distributed OLAP platform. During our announcement, EMA analyst John Santaferraro said:

    “As the race for unified analytics heats up, Kyligence offers a solution that overcomes the challenges of querying data in both data lakes and data warehouses located both in the cloud and on premises.”

    Join Li Kang - VP of North America at Kyligence - as he provides an overview of the Kyligence Cloud 4 release that will show:

    --The new cloud native architecture that employs Apache Kylin, Apache Spark, and Apache Parquet to ensure optimal performance.
    --How KC4 delivers sub-second query responses on very large datasets using precomputed aggregate indexes (hyper-cubes) and table indexes.
    --The AI-Augmented engine that intelligently organizes your data and reduces data modeling time from days/weeks to minutes.

    In this webinar, we will present the Kyligence Cloud 4 story - high-speed analytics with unprecedented sub-second query response times against petabyte datasets.
Expert Insights for Managing Your Organization's Most Valuable Data
Tips and technology walkthroughs you can use to supercharge big data analytics across your organization on any BI tool and any size dataset. Learn how to help your business quickly make data-driven decisions with confidence.

Embed in website or blog

Successfully added emails: 0
Remove all
  • Title: How to Guarantee Exact COUNT DISTINCT Queries with Sub-Second Latency
  • Live at: Jun 24 2021 5:00 pm
  • Presented by: Kaige Liu - Sr. Solutions Architect, Kyligence
  • From:
Your email has been sent.
or close