Hi [[ session.user.profile.firstName ]]

HPCC Systems Open Source Big Data Platform

  • Date
  • Rating
  • Views
  • The Download: Tech Talks by the HPCC Systems Community, Episode 24
    The Download: Tech Talks by the HPCC Systems Community, Episode 24
    HPCC Systems Recorded: May 23 2019 82 mins
    Join us as we continue this series of webinars specifically designed for the community by the community with the goal to share knowledge, spark innovation, and further build and link the relationships within our HPCC Systems community.

    Featured speakers include:
    Itauma Itauma,PhD, Keiser University, HPCC Systems Community Innovator - Cervical Cancer Risk Factors: Exploratory Analysis using HPCC Systems
    Cervical cancer is a leading cause of cancer-related death among women with about half a million new cases worldwide in 2018. 90% of cervical cancer deaths occur in low resource settings. This mortality could be reduced through effective prevention, screening and treatment programs. I will explain how an exploratory analysis of a cervical cancer database was performed using HPCC Systems with data visualizations and how the findings could be beneficial.

    Itauma Itauma has a PhD in Instructional Design and Technology from Keiser University and is a student in the Harvard Business Analytics Program.

    Lili Xu, Software Engineer III, LexisNexis Risk Solutions - Automatically cluster your data with the HPCC Systems massively scalable K-Means machine learning bundle
    Imagine you are sitting in front of thousands of articles and trying to organize them into different folders. How would you accomplish it and how long would you expect to finish it? If you have some sort of data but have no clue how to efficiently cluster them, then Lili's talk will provide insight on a great place to start.

    Lili Xu is in the final stages of completing her PhD in Computer Science at Clemson University. Now an employee, Lili has completed three internships with HPCC Systems on machine learning.

    Richard Taylor, Chief Trainer, HPCC Systems, LexisNexis® Risk Solutions - ECL Tips and Tricks: DICTIONARY does it!
    Richard Taylor has worked with the HPCC Systems technology platform and the ECL programming language for over 15 years.
  • The Download: Tech Talks by the HPCC Systems Community, Episode 23
    The Download: Tech Talks by the HPCC Systems Community, Episode 23
    HPCC Systems Recorded: Apr 25 2019 75 mins
    Join us as we continue this series of webinars specifically designed for the community by the community with the goal to share knowledge, spark innovation, and further build and link the relationships within our HPCC Systems community.

    Featured speakers include:
    Jeremy Meier and David Noh, both Undergraduate Students at Clemson University - An Investigation into Time Series Analysis
    Over the past several months, our team has worked closely with a dataset having roughly 16,000 total observations, recording both the date and balance in financial data. Focusing on individual accounts with a size of around 400 observations, our first goal was to compare statistical metrics and techniques used commonly in time series analysis on the given data sets. We dove deep into two major industry standard methods for understanding and predicting on a dataset. Using insights learned from these observations, we hope to better predict future balances in the dataset, as well as find any anomalies or misbehavior in the data in order to provide business value.

    Roger Dev, Sr Architect, LexisNexis Risk Solutions - TextVectors - Machine Learning for Textual Data
    Text Vectorization allows for the mathematical treatment of textual information. Words, phrases, sentences, and paragraphs can be organized as points in high-dimensional space such that closeness in space implies closeness of meaning. HPCC Systems' new TextVectors module supports vectorization for words, phrases, or sentences in a parallelized, high-performance, and user-friendly package.

    Allan Wrobel, Consulting Software Engineer, LexisNexis Risk Solutions - ECL Tips and Tricks: Leveraging the power of HPCC Systems. Using AGGREGATE.
    The ECL built-in function AGGREGATE has been seen by many in the community as ‘complex’ and as such has been underused. However in using AGGREGATE you can be sure you’re playing to the strengths of HPCC Systems.
  • HPCC Systems Community Focus: 5 Questions with Jo Prichard
    HPCC Systems Community Focus: 5 Questions with Jo Prichard
    Jo Prichard, Flavio Villanustre Recorded: Apr 17 2019 24 mins
    In this session, we are highlighting some of the rock stars of the HPCC Systems Community. Today's session is 5 Questions with Jo Prichard.
  • The Download: Tech Talks by the HPCC Systems Community, Episode 22
    The Download: Tech Talks by the HPCC Systems Community, Episode 22
    HPCC Systems Recorded: Mar 21 2019 93 mins
    Join us as we continue this series of webinars specifically designed for the community by the community with the goal to share knowledge, spark innovation, and further build and link the relationships within our HPCC Systems community.

    Featured speakers include:

    Vincent Freeh, Professor NC State University, HPCC Systems as a Service (Haas)
    There are numerous reasons to use an IaaS for HPCC Systems instead of dedicated hardware, especially if the workload does not execute 24/7. We developed a CloudFormation Template and an AMI for HPCC Systems and a reference architecture for HPCC Systems in AWS. Significant effort was expended to determine the best set of resources for HPCC Systems clusters. Furthermore, we created a program to create and manage HPCC Systems clusters in AWS from the command line. This talk will present the tools we created and also explain the reference architecture and many of the configuration options.

    David de Hilster, Consulting Software Engineer, LexisNexis Risk Solutions, New ECL IDE Features in 7.0
    The ECL IDE is an integrated development environment for ECL programmers to create, edit, and execute ECL code within the HPCC Systems platform. The latest 7.0 version includes new features and enhancements such as a more comprehensive autocomplete, tooltips and F12 capabilities. In this talk, David will discuss how users can leverage these features and more.

    Bob Foreman, Senior Software Engineer, HPCC Systems, LexisNexis Risk Solutions - ECL Tip: A Tiny Trove of TABLE Tidbits
    This month’s ECL Tip of the Month will focus on the ECL TABLE Function. Common (and some not so common) use cases will be discussed. Code example demonstrated will also be available for download.
  • HPCC Systems Community Focus: 5 Questions with Anupam Sengupta
    HPCC Systems Community Focus: 5 Questions with Anupam Sengupta
    Anupam Sengupta, Flavio Villanustre Recorded: Mar 15 2019 20 mins
    In this session, we are highlighting some of the rock stars of the HPCC Systems Community. Today's session is 5 Questions with Anupam Sengupta.

    Anupam is a co-founder and the CTO of GuardHat.
  • The Download: Tech Talks by the HPCC Systems Community, Episode 21
    The Download: Tech Talks by the HPCC Systems Community, Episode 21
    HPCC Systems Recorded: Feb 21 2019 84 mins
    Join us as we continue this series of webinars specifically designed for the community by the community with the goal to share knowledge, spark innovation, and further build and link the relationships within our HPCC Systems community.

    Featured speakers include:

    Adwait Joshi, CEO DataSeers - HPCC Systems - An IoT use case for Payments
    Traditionally we all have used Thor for data processing and ROXIE indexes for data pulls. Think about using ROXIE for a data ingest and Thor directly pulling data into the back end repository. This talk will explain about how DataSeers has designed a realtime transaction monitoring system using HPCC Systems, Kafka, ElasticSearch and MySQL pushing the envelope for a typical use case. Learn the roadblocks we encountered, how we worked around them, and how we hardened the system to be truly disaster resistant with all open source technologies.

    Yanrui Ma, Software Architect, LexisNexis Risk Solutions - Dynamic ESDL Has Become More Dynamic In 7.0
    In this talk, Yanrui will talk about some of the major changes with Dynamic ESDL in 7.0, with a focus on the mechanisms and enhancements that have made it even more dynamic. He’ll give a demo of creating a DESDL service with the improved “esdl” command line to show you how easy and quick it can be. He’ll also go over DESDL related ECL Watch changes in 7.0, and some of the upcoming DESDL features.

    Bob Foreman, Senior Software Engineer, HPCC Systems, LexisNexis Risk Solutions - ECL Tip: All About the ECL SET
    This month’s ECL Tip spotlights the ECL SET definition, value type, and other supported functions that use it. Several code examples and best practices will be demonstrated.
  • HPCC Systems Community Focus: 5 Questions with Richard Chapman
    HPCC Systems Community Focus: 5 Questions with Richard Chapman
    Richard Chapman, Flavio Villanustre Recorded: Feb 14 2019 17 mins
    In this session, we are highlighting some of the rock stars of the HPCC Systems Community. Today's session is 5 Questions with Richard Chapman.

    Richard has been with LexisNexis Risk Solutions for more than 25 years. He is the VP of Research and Development and the leader of the HPCC Systems development team. Richard wrote the code to create the HPCC Systems query cluster, also known as ROXIE which stands for Richard’s Online XML Inquiry Engine. He was one of the original designers of ECL which was created as a data centric programming language for easily expressing problems involving large quantities of data.
  • The Download: Tech Talks by the HPCC Systems Community, Episode 20
    The Download: Tech Talks by the HPCC Systems Community, Episode 20
    HPCC Systems Recorded: Jan 24 2019 80 mins
    Join us as we continue this series of webinars specifically designed for the community by the community with the goal to share knowledge, spark innovation, and further build and link the relationships within our HPCC Systems community.
    Featured speakers and topics include:

    •Rob Mansfield, Senior Data Scientist, Proagrica - Dapper - A bundle to make your ECL neater
    Have you ever written a long project for a simple column rename and thought, this should be easier? What about nicely named output statements? Yeah they bother me too. Oh, and DEDUP(SORT(DISTINCT()))? There is a better way! Learn how dapper can help!

    •Bob Foreman, Senior Software Engineer, HPCC Systems, LexisNexis Risk Solutions - ECL Tip: The Seven Faces (Forms) of Dr. LOOP (Function)
    The LOOP function has always been a powerful, yet tough ECL function to understand and use. Bob will review and examine the upcoming major changes to this documentation and showcase new examples.

    •Lorraine Chapman, Consulting Business Analyst, LexisNexis Risk Solutions - Update on Academic Collaboration
    Lorraine will share an update on recent collaboration, upcoming academic events and the 2019 HPCC Systems Internship Program.
  • HPCC Systems Community Focus: 5 Questions with Lili Xu
    HPCC Systems Community Focus: 5 Questions with Lili Xu
    Lili Xu, Flavio Villanustre Recorded: Jan 24 2019 15 mins
    In this session, we are highlighting some of the rock stars of the HPCC Systems Community. Today's session is 5 Questions with Lil Xu.

    Lili is in the final stages of completing her PhD in Computer Science. She has worked in the DICE lab directed by Dr. Apon in the school of computing at Clemson University.

    Lili has completed three internships with the HPCC Systems team, working on machine learning applications. Her research area is machine learning, natural language processing and high performance computing. We are pleased that Lili has joined the team as a LexisNexis employee.
  • HPCC Systems Community Focus: 5 Questions with Amy Apon
    HPCC Systems Community Focus: 5 Questions with Amy Apon
    Flavio Villanustre and Amy Apon Recorded: Dec 18 2018 12 mins
    In this session, we are highlighting some of the rock stars of the HPCC Systems Community. Today's session is 5 Questions with Amy Apon, Ph.D.

    Dr. Apon maintains an active research program at Clemson. Areas of research interest include cloud computing, performance modeling and analysis of parallel and distributed system, data-intensive computing, emerging parallel architectures, and impact of high performance computing to research competitiveness. Her research is currently supported by the National Science Foundation, the Department of Education, BMW, HPCC Systems, LexisNexis, Elsevier Scopus, RELX Group, and Amazon.

Embed in website or blog