    • The Download: Tech Talks by the HPCC Systems Community, Episode 16
      HPCC Systems | Recorded: Aug 2 2018 3:00 pm UTC | 106 mins
    • This episode will feature our 2018 HPCC Systems summer interns:

      Shah Muhammad Hamdi, PhD student, CS at Georgia State University - Dimensionality Reduction and Feature Selection in ECL-ML

      Hamdi will discuss the parallel implementation of Principal Component Analysis (PCA) using the Parallel Block Basic Linear Algebra Subsystem (PBblas) library and ECL implementations of feature selection algorithms for the HPCC Systems platform.

      Robert Kennedy, PhD student in Computer Science at Florida Atlantic University - Parallel Distributed Deep Learning on HPCC Systems

      Robert will cover what he implemented during his summer internship. Combining HPCC Systems and Google’s TensorFlow, Robert created a parallel stochastic gradient descent algorithm to provide a basis for future deep neural network research and to enhance HPCC Systems’ distributed neural network training capabilities.

      Aramis Tanelus, programmer and senior at American Heritage High School, where he is the lead programmer for the Advanced Robotics Team - Developing HPCC Systems Data Ingestion APIs for Common Robotic Sensors

      Aramis’s project will make it easy for anyone in robotics around the world to ingest data from common robotic sensors into an HPCC Systems platform for use in data analysis. Aramis will be speaking about his work on the autonomous agricultural robot and on implementing new packages for the Robot Operating System (ROS) to interface with HPCC Systems for big data analysis.

      Saminda Wijeratne, Masters student, Computational Science and Engineering at Georgia Institute of Technology, Atlanta - MPI Proof of Concept

      The built-in "Message Passing" library in HPCC Systems is designed to handle communication among dissimilar components and to support non-trivial communication patterns among them. Saminda will explore how this library currently operates and how a different implementation could be introduced, such as one built on the widely used Message Passing Interface (MPI).

    • The Download: Tech Talks by the HPCC Systems Community, Episode 17
      HPCC Systems | Upcoming: Sep 13 2018 2:00 pm UTC | 135 mins
    • Speakers and topics for this episode include:

      Lili Xu, Clemson University & Gus Reyna, LexisNexis - Using HPCC Systems ML to Map Thousands of Violation Descriptions to SVCs
      Insurance companies face a challenge in underwriting and rating their policies because states use different descriptions for similar traffic violations. LexisNexis Risk Solutions provides Standard Violation Codes (SVCs) that give insurers one consistent meaning. We explain how HPCC Systems ML addressed the problem of mapping thousands of disparate violation descriptions to a corresponding SVC, and the future of this approach.

      Farah Al Shanik, Clemson University - Equivalence Terms for Text Search Bundle
      Text Search Bundle (TSB) is an open source project for searching XML text documents. It contains many subtasks, one of which is equivalence terms, which can be thought of as strong synonyms for TSB. Term equivalences come in several kinds: initialisms, abbreviations, synonyms, and similarity based on context. We used HPCC Systems to develop a text search tool that uses the Moby thesaurus to return a set of synonyms and the word2vec algorithm to return similar words. We then built a dataset of state names and their abbreviations to return the set of related documents, and improved initialism handling so that TSB finds strings with or without punctuation.

      Soukaina Filali, Georgia State University - Fraud Detection on Transactional Data using a Time Series Mining Approach
      The project consists of distinguishing fraudulent pre-paid cards from non-fraudulent ones using patterns mined from their historical bank transaction data. There are numerous types of card programs, each of which comes with a different level of fraud risk. Every fraud category has representative patterns that a human monitors manually on a daily basis. The goal here is to combine features engineered by domain experts with time series shapelet mining techniques to provide an automated fraud detection solution, which can potentially help in early fraud detection.

    • The Download: Tech Talks by the HPCC Systems Community, Episode 15
      HPCC Systems | Recorded: Jun 28 2018 3:00 pm UTC | 64 mins
    • Join us as we continue this series of webinars specifically designed for the community by the community with the goal to share knowledge, spark innovation and further build and link the relationships within our HPCC Systems community. This episode will feature three speakers on the following topics:

      Jingqing Zhang, Imperial College London
      Deep Sequence Learning and Text Classification

      Bob Foreman, LexisNexis Risk Solutions
      ECL Summer Code Camp Review
      On May 16th, five HPCC Systems Ambassadors along with Flavio Villanustre met with eight iRISE2 members for a two-hour ECL Code Camp. The event was a great success, and I thought I’d share with the community what we did and some of the ECL ideas that came out of it. Tips from Data Ingestion to ECL to Data Evaluation will be included in this segment.

    • The Download: Tech Talks by the HPCC Systems Community, Episode 14
      HPCC Systems | Recorded: May 17 2018 3:00 pm UTC | 88 mins
    • Join us as we continue this series of webinars specifically designed for the community by the community with the goal to share knowledge, spark innovation and further build and link the relationships within our HPCC Systems community. This episode will feature three speakers on the following topics:

      Tai Donovan, Robotics Director, American Heritage School - High School Autonomous Agricultural Project
      A group of 5-6 students is working on an autonomous agricultural project with the goal of providing time-sensitive data to the owner-operator/farmer/grower of a production farm. Tai will discuss their challenges and how he is using HPCC Systems.

      Lorraine Chapman, Consulting Business Analyst, LexisNexis Risk Solutions - Meet Our Summer Interns
      By the end of 2018, ten students will have completed projects as part of the HPCC Systems intern program. Find out about these students, including where and what they are studying, the projects they will be working on and the intern experience we provide to help them feel part of the team. Lorraine will also speak about how you can get involved with the program by being a mentor, or contributing a project idea for a new feature or enhancement to the HPCC Systems platform and/or Machine Learning Library.

      Richard Taylor, Chief Trainer, HPCC Systems, LexisNexis Risk Solutions – Current/Longest Event Sequence by Month
      Richard will discuss processing event dates to discover, for each event within a given time frame, the current number of sequential months in which the event occurred and the longest contiguous month-by-month sequence. This topic is based on questions from one of our Statistical Modelers (new to ECL) regarding how to approach the problem in a non-procedural manner. The example code will make use of the GROUP and HAVING functions.
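
      To make the two quantities in this abstract concrete, here is a minimal Python sketch. It is not the ECL code from the talk, which uses GROUP and HAVING. The function names are hypothetical, and it assumes that the "current" run means the run of consecutive months ending in a chosen as-of month.

```python
from datetime import date


def month_index(d):
    """Map a date to a running month number so consecutive months differ by 1."""
    return d.year * 12 + (d.month - 1)


def month_runs(event_dates, as_of):
    """Return (current_run, longest_run), both counted in months.

    current_run: consecutive months with at least one event, ending in the
                 month of `as_of` (assumed definition, see the lead-in).
    longest_run: longest stretch of consecutive months with an event.
    """
    months = sorted({month_index(d) for d in event_dates})
    if not months:
        return 0, 0

    # Longest run: walk the sorted distinct months and count adjacent ones.
    longest = run = 1
    for prev, cur in zip(months, months[1:]):
        run = run + 1 if cur == prev + 1 else 1
        longest = max(longest, run)

    # Current run: step backwards from the as-of month while events exist.
    present = set(months)
    m = month_index(as_of)
    current = 0
    while m in present:
        current += 1
        m -= 1
    return current, longest


def runs_by_event(records, as_of):
    """Apply month_runs per event name; `records` is (name, date) pairs."""
    grouped = {}
    for name, d in records:
        grouped.setdefault(name, []).append(d)
    return {name: month_runs(dates, as_of) for name, dates in grouped.items()}
```

      For example, an event seen in January, February, March, May and June of 2018, evaluated as of June 2018, gives a current run of 2 months and a longest run of 3 months.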

    • The Download: Tech Talks by the HPCC Systems Community, Episode 13
      HPCC Systems | Recorded: Apr 19 2018 3:00 pm UTC | 98 mins
    • Join us as we continue this series of webinars specifically designed for the community by the community with the goal to share knowledge, spark innovation and further build and link the relationships within our HPCC Systems community.

      Episode 13 includes Tech Talks featuring speakers from our community on the following topics:

      1) The Future of Automotive Telemetry: Assessing Autonomous Vehicle Risk Implications using Simulated Data

      2) Developing A Custom, Pluggable HPCC Systems Security Manager

      3) Understanding the ECL Watch Graphs

      View the full details at hpccsystems.com

    • The Download: Tech Talks by the HPCC Systems Community, Episode 12
      HPCC Systems | Recorded: Mar 15 2018 3:00 pm UTC | 95 mins
    • Join us as we continue this series of webinars specifically designed for the community by the community with the goal to share knowledge, spark innovation and further build and link the relationships within our HPCC Systems community.

      Episode 12 includes Tech Talks featuring speakers from our community on topics covering exploratory data analysis, geospatial solutions and ECL Tips leveraging the HPCC Systems platform.


      1) Itauma Itauma, PhD Candidate, Keiser University - Conducting exploratory data analysis in educational research using HPCC Systems®

      2) Ignacio Calvo, LexisNexis Risk Solutions - Big Data and Geospatial with HPCC Systems®

      3) Bob Foreman, Senior Software Engineer, HPCC Systems, LexisNexis Risk Solutions - ECL Tip of the Month

    • The Download: Tech Talks by the HPCC Systems Community, Episode 11
      HPCC Systems | Recorded: Feb 15 2018 4:00 pm UTC | 91 mins
    • Join us as we continue this series of webinars specifically designed for the community by the community with the goal to share knowledge, spark innovation and further build and link the relationships within our HPCC Systems community.

      Episode 11 includes Tech Talks featuring speakers from our community on topics covering Big Data solutions, Spark Integration and other ECL Tips leveraging the HPCC Systems platform.

      1) Raj Chandrasekaran, CTO & Co-Founder, ClearFunnel - Scaling Data Science capabilities: Leveraging a homogeneous Big Data ecosystem

      2) James McMullan, Software Engineer III, LexisNexis Risk Solutions - HDFS Connector Preview

      3) Bob Foreman, Senior Software Engineer, LexisNexis Risk Solutions - Building a RELATIONal Dataset - A Valentine’s Day Special!

    • The Download: Tech Talks by the HPCC Systems Community, Episode 10
      HPCC Systems | Recorded: Jan 18 2018 4:00 pm UTC | 86 mins
    • Join us as we continue this series of webinars specifically designed for the community by the community with the goal to share knowledge, spark innovation and further build and link the relationships within our HPCC Systems community.

      Episode 10 kicks off our 2018 Tech Talks and includes 15-minute Tech Talks featuring speakers from the community:

      1) Chris Gropp, PhD candidate, Clemson University - Asking the Right Questions with Machine Learning

      The HPCC Systems Machine Learning Library contains a number of powerful tools, but it is important to use them properly. Chris will discuss how to ask the right questions by taking a step backwards from the methods themselves and examining the requirements defined by the applications.

      2) Rodrigo Pastrana, Software Architect, LexisNexis Risk Solutions - Creating Front-facing Web Services to Deliver your HPCC Systems Query Data

      The HPCC Systems platform provides everything you need to easily create production grade web services to deliver your query data. Rodrigo will discuss the tools and frameworks provided by the HPCC Systems platform and walk through the end-to-end creation of a sample web service.

      3) Richard Taylor, Chief Trainer, HPCC Systems, LexisNexis Risk Solutions – ECL Tips and Cool Tricks

      Join Richard for the latest tips and tricks for using ECL! In this session, he will talk about the PARSE function and interesting techniques used in data parsing.

    • The Download: Tech Talks by the HPCC Systems Community, Episode 3
      HPCC Systems | Recorded: Mar 30 2017 3:00 pm UTC | 102 mins
    • This series of workshops is specifically designed for the community by the community with the goal to share knowledge, spark innovation and further build and link the relationships within our HPCC Systems community.

      1. Joselito (Joey) Chua, PhD, Manager Software Engineer, Optimal Decisions Group - Prescriptive Analytics - a Software Engineering Perspective

      This talk presents an overview of prescriptive techniques involving simulation and optimisation, the engineering challenges in building prescriptive tools, and HPCC solutions for those challenges.

      2. Jill Luber, Senior Architect, LexisNexis Risk Solutions - Migrating an ECL code repository into Git, Part II

      This session will take a quick look at a migration plan that moved ECL production code, production processes and developers out of MySQL/SVN and into a Git code management culture. This includes migrating both ROXIE and Thor processes to use Git branches across multiple HPCC Systems environments, all while continuing production data builds and releases.

      3. Michael Gardner, Software Engineer II, LexisNexis Risk Solutions - HPCC Systems Platform: Java APIs and tools

      This presentation covers the Java APIs and tools released by the HPCC Systems Platform team. These projects include wsclient, rdf2hpcc, clienttools, and jdbc. These open source projects, which can be found in the hpcc-systems GitHub repositories, are designed to give downstream developers a consistent means of interfacing with the HPCC Systems Platform and to facilitate the workflow of common tasks a downstream developer might be concerned with.

      4. Bob Foreman, Senior Software Engineer, HPCC Systems, LexisNexis Risk Solutions - In Search of the Lost Tutorial – the best ECL lesson you have never seen.

      In this presentation, Bob will explore David Bayliss’ ECL Bible Tutorial, with particular focus on the GRAPH function and building the inverted index for the ROXIE search.

    • The Download: Tech Talks by the HPCC Systems Community, Episode 2
      HPCC Systems Community | Recorded: Feb 16 2017 4:00 pm UTC | 91 mins
    • The purpose of the workshop will be to share knowledge, spark innovation and further build and link the relationships within our HPCC Systems community.

      1. Fujio Turner, Solutions Architect, Couchbase - Mobile/IoT & HPCC Systems

      Fujio will discuss the challenges around IoT and address the following questions:

      As there are more mobile and embedded devices all generating more data, what does that mean now and for the future?
      What has to change in an organization's infrastructure to keep up?
      And how can I best take advantage of this new stream of information?

      2. Jacob Pellock, Sr Director Software Engineering, LexisNexis - Operationalizing jobs on Thor utilizing Python, Git and HPCC Systems client tools - Part I

      So you’ve set up your HPCC Systems cluster and you’ve written your ECL code. Now you want to take the ECL you’ve written into production. Jacob will explain what technologies we’ve leveraged in bringing our LexisNexis data warehouse into production.

      3. Roger Dev, Sr Architect, LexisNexis - Basic Linear Algebra Subsystem (BLAS) and Parallel Block BLAS (PBBlas) libraries for HPCC Systems.

      Manipulation of matrix data via Linear Algebra operations lies at the heart of many data-mining and machine-learning techniques. New modules for HPCC provide highly scalable and performant implementations of these operations. BLAS provides an industry-standardized set of highly-optimized linear algebra operations. PBBlas extends these operations to mega-scale, splitting the operations into parallelizable units that can be balanced across an HPCC cluster. This talk provides an introduction to BLAS, describes the techniques and features of PBBlas, and provides an overview of the PBBlas interface.
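
      The idea of splitting a matrix operation into parallelizable units can be illustrated with a small block-partitioned multiplication. The Python sketch below is an illustration only and is not the PBBlas interface: all function names are hypothetical, the blocks are processed serially, and plain lists stand in for distributed data. The point it shows is that each block product depends only on one block of each input, which is what would let such units be spread across nodes.

```python
def split_blocks(matrix, size):
    """Partition a list-of-lists matrix into {(block_row, block_col): block}."""
    blocks = {}
    for r0 in range(0, len(matrix), size):
        for c0 in range(0, len(matrix[0]), size):
            blocks[(r0 // size, c0 // size)] = [
                row[c0:c0 + size] for row in matrix[r0:r0 + size]
            ]
    return blocks


def multiply_block(a, b):
    """Ordinary dense product of two small blocks."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def add_block(a, b):
    """Element-wise sum of two blocks of the same shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def block_matmul(a, b, size):
    """Multiply a by b using size x size blocks (edge blocks may be smaller)."""
    a_blocks, b_blocks = split_blocks(a, size), split_blocks(b, size)
    n_i = -(-len(a) // size)      # block rows of a (ceiling division)
    n_k = -(-len(a[0]) // size)   # shared block dimension
    n_j = -(-len(b[0]) // size)   # block columns of b

    result = []
    for i in range(n_i):
        row_of_blocks = []
        for j in range(n_j):
            # Each (i, k, j) product is an independent unit of work.
            acc = None
            for k in range(n_k):
                part = multiply_block(a_blocks[(i, k)], b_blocks[(k, j)])
                acc = part if acc is None else add_block(acc, part)
            row_of_blocks.append(acc)
        # Stitch the blocks in this block row back into full matrix rows.
        for r in range(len(row_of_blocks[0])):
            result.append([v for block in row_of_blocks for v in block[r]])
    return result
```

      In a distributed setting the inner block products would be assigned to different workers and the partial sums combined afterwards; here they simply run in a loop.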

      4. Richard Taylor, Chief Trainer, HPCC Systems - HPCC Systems Training: Updates and Deep Dives on Cool Code

      Richard will be presenting an update on what’s going on with ECL/HPCC/SALT/KEL training courses. He will also be selecting some interesting code snippets.
