Remote HPCC Systems/ECL Training

Abstract: 

Learn why the truly open source HPCC Systems platform is Better at Big Data and learn how ECL can empower you to build powerful data queries with ease. HPCC Systems is a comprehensive, dedicated data lake platform makes combining different types of data easier and faster than competing platforms — even data stored in massive, mixed schema data lakes — and it scales very quickly as your data needs grow.


Data Visualization, NLP, and ML made easy with HPCC Systems ECL, is for ODSC West Attendees who want to explore and learn the Open Source HPCC Systems platform and ECL language in three different phases. The Workshop will take students through the basics of the HPCC Systems Enterprise Control Language, and demonstrate Data Patterns and Visualization, Natural Language Parsing, and an introduction to ECL Machine Learning using a Learning Trees Regression Model. Code examples and lesson materials will be included.

Session Outline
Phase 1 – Introduction to HPCC Systems/ECL and Data Visualization
The first hour unleashes the power of ECL with Data Ingestion, Hygiene, Profiling, and Visualization.

Phase 2 – ECL and Natural Language Processing
Exploring the ECL support for NLP, including Parsing PATTERNs, the versatile PARSE function, and NLP Record Functions.

Phase 3 – ECL and Machine Learning
An introduction to Machine Learning and how it is used with ECL, using the powerful and easy-to-use HPCC ML Bundles. We will explore the ease of setup and demonstrate a simple Learning Trees example.

Bio: 

Bob Foreman has worked with LexisNexis and the open source big data HPCC Systems technology platform and the ECL programming language for more than 9 years, and has been a technical trainer for over 25 years. He is the developer and designer of the HPCC Systems Online Training Courses, and is the Senior Instructor for all classroom and remote training. This includes: Introduction to ECL (Part 1 and 2) - Concepts and Queries, the Extract, Transform and Load (ETL) Process, Advanced ECL (Part 1 and 2) - Working with Relational Data, Super files, Working with XML, and Free-form Text Parsing, ROXIE ECL - Indexes and Queries, Complex Query Development, and Applied ECL – ECL Code Generation Tools.

Hugo supports the development and delivery of training programs for the HPCC Systems platform in the Brazil region since 2019. Hugo has worked for over 15 years on various technical roles in the IT industry with a focus on High Performance Computing. He is also a part time researcher on Information Systems and a member of the UK Academy for Information Systems.