HPCC - Open-Source Platform for High-Performance Computing on Large-Scale Data with Bob Foreman
ODSC's Ai X Podcast

In this episode of the Ai X Podcast, Bob Foreman, Lead Software Engineer at LexisNexis Risk Solutions, joins us to discuss the High-Performance Computing Cluster (HPCC) project, an open-source, massively parallel processing platform for data processing and analytics.

Bob has spent more than a decade working with HPCC Systems and with the Enterprise Control Language as a course developer and trainer. Not only is he a highly experienced developer, but he is also the designer of the HPCC Systems online training courses and is the senior instructor for all classroom and remote-based training.

Join him for a deep dive into the HPCC project to discover how it simplifies complex data analysis at scale and why it’s an ideal tool for students, startups, or companies exploring large-scale, data-intensive computing or running a proof of concept (POC) for it.

Topics:
- Guest Background and Professional Development
- Overview of LexisNexis as a company
- Guest's current role
- What high-speed data engineering involves and why it's important in big data solutions
- Main components of the HPCC Systems platform
- Where the HPCC Systems platform fits in the data landscape
- The role of Thor, Roxie, and ECL in the platform
- Example of how ECL (Enterprise Control Language) can be used to manipulate and analyze large datasets
- How Roxie enhances the performance of real-time data querying and analysis
- How to get started with HPCC's open-source massively parallel system
- Working with small sets of data before getting to large data sets
- The Machine Learning Library native to HPCC
- How the HPCC platform fits into the latest trends of Machine Learning and Generative AI
- HPCC-related events in 2024
- ODSC workshops and HPCC community initiatives
- How to follow HPCC updates

Show Notes:
Learn more about Bob Foreman: https://www.linkedin.com/in/bobforeman/
Learn more about the HPCC Platform: https://github.com/hpcc-systems/HPCC-Platform | https://hpccsystems.com/about/#Platform
HPCC ECL bundles: https://github.com/hpcc-systems/ecl-bundles
HPCC Systems Machine Learning Library: https://hpccsystems.com/download/free-modules/hpcc-systems-machine-learning-library/

Bob’s Educational Resources
Bob’s online course: https://hpccsystems.com/training/free-online-learning-with-hpcc-systems/

HPCC community initiatives: https://www.missingkids.org/ourwork/ncmecdata

This episode was sponsored by:
Ai+ Training https://aiplus.training/
Home to hundreds of hours of on-demand, self-paced AI training, ODSC interviews, free webinars, and certifications in in-demand skills like LLMs and Prompt Engineering

And created in partnership with ODSC https://odsc.com/
The Leading AI Training Conference, featuring expert-led, hands-on workshops, training sessions, and talks on cutting-edge AI topics and

Never miss an episode, subscribe now!
