90 episodes

Gradient Dissent is a machine learning podcast hosted by Lukas Biewald that takes you behind-the-scenes to learn how industry leaders are putting deep learning models in production at Facebook, Google, Lyft, OpenAI, and more.

Gradient Dissent: Exploring Machine Learning, AI, Deep Learning, Computer Vision Lukas Biewald

    • Technology
    • 4.8 • 48 Ratings

Gradient Dissent is a machine learning podcast hosted by Lukas Biewald that takes you behind-the-scenes to learn how industry leaders are putting deep learning models in production at Facebook, Google, Lyft, OpenAI, and more.

    Providing Greater Access to LLMs with Brandon Duderstadt, Co-Founder and CEO of Nomic AI

    Providing Greater Access to LLMs with Brandon Duderstadt, Co-Founder and CEO of Nomic AI

    On this episode, we’re joined by Brandon Duderstadt, Co-Founder and CEO of Nomic AI. Both of Nomic AI’s products, Atlas and GPT4All, aim to improve the explainability and accessibility of AI.
    We discuss:
    - (0:55) What GPT4All is and its value proposition.
    - (6:56) The advantages of using smaller LLMs for specific tasks. 
    - (9:42) Brandon’s thoughts on the cost of training LLMs. 
    - (10:50) Details about the current state of fine-tuning LLMs. 
    - (12:20) What quantization is and what it does. 
    - (21:16) What Atlas is and what it allows you to do.
    - (27:30) Training code models versus language models.
    - (32:19) Details around evaluating different models.
    - (38:34) The opportunity for smaller companies to build open-source models. 
    - (42:00) Prompt chaining versus fine-tuning models.
    Resources mentioned:
    Brandon Duderstadt - https://www.linkedin.com/in/brandon-duderstadt-a3269112a/
    Nomic AI - https://www.linkedin.com/company/nomic-ai/
    Nomic AI Website - https://home.nomic.ai/
    Thanks for listening to the Gradient Dissent podcast, brought to you by Weights & Biases. If you enjoyed this episode, please leave a review to help get the word out about the show. And be sure to subscribe so you never miss another insightful conversation.
    #OCR #DeepLearning #AI #Modeling #ML

    • 1 hr 1 min
    Exploring PyTorch and Open-Source Communities with Soumith Chintala, VP/Fellow of Meta, Co-Creator of PyTorch

    Exploring PyTorch and Open-Source Communities with Soumith Chintala, VP/Fellow of Meta, Co-Creator of PyTorch

    On this episode, we’re joined by Soumith Chintala, VP/Fellow of Meta and Co-Creator of PyTorch. Soumith and his colleagues’ open-source framework impacted both the development process and the end-user experience of what would become PyTorch.
    We discuss:
    - The history of PyTorch’s development and TensorFlow’s impact on development decisions.
    - How a symbolic execution model affects the implementation speed of an ML compiler.
    - The strengths of different programming languages in various development stages.
    - The importance of customer engagement as a measure of success instead of hard metrics.
    - Why community-guided innovation offers an effective development roadmap.
    - How PyTorch’s open-source nature cultivates an efficient development ecosystem.
    - The role of community building in consolidating assets for more creative innovation.
    - How to protect community values in an open-source development environment.
    - The value of an intrinsic organizational motivation structure.
    - The ongoing debate between open-source and closed-source products, especially as it relates to AI and machine learning.


    Resources:
    - Soumith Chintala
    https://www.linkedin.com/in/soumith/
    - Meta | LinkedIn
    https://www.linkedin.com/company/meta/
    - Meta | Website
    https://about.meta.com/
    - Pytorch
    https://pytorch.org/



    Thanks for listening to the Gradient Dissent podcast, brought to you by Weights & Biases. If you enjoyed this episode, please leave a review to help get the word out about the show. And be sure to subscribe so you never miss another insightful conversation.




    #OCR #DeepLearning #AI #Modeling #ML

    • 1 hr 8 min
    Advanced AI Accelerators and Processors with Andrew Feldman of Cerebras Systems

    Advanced AI Accelerators and Processors with Andrew Feldman of Cerebras Systems

    On this episode, we’re joined by Andrew Feldman, Founder and CEO of Cerebras Systems. Andrew and the Cerebras team are responsible for building the largest-ever computer chip and the fastest AI-specific processor in the industry.
    We discuss:
    - The advantages of using large chips for AI work.
    - Cerebras Systems’ process for building chips optimized for AI.
    - Why traditional GPUs aren’t the optimal machines for AI work.
    - Why efficiently distributing computing resources is a significant challenge for AI work.
    - How much faster Cerebras Systems’ machines are than other processors on the market.
    - Reasons why some ML-specific chip companies fail and what Cerebras does differently.
    - Unique challenges for chip makers and hardware companies.
    - Cooling and heat-transfer techniques for Cerebras machines.
    - How Cerebras approaches building chips that will fit the needs of customers for years to come.
    - Why the strategic vision for what data to collect for ML needs more discussion.
    Resources:
    Andrew Feldman - https://www.linkedin.com/in/andrewdfeldman/
    Cerebras Systems - https://www.linkedin.com/company/cerebras-systems/
    Cerebras Systems | Website - https://www.cerebras.net/
    Thanks for listening to the Gradient Dissent podcast, brought to you by Weights & Biases. If you enjoyed this episode, please leave a review to help get the word out about the show. And be sure to subscribe so you never miss another insightful conversation.
    #OCR #DeepLearning #AI #Modeling #ML

    • 1 hr
    Enabling LLM-Powered Applications with Harrison Chase of LangChain

    Enabling LLM-Powered Applications with Harrison Chase of LangChain

    On this episode, we’re joined by Harrison Chase, Co-Founder and CEO of LangChain. Harrison and his team at LangChain are on a mission to make the process of creating applications powered by LLMs as easy as possible.
    We discuss:
    - What LangChain is and examples of how it works. 
    - Why LangChain has gained so much attention. 
    - When LangChain started and what sparked its growth. 
    - Harrison’s approach to community-building around LangChain. 
    - Real-world use cases for LangChain.
    - What parts of LangChain Harrison is proud of and which parts can be improved.
    - Details around evaluating effectiveness in the ML space.
    - Harrison's opinion on fine-tuning LLMs.
    - The importance of detailed prompt engineering.
    - Predictions for the future of LLM providers.

    Resources:

    Harrison Chase - https://www.linkedin.com/in/harrison-chase-961287118/
    LangChain | LinkedIn - https://www.linkedin.com/company/langchain/
    LangChain | Website - https://docs.langchain.com/docs/



    Thanks for listening to the Gradient Dissent podcast, brought to you by Weights & Biases. If you enjoyed this episode, please leave a review to help get the word out about the show. And be sure to subscribe so you never miss another insightful conversation.



    #OCR #DeepLearning #AI #Modeling #ML

    • 51 min
    Deploying Autonomous Mobile Robots with Jean Marc Alkazzi at idealworks

    Deploying Autonomous Mobile Robots with Jean Marc Alkazzi at idealworks

    On this episode, we’re joined by Jean Marc Alkazzi, Applied AI at idealworks. Jean focuses his attention on applied AI, leveraging the use of autonomous mobile robots (AMRs) to improve efficiency within factories and more.
    We discuss:
    - Use cases for autonomous mobile robots (AMRs) and how to manage a fleet of them. 
    - How AMRs interact with humans working in warehouses.
    - The challenges of building and deploying autonomous robots.
    - Computer vision vs. other types of localization technology for robots.
    - The purpose and types of simulation environments for robotic testing.
    - The importance of aligning a robotic fleet’s workflow with concrete business objectives.
    - What the update process looks like for robots.
    - The importance of avoiding your own biases when developing and testing AMRs.
    - The challenges associated with troubleshooting ML systems.
    Resources: 
    Jean Marc Alkazzi - https://www.linkedin.com/in/jeanmarcjeanazzi/
    idealworks |LinkedIn - https://www.linkedin.com/company/idealworks-gmbh/
    idealworks | Website - https://idealworks.com/
    Thanks for listening to the Gradient Dissent podcast, brought to you by Weights & Biases. If you enjoyed this episode, please leave a review to help get the word out about the show. And be sure to subscribe so you never miss another insightful conversation.
    #OCR #DeepLearning #AI #Modeling #ML

    • 58 min
    How EleutherAI Trains and Releases LLMs: Interview with Stella Biderman

    How EleutherAI Trains and Releases LLMs: Interview with Stella Biderman

    On this episode, we’re joined by Stella Biderman, Executive Director at EleutherAI and Lead Scientist - Mathematician at Booz Allen Hamilton.
    EleutherAI is a grassroots collective that enables open-source AI research and focuses on the development and interpretability of large language models (LLMs).
    We discuss:
    - How EleutherAI got its start and where it's headed.
    - The similarities and differences between various LLMs.
    - How to decide which model to use for your desired outcome.
    - The benefits and challenges of reinforcement learning from human feedback.
    - Details around pre-training and fine-tuning LLMs.
    - Which types of GPUs are best when training LLMs.
    - What separates EleutherAI from other companies training LLMs.
    - Details around mechanistic interpretability.
    - Why understanding what and how LLMs memorize is important.
    - The importance of giving researchers and the public access to LLMs.
    Stella Biderman - https://www.linkedin.com/in/stellabiderman/
    EleutherAI - https://www.linkedin.com/company/eleutherai/
    Resources:
    - https://www.eleuther.ai/
    Thanks for listening to the Gradient Dissent podcast, brought to you by Weights & Biases. If you enjoyed this episode, please leave a review to help get the word out about the show. And be sure to subscribe so you never miss another insightful conversation.

    #OCR #DeepLearning #AI #Modeling #ML

    • 57 min

Customer Reviews

4.8 out of 5
48 Ratings

48 Ratings

left_shark ,

Great guests and host

This podcast has a great set of guests and the host does a great job of asking insightful questions. Would recommend for real world practitioners!

kootsoop ,

Thoughtful

I’ve just started listening in the last few weeks, but all the episodes I’ve listened to are thoughtful and grounded. They think around the issues of interest to the guests (and host), and they bring it back to reality and don’t get taken in by the hype. There’s certainly positivity around AI / ML, but with a discerning eye.

Kyler Bruno ,

Hidden knowledge

This podcast is a window into ML of today and the future

Top Podcasts In Technology

Lex Fridman
Jason Calacanis
The New York Times
NPR
Ben Gilbert and David Rosenthal
Jack Rhysider

You Might Also Like

Sam Charrington
Alessio + swyx
Machine Learning Street Talk (MLST)
Changelog Media
Conviction | Pod People
Jon Krohn