41 episodes

TalkRL podcast is All Reinforcement Learning, All the Time.
In-depth interviews with brilliant people at the forefront of RL research and practice.
Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute.
Hosted by Robin Ranjit Singh Chauhan.

TalkRL: The Reinforcement Learning Podcast Robin Ranjit Singh Chauhan

    • Technology
    • 4.9 • 22 Ratings

TalkRL podcast is All Reinforcement Learning, All the Time.
In-depth interviews with brilliant people at the forefront of RL research and practice.
Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute.
Hosted by Robin Ranjit Singh Chauhan.

    Natasha Jaques 2

    Natasha Jaques 2

    Hear about why OpenAI cites her work in RLHF and dialog models, approaches to rewards in RLHF, ChatGPT, Industry vs Academia, PsiPhi-Learning, AGI and more! 
    Dr Natasha Jaques is a Senior Research Scientist at Google Brain.
    Featured References
    Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog Natasha Jaques, Asma Ghandeharioun, Judy Hanwen Shen, Craig Ferguson, Agata Lapedriza, Noah Jones, Shixiang Gu, Rosalind Picard  Sequence Tutor: Conservative Fine-Tuning of Sequence Generation Models with KL-control Natasha Jaques, Shixiang Gu, Dzmitry Bahdanau, José Miguel Hernández-Lobato, Richard E. Turner, Douglas Eck  PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning Angelos Filos, Clare Lyle, Yarin Gal, Sergey Levine, Natasha Jaques, Gregory Farquhar 
    Basis for Intentions: Efficient Inverse Reinforcement Learning using Past Experience Marwa Abdulhai, Natasha Jaques, Sergey Levine 
    Additional References  


    Fine-Tuning Language Models from Human Preferences, Daniel M. Ziegler et al 2019  

    Learning to summarize from human feedback, Nisan Stiennon et al 2020  

    Training language models to follow instructions with human feedback, Long Ouyang et al 2022  

    • 46 min
    Jacob Beck and Risto Vuorio

    Jacob Beck and Risto Vuorio

    Jacob Beck and Risto Vuorio on their recent Survey of Meta-Reinforcement Learning.  Jacob and Risto are Ph.D. students at Whiteson Research Lab at University of Oxford.   
    Featured Reference   
    A Survey of Meta-Reinforcement LearningJacob Beck, Risto Vuorio, Evan Zheran Liu, Zheng Xiong, Luisa Zintgraf, Chelsea Finn, Shimon Whiteson   
    Additional References  


    VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning, Luisa Zintgraf et al  

    Mastering Diverse Domains through World Models (Dreamerv3), Hafner et al    

    Unsupervised Meta-Learning for Reinforcement Learning (MAML), Gupta et al  

    Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices (DREAM), Liu et al  

    RL2: Fast Reinforcement Learning via Slow Reinforcement Learning, Duan et al  

    Learning to reinforcement learn, Wang et al  

    • 1 hr 7 min
    John Schulman

    John Schulman

    John Schulman, OpenAI cofounder and researcher, inventor of PPO/TRPO talks RL from human feedback, tuning GPT-3 to follow instructions (InstructGPT) and answer long-form questions using the internet (WebGPT), AI alignment, AGI timelines, and more!

    • 44 min
    Sven Mika

    Sven Mika

    Sven Mika of Anyscale on RLlib present and future, Ray and Ray Summit 2022, applied RL in Games / Finance / RecSys, and more!

    • 34 min
    Karol Hausman and Fei Xia

    Karol Hausman and Fei Xia

    Karol Hausman and Fei Xia of Google Research on newly updated (PaLM-)SayCan, Inner Monologue, robot learning, combining robotics with language models, and more!

    • 1 hr 3 min
    Sai Krishna Gottipati

    Sai Krishna Gottipati

    Sai Krishna Gottipati of AI Redefined on RL for synthesizable drug discovery, Multi-Teacher Self-Play, Cogment framework for realtime multi-actor RL, AI + Chess, and more!

    • 1 hr 8 min

Customer Reviews

4.9 out of 5
22 Ratings

22 Ratings

Alcool91 ,

Exposes a wide array of topics

I am a first-year PhD student and I love this podcast. It helps me to be exposed to so many idea and find fellow RL researchers. Thank you Robin for putting this together.

If you are looking for a podcast to help you advance in the field and become a better RL researcher than this is the podcast for you!5

natolambert ,

Unexpected gem to learn deeply about RL

I had the pleasure of finding this podcast as a listener and then being on it within a month or two. Robin does a great job and is here to help improve the experience for the community.

Will help listeners rapidly get up to date with the happenings in reinforcement learning.

Jackson jabba ,

Love the pod cast!

Great to have a podcast that digs into the technical detail of RL!!

Top Podcasts In Technology

Jason Calacanis
Lex Fridman
The New York Times
NPR
Reid Hoffman
Jack Rhysider

You Might Also Like

Machine Learning Street Talk (MLST)
Sam Charrington
Changelog Media
NVIDIA
Dwarkesh Patel
Andreessen Horowitz