57 min

Pierluca D'Oro and Martin Klissarov TalkRL: The Reinforcement Learning Podcast

    • Technology

Pierluca D'Oro and Martin Klissarov on Motif and RLAIF, Noisy Neighborhoods and Return Landscapes, and more!  
Pierluca D'Oro is PhD student at Mila and visiting researcher at Meta.
Martin Klissarov is a PhD student at Mila and McGill and research scientist intern at Meta.  
Featured References 
Motif: Intrinsic Motivation from Artificial Intelligence Feedback  Martin Klissarov*, Pierluca D'Oro*, Shagun Sodhani, Roberta Raileanu, Pierre-Luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff  Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control  Nate Rahn*, Pierluca D'Oro*, Harley Wiltzer, Pierre-Luc Bacon, Marc G. Bellemare 
To keep doing RL research, stop calling yourself an RL researcher Pierluca D'Oro 

Pierluca D'Oro and Martin Klissarov on Motif and RLAIF, Noisy Neighborhoods and Return Landscapes, and more!  
Pierluca D'Oro is PhD student at Mila and visiting researcher at Meta.
Martin Klissarov is a PhD student at Mila and McGill and research scientist intern at Meta.  
Featured References 
Motif: Intrinsic Motivation from Artificial Intelligence Feedback  Martin Klissarov*, Pierluca D'Oro*, Shagun Sodhani, Roberta Raileanu, Pierre-Luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff  Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control  Nate Rahn*, Pierluca D'Oro*, Harley Wiltzer, Pierre-Luc Bacon, Marc G. Bellemare 
To keep doing RL research, stop calling yourself an RL researcher Pierluca D'Oro 

57 min

Top Podcasts In Technology

Podcast o technologii
Kanał o technologii
Bo czemu nie?
Krzysztof Kołacz
Acquired
Ben Gilbert and David Rosenthal
Techstorie - rozmowy o technologiach
TOK FM - Sylwia Czubkowska, Joanna Sosnowska
Lex Fridman Podcast
Lex Fridman
AI CODZIENNIE - czyli co słychać w sztucznej inteligencji
Michał Dobrzański