1h 57 min

Can we build a generalist agent? Dr. Minqi Jiang and Dr. Marc Rigter Machine Learning Street Talk (MLST)

    • Tecnologia

Dr. Minqi Jiang and Dr. Marc Rigter explain an innovative new method to make the intelligence of agents more general-purpose by training them to learn many worlds before their usual goal-directed training, which we call "reinforcement learning".

Their new paper is called "Reward-free curricula for training robust world models" https://arxiv.org/pdf/2306.09205.pdf

https://twitter.com/MinqiJiang
https://twitter.com/MarcRigter

Interviewer: Dr. Tim Scarfe

Please support us on Patreon, Tim is now doing MLST full-time and taking a massive financial hit. If you love MLST and want this to continue, please show your support! In return you get access to shows very early and private discord and networking. https://patreon.com/mlst

We are also looking for show sponsors, please get in touch if interested mlstreettalk at gmail.

MLST Discord: https://discord.gg/machine-learning-street-talk-mlst-937356144060530778

Dr. Minqi Jiang and Dr. Marc Rigter explain an innovative new method to make the intelligence of agents more general-purpose by training them to learn many worlds before their usual goal-directed training, which we call "reinforcement learning".

Their new paper is called "Reward-free curricula for training robust world models" https://arxiv.org/pdf/2306.09205.pdf

https://twitter.com/MinqiJiang
https://twitter.com/MarcRigter

Interviewer: Dr. Tim Scarfe

Please support us on Patreon, Tim is now doing MLST full-time and taking a massive financial hit. If you love MLST and want this to continue, please show your support! In return you get access to shows very early and private discord and networking. https://patreon.com/mlst

We are also looking for show sponsors, please get in touch if interested mlstreettalk at gmail.

MLST Discord: https://discord.gg/machine-learning-street-talk-mlst-937356144060530778

1h 57 min

Top de podcasts em Tecnologia

IA: A Próxima Vaga
Francisco Pinto Balsemão
Acquired
Ben Gilbert and David Rosenthal
Lex Fridman Podcast
Lex Fridman
Ciber Minuto
Observador Lab
Darknet Diaries
Jack Rhysider
Waveform: The MKBHD Podcast
Vox Media Podcast Network