How should David represent the data needed to train his machine learning system? What does a tic-tac-toe board “look” like to ML? Should he train it on games or on individual boards? How does this decision affect how and how well the machine will learn to play? Plus, an intro to reinforcement learning, the approach Yannick will be taking.
For more information about the show, check out pair.withgoogle.com/thehardway.
You can reach out to the hosts on Twitter: @dweinberger and @tafsiri.
信息
- 节目
- 频率一日一更
- 发布时间2020年7月22日 UTC 13:38
- 长度23 分钟
- 单集2
- 分级儿童适宜