10 episodes

A writer and a software engineer from Google's People + AI Research team explore the human choices that shape machine learning systems by building competing tic-tac-toe agents.

Tic-Tac-Toe the Hard Way
People + AI Research

    • Technology

    Howdy, and the myth of “pouring in data”

    David and Yannick get started on their project to build competing machine learning systems that play tic-tac-toe. They discuss the human choices that will shape their systems along the way.

    • 22 min
    What does a tic-tac-toe board look like to machine learning?

    David delves into questions around data and training for his model including: What does a tic-tac-toe board “look” like to ML? Plus, an intro to reinforcement learning, the approach Yannick will be taking.

    • 23 min
    From tic-tac-toe moves to ML model

    Once we have the data we need—thousands of sample games—how do we turn it into something the ML can train itself on? That means understanding how training works, and what a model is.

    • 21 min
    Beating random: What it means to have trained a model

    David did it! He trained a machine learning model to play tic-tac-toe! How did his model do against a player that makes random tic-tac-toe moves?

    • 17 min
    Give that model a treat! : Reinforcement learning explained

    Switching gears, we focus on how Yannick’s been training his model using reinforcement learning. He explains the differences from David’s supervised learning approach. We find out how his system performs against a player that makes random tic-tac-toe moves.

    • 26 min
    Head to Head: the Big ML Smackdown!

    David and Yannick’s tic-tac-toe ML agents face off against each other!

    • 25 min
