Latest Tic tac toe Podcast Episodes

Lessons learned

Tic-Tac-Toe the Hard Way - July 22, 2020 15:58 - 33 minutes
What have we learned about machine learning and the human decisions that shape it? And is machine learning perhaps changing our minds about how the world outside of machine learning — also known as the world — works? For more information about the show, check out pair.withgoogle.com/th...

Technology Education machine learning human-centered reinforcement learning supervised learning tic-tac-toe

Head to Head: The Even Bigger ML Smackdown!

Tic-Tac-Toe the Hard Way - July 22, 2020 15:55 - 24 minutes
Yannick and David’s systems play against each other in 500 games. Who’s going to win? And what can we learn about how the ML may be working by thinking about the results? See the agents play each other in Tic-Tac-Two! For more information about the show, check out pair.withgoogle.com/...

Technology Education machine learning human-centered reinforcement learning supervised learning tic-tac-toe

Enter tic-tac-two

Tic-Tac-Toe the Hard Way - July 22, 2020 15:50 - 21 minutes
David’s variant of tic-tac-toe that we’re calling tic-tac-two is only slightly different but turns out to be far more complex. This requires rethinking what the ML system will need in order to learn how to play, and how to represent that data. For more information about the show, chec...

Technology Education machine learning human-centered reinforcement learning supervised learning tic-tac-toe

Head to Head: the Big ML Smackdown!

Tic-Tac-Toe the Hard Way - July 22, 2020 15:45 - 25 minutes
David and Yannick’s tic-tac-toe ML agents face-off against each other in tic-tac-toe! See the agents play each other! For more information about the show, check out pair.withgoogle.com/thehardway/. You can reach out to the hosts on Twitter: @dweinberger and @tafsiri.

Technology Education machine learning human-centered reinforcement learning supervised learning tic-tac-toe

Give that model a treat! : Reinforcement learning explained

Tic-Tac-Toe the Hard Way - July 22, 2020 15:35 - 26 minutes
Switching gears, we focus on how Yannick’s been training his model using reinforcement learning. He explains the differences from David’s supervised learning approach. We find out how his system performs against a player that makes random tic-tac-toe moves. Resources: Deep Learning ...

Technology Education machine learning human-centered reinforcement learning supervised learning tic-tac-toe

Beating random: What it means to have trained a model

Tic-Tac-Toe the Hard Way - July 22, 2020 15:30 - 17 minutes
David did it! He trained a machine learning model to play tic-tac-toe! (Well, with lots of help from Yannick.) How did the whole training experience go? How do you tell how training went? How did his model do against a player that makes random tic-tac-toe moves? For more information a...

Technology Education machine learning human-centered reinforcement learning supervised learning tic-tac-toe

From tic-tac-toe moves to ML model

Tic-Tac-Toe the Hard Way - July 22, 2020 13:55 - 21 minutes
Once we have the data we need—thousands of sample games--how do we turn it into something the ML can train itself on? That means understanding how training works, and what a model is. Resources: See a definition of one-hot encoding For more information about the show, check out pair....

Technology Education machine learning human-centered reinforcement learning supervised learning tic-tac-toe

What does a tic-tac-toe board look like to machine learning?

Tic-Tac-Toe the Hard Way - July 22, 2020 13:38 - 23 minutes
How should David represent the data needed to train his machine learning system? What does a tic-tac-toe board “look” like to ML? Should he train it on games or on individual boards? How does this decision affect how and how well the machine will learn to play? Plus, an intro to reinfor...

Technology Education machine learning human-centered reinforcement learning supervised learning tic-tac-toe

Howdy, and the myth of “pouring in data”

Tic-Tac-Toe the Hard Way - July 21, 2020 21:05 - 22 minutes
Welcome to the podcast! We’re Yannick and David, a software engineer and a non-technical writer. Over the next 9 episodes we’re going to use two different approaches to build machine learning systems that play two versions of tic-tac-toe. Building a machine learning app requires humans ...

Technology Education machine learning human-centered reinforcement learning supervised learning tic-tac-toe

Introducing Tic-Tac-Toe the Hard Way

Tic-Tac-Toe the Hard Way - July 21, 2020 20:57 - 2 minutes
Introducing the podcast where a writer and a software engineer explore the human choices that shape machine learning systems by building competing tic-tac-toe agents. Brought to you by Google's People + AI Research team. More at: pair.withgoogle.com/thehardway

Technology Education machine learning human-centered reinforcement learning supervised learning tic-tac-toe