Noam Brown

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And the person responded that it's just so much harder to make an AI that can talk with you and cooperate with you than it is to make an AI that can fight you.

1016.177 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And I think once this technology develops further and you can reach a point where not every single line of dialogue has to be scripted, it unlocks a lot of potential for new kinds of games, much more positive interactions that are not so focused on fighting.

1024.613 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And I'm really looking forward to that.

1038.526 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

All right.

1107.392 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

So there's different ways to find a Nash Equilibrium.

1107.653 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

So the way that we do it is with this process called self-play.

1109.956 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Basically, we have this algorithm that starts by playing totally randomly, and it learns how to play the game by playing against itself.

1115.503 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

So it will start playing the game totally randomly.

1123.788 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And then if it's playing poker, it'll eventually get to the end of the game and make $50.

1127.735 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And then it will review all of the decisions that it made along the way and say, what would have happened if I had chosen this other action instead?

1133.304 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

You know, if I had raised here instead of called, what would the other player have done?

1142.013 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And because it's playing against a copy of itself, it's able to do that counterfactual reasoning.

1146.679 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

So they can say, okay, well, if I took this action and the other person takes this action and then I take this action and eventually I make $150 instead of 50.

1150.804 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And so it updates the regret value for that action.

1159.696 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Regret is basically like how much does it regret having not played that action in the past?

1164.442 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And when it encounters that same situation again, it's going to pick actions that have higher regret with higher probability.

1168.692 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Now, it'll just keep simulating the games this way.

1175.922 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

It'll keep accumulating regrets for different situations.

1179.127 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And in the long run, if you pick actions that have higher regret with higher probability in the correct way, it's proven to converge to a Nash equilibrium.

1182.812 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

It's true for all games.

1196.705 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment