Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Noam Brown

๐Ÿ‘ค Speaker
1199 total appearances

Appearances Over Time

Podcast Appearances

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And the person responded that it's just so much harder to make an AI that can talk with you and cooperate with you than it is to make an AI that can fight you.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And I think once this technology develops further and you can reach a point where not every single line of dialogue has to be scripted, it unlocks a lot of potential for new kinds of games, much more positive interactions that are not so focused on fighting.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And I'm really looking forward to that.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

All right.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

So there's different ways to find a Nash Equilibrium.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

So the way that we do it is with this process called self-play.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Basically, we have this algorithm that starts by playing totally randomly, and it learns how to play the game by playing against itself.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

So it will start playing the game totally randomly.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And then if it's playing poker, it'll eventually get to the end of the game and make $50.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And then it will review all of the decisions that it made along the way and say, what would have happened if I had chosen this other action instead?

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

You know, if I had raised here instead of called, what would the other player have done?

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And because it's playing against a copy of itself, it's able to do that counterfactual reasoning.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

So they can say, okay, well, if I took this action and the other person takes this action and then I take this action and eventually I make $150 instead of 50.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And so it updates the regret value for that action.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Regret is basically like how much does it regret having not played that action in the past?

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And when it encounters that same situation again, it's going to pick actions that have higher regret with higher probability.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Now, it'll just keep simulating the games this way.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

It'll keep accumulating regrets for different situations.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And in the long run, if you pick actions that have higher regret with higher probability in the correct way, it's proven to converge to a Nash equilibrium.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

It's true for all games.