Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Noam Brown

๐Ÿ‘ค Speaker
1199 total appearances

Appearances Over Time

Podcast Appearances

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

It's true for chess.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

It's true for poker.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

It's particularly useful for poker.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

This is counterfactual regret minimization.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

So this counterfactual regret minimization is a kind of self-play.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

It's a principled kind of self-play that's proven to converge to Nash Equilibria, even in imperfect information games.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Now you can have other forms of self-play and people use other forms of self-play for perfect information games where you have more flexibility.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

The algorithm doesn't have to be as theoretically sound in order to converge to that class of games because it's a simpler setting.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Exactly.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Yeah.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Self-play is not tied specifically to neural nets.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

It's a kind of reinforcement learning basically.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Okay.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And I would also say this process of like trying to reason, oh, what would the value have been if I had taken this other action instead?

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

This is very similar to how humans learn to play a game like poker, right?

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Like you probably played poker before and with your friends, you probably ask like, oh, would you have called me if I raised there?

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And that's a person trying to do the same kind of like learning from a counterfactual that the AI is doing.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Yeah.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Now where the neural nets come in, I said like, okay, if it's in that situation again, then it will choose the action that has high regret.

Lex Fridman Podcast
#344 โ€“ Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Now the problem is that poker is such a huge game.