Noam Brown

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

It's true for chess.

1198.187 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

It's true for poker.

1199.068 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

It's particularly useful for poker.

1199.709 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

This is counterfactual regret minimization.

1204.595 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

So this counterfactual regret minimization is a kind of self-play.

1216.511 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

It's a principled kind of self-play that's proven to converge to Nash Equilibria, even in imperfect information games.

1219.695 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Now you can have other forms of self-play and people use other forms of self-play for perfect information games where you have more flexibility.

1225.763 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

The algorithm doesn't have to be as theoretically sound in order to converge to that class of games because it's a simpler setting.

1233.01 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Exactly.

1254.232 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Yeah.

1254.732 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Self-play is not tied specifically to neural nets.

1255.013 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

It's a kind of reinforcement learning basically.

1257.255 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Okay.

1260.098 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And I would also say this process of like trying to reason, oh, what would the value have been if I had taken this other action instead?

1260.218 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

This is very similar to how humans learn to play a game like poker, right?

1267.445 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Like you probably played poker before and with your friends, you probably ask like, oh, would you have called me if I raised there?

1270.688 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

And that's a person trying to do the same kind of like learning from a counterfactual that the AI is doing.

1277.454 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Yeah.

1288.297 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Now where the neural nets come in, I said like, okay, if it's in that situation again, then it will choose the action that has high regret.

1288.417 View full episode →

Lex Fridman Podcast

#344 – Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation

Now the problem is that poker is such a huge game.

1295.706 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment