Marcus Hutter
And first, I should say: how do we measure performance?
So we measure performance by giving the agent reward.
That's the so-called reinforcement learning framework.
So every time step, you can give it a positive reward, a negative reward, or maybe no reward.
It could be very sparse, right?
Like if you play chess, just at the end of the game, you give plus one for winning or minus one for losing.
So in the AIXI framework, that's completely sufficient.
So occasionally you give a reward signal and you ask the agent to maximize reward, but not greedily, just taking the next reward and the next one, because being greedy is very bad in the long run.
But over the lifetime of the agent.
So let's assume the agent lives for m time steps, let's say it dies sharply after 100 years.
That's just the simplest model to explain.
So it looks at the future reward sum and asks: what is my action sequence, or more precisely my policy, which leads in expectation, because I don't know the world, to the maximum reward sum?
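This finite-horizon objective, maximizing the expected sum of rewards over an m-step lifetime, can be sketched with a Monte Carlo estimate. This is only an illustration of the objective, not AIXI itself (AIXI does not know the environment); the environment, policy, and all names here are made up for the example.

```python
import random

def expected_reward(policy, step, m, episodes=2000, seed=0):
    """Monte Carlo estimate of E[sum of rewards over an m-step lifetime].
    `policy` maps a state to an action; `step` maps (state, action, rng)
    to (next_state, reward). Illustrative only, not AIXI."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(episodes):
        state = 0
        for _ in range(m):
            action = policy(state)
            state, reward = step(state, action, rng)
            total += reward
    return total / episodes

# Toy world: action 1 pays 1 with probability 0.7, action 0 pays 0.4 surely.
def step(state, action, rng):
    return state, (1.0 if rng.random() < 0.7 else 0.0) if action else 0.4

risky = lambda s: 1   # expected lifetime reward about 70 over m=100 steps
safe = lambda s: 0    # expected lifetime reward exactly 40
```

The optimal policy is the one with the larger expected reward sum over the whole lifetime, which is how the framework avoids greedy behavior.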
Let me give you an analogy.
In chess, for instance, we know how to play optimally in theory.
It's just a minimax strategy.
I play the move which seems best to me, under the assumption that the opponent plays the move which is best for him, so worst for me, under the assumption that I play, again, the best move. You expand this expectimax tree to the end of the game, then you backpropagate the values, and you get the best possible move.
So that is the optimal strategy, which von Neumann already figured out a long time ago, for playing adversarial games.
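The backpropagation over the game tree described above can be sketched in a few lines. Here the tree is a toy nested-list structure (leaves are terminal rewards from the maximizer's view), which is an assumption of this example, not how a real chess engine represents positions.

```python
def minimax(node, maximizing=True):
    """Exhaustive minimax over a nested-list game tree.
    Leaves are terminal rewards (+1 win, -1 loss, 0 draw) for the maximizer."""
    if isinstance(node, (int, float)):
        return node  # terminal position: return its value
    child_values = [minimax(child, not maximizing) for child in node]
    # My move: pick the best child; opponent's move: assume the worst for me.
    return max(child_values) if maximizing else min(child_values)

# Depth-2 tree: I choose a branch, then the opponent picks the leaf worst for me.
tree = [[+1, -1], [0, 0]]
print(minimax(tree))  # 0: the second branch guarantees at least a draw
```

The first branch looks tempting (+1 is reachable), but the opponent would steer to -1, so the optimal root value is 0, exactly the worst-case reasoning in the passage above.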
Luckily, or maybe unluckily for the theory, it becomes harder.