Marcus Hutter
๐ค SpeakerVoice Profile Active
This person's voice can be automatically recognized across podcast episodes using AI voice matching.
Appearances Over Time
Podcast Appearances
And before you get it, you want to predict it, again, based on your past action and observation sequence.
You just condition extra on your actions.
There's an interesting alternative that you also try to predict your own actions.
If you want.
In the past or the future?
In your future actions.
Wait, let me wrap.
I think my brain is broke.
We should maybe discuss that later after I've explained the IC model.
Exactly.
That's very important.
The whole history from birth sort of of the agent.
And we can come back to that also why this is important here.
Often, you know, in RL, you have MDPs, macro decision processes, which are much more limiting.
OK, so now we can predict
conditioned on actions.
So even if we influence the environment, but prediction is not all we want to do, right?
We also want to act really in the world.
And the question is how to choose the actions.
And we don't want to greedily choose the actions, you know, just, you know, what is best in the next time step.