Alex Imas
π€ SpeakerAppearances Over Time
Podcast Appearances
I think, forgetting about utilitarian philosophy or whatever,
like just a pure von Neumann probe has, I don't know what the, is this an accurate way to say it?
They just have high marginal value for like the random solar system they'll occupy because that turns into like more solar systems or turns into more solar systems.
But like a von Neumann probe is a thing that can exist, right?
And that's like a very greedy optimizer.
Right, yeah.
But it's just like, what does the world look like in a world where like von Neumann probes are possible?
Is it possible labor share is high?
One of the biggest problems in RL right now is credit assignment because you have these extremely long rollouts and you need to know why they succeeded or failed.
One of Cursor's researchers, Sasha Rush, gave me a Blackboard lecture on how they use targeted RL with textual feedback to deal with this problem and train Composer 2.5.
I filmed on my iPhone, so apologies for the camera work.
After Cursor injects these hint tokens, they run another forward pass.
The trajectory itself doesn't change, but the hint causes the model to assign lower probability to the error tokens.
Cursor then trains the original model to match those probabilities, basically teaching it to downweight these specific mistakes.
There's a lot more nuance that we couldn't include in this mineral.
If you want to watch the full thing, I posted it on my Twitter.
And if you want to try out Composer 2.5, head to cursor.com slash dwarkesh.
Do economists have any advice for countries which are not in the AI production chain?
If you're not either producing the AI models, you're not producing the hardware that goes into AI models, if you're not Korea making HBM or Taiwan making with the FAS or not the Netherlands with ASML,
Like, what is India or Nigeria?