Jeremiah
๐ค SpeakerAppearances Over Time
Podcast Appearances
Did you hear about the guy who made a working language model in Minecraft using redstone circuits?
Pretty amazing, isn't it?
His version is barely GPT-2 level, but there's no reason we can't scale that up.
Once we create full-sized data centers in Minecraft, everyone will want to do their training runs there.
What do you mean, why?
Real world data centers cost billions of dollars, raise electricity prices, waste, she briefly scans the room to confirm Andy Masley isn't around, then continues, water, and they're getting increasingly politically unpopular and hard to build.
We can short-circuit all of that by putting the data centers in Minecraft instead.
But you have to have the Minecraft world being simulated by real computers, right?
So don't you still need the data center in order to play the Minecraft?
Oh, I'm sure you need some computer, but it's a question of leverage.
One high-end gaming computer playing Minecraft can include a whole world with continents, mountain ranges, forests, and oceans.
You can fit thousands of data centers in that world.
So with even one real-world computer, you've saved billions on chips and construction costs.
You take a moment to consider how best to explain this.
So, um, every computation has to be done somewhere, right?
So you can, in theory, build a working data center on Minecraft, but it will take billions of blocks.
We've got clawed code working on it.
No, I'm saying it would take billions of blocks, and simulating the training circuit and all those billions of blocks in perfect detail would take just as many real-world computations as running the training in the real world.