Jaeden Schafer
๐ค SpeakerVoice Profile Active
This person's voice can be automatically recognized across podcast episodes using AI voice matching.
Appearances Over Time
Podcast Appearances
It just basically is going to be a lot better for knowledge work.
I mean, and by a lot better, I mean, we're seeing, you know, a 10% jump here or, you know, 12% jump here, which is pretty significant.
On some of the coding benchmarks, so SWE Bench Pro, this is a software engineering bench pro, the model is getting slightly better than the last version.
So I mean, this is good, but beyond just getting slightly better, it is actually quite a bit faster.
So if anybody has used a lot of these software tools, specifically we use Cloud Code AI Box.
My developer sends me screenshots of like, because of these really long elaborate tasks that it's doing on our backend, our code base.
And I swear it's like a goal for him to see how long he can get Cloud Code to run continuously without stopping on a project he gives it.
It's funny because I'm, you know, vibe coding stuff on Lovable and I usually get a Lovable response back to me in like, you know, a minute or two.
He has it go for like three and a half hours doing a task.
So when this model gets faster, I'm excited because hopefully that three and a half hours gets cut down on some of the stuff that we're working on.
I think one of the things that it's also very good at is for real computer interaction.
There is an OS World Verified.
It basically evaluates how well an AI can operate a desktop environment.
It's, you know, pretty much just like takes a screenshot and then it uses the keyboard and mouse commands to go and click stuff.
Right now it has about a 75% success rate.
I wish I could use it more.