Jaeden Schafer
Podcast Appearances
I think Anthropic is doing better in this, but...
75% success rate, like they are improving, their success rate is up a bit.
I still don't think it's the best.
There's a major focus on kind of how it is being used professionally.
OpenAI says their model right now is significantly better at basically giving the kind of deliverables that people use in real work.
So things like spreadsheets, presentations, financial models, legal analysis.
Across all of those, they ran a bunch of different tasks, including one performed by a junior investment banking analyst.
It got 87% compared to 68% that GPT-5.2 got.
Some human evaluators also preferred it.
Um, about 68% of the time they said it had better visuals and better structure.
So there's some cool stuff.
Cool features that you might actually use today.
This is the one I'm very excited about.
It has what they're calling steerability. It's available in the API too, which I think is crazy, but it's on ChatGPT.
If you're talking to ChatGPT, you can kind of see its reasoning, right?
Like it's thinking through some stuff and it puts a couple of steps down and you realize it's going in the wrong direction.
You know, maybe you're like, Hey, I'm trying to visit like, uh, the best beach in the
for surfing, and it's like, okay, looking at beaches in Kauai. And you're like, oh crap, I'm in California, I don't want to see Kauai. Then you can type a message, like, "specifically in California," mid-prompt, mid-response. It actually takes into account what you just said and, you know, steerability, it's going to go and incorporate that into what it's looking at.