Andy Halliday
Podcast Appearances
I'm well, thank you.
I'm glad to have an extra day to prep for my trip to Canada next week.
Yeah, I'm sharing a screen here from Artificial Analysis that shows how, when Google finally drops its latest work into the product space, they don't just, you know, kind of eke above the running pack, they jump ahead.
So here you see the Artificial Analysis Intelligence Index up here, which is a combination of different factors that illustrate intelligence.
And I've isolated the intelligence thing down here.
It's interesting that there's a difference this way.
But if you look at the combination of its skills, which includes agentic performance and, you know, other approaches that aren't purely reasoning, this is where Gemini 3.1 Pro Preview is really shining, well above all.
Anthropic is right there, neck and neck, with Opus 4.6 Max a couple of points ahead of GPT-5.2 Thinking High. Notice how GLM-5 and Kimi K2.5 are right behind the leaders. These are both Chinese models that are available to you.
DeepSeek V3.2 is pretty far behind on the Artificial Analysis Intelligence Index.
But now let's look down here at the agentic index.
It's interesting to me that Google Gemini 3.1 Pro Preview, on its agentic skills, is way behind Opus 4.6 Max and regular Opus 4.6 at 64.
So that's a pretty big spread back to Gemini 3.1 Pro Preview.
What does that mean?
For most of us, probably not a lot, unless you're actually using the entire Google ecosystem to set up a harness for multi-agent coding or multi-agent workflow management.
You might want to not focus entirely on Gemini 3.1 Pro Preview.
Make Opus available to whatever system you're architecting.
Now, a lot of it depends on memory and the ability to do either recursion, where you process things while creating an intermediate capture of the results, and then open up the context window again with new inference based on that new context, and so on.
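To make that recursion pattern concrete, here is a minimal Python sketch. It is not taken from any particular agent framework: `process_recursively` and `toy_infer` are hypothetical names, and `toy_infer` is a stand-in for a real model call that just keeps the capitalized words it sees. The point is only the loop shape: each pass starts a fresh context made of the previous intermediate capture plus the next piece of material.

```python
from typing import Callable, List, Tuple


def process_recursively(
    chunks: List[str],
    infer: Callable[[str], str],
) -> Tuple[str, List[str]]:
    """Process chunks one at a time, carrying forward an intermediate capture.

    `infer` is an assumed placeholder for a model call: it takes the text of a
    fresh context window and returns the new capture (for example, a summary).
    """
    notes = ""                 # intermediate capture carried between passes
    history: List[str] = []    # every capture, kept for inspection
    for chunk in chunks:
        # Open a new context: prior capture plus the next chunk only.
        context = notes + "\n" + chunk
        notes = infer(context)
        history.append(notes)
    return notes, history


def toy_infer(context: str) -> str:
    """Hypothetical stand-in for a model: keep the sorted capitalized words."""
    words = [w.strip(".,:") for w in context.split()]
    names = sorted({w for w in words if w[:1].isupper()})
    return " ".join(names)


if __name__ == "__main__":
    chunks = ["Gemini leads the index.", "Opus leads agentic tasks."]
    final, history = process_recursively(chunks, toy_infer)
    print(final)    # prints: Gemini Opus
```

In a real harness, `infer` would wrap whichever model you make available to the system, and the capture would be a summary or structured state rather than a word list.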