Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing

Andy

👤 Person
20707 total appearances

Appearances Over Time

Podcast Appearances

The Daily AI Show
Who Is Winning The AI Model Wars?

And that because of this compaction technology that OpenAI introduced.

The Daily AI Show
Who Is Winning The AI Model Wars?

And then yesterday, we mentioned that another new model, I can't keep them straight in my head, also used, or Gemini actually demonstrated as part of their

The Daily AI Show
Who Is Winning The AI Model Wars?

new technology they they have a context condensation or they use some other term that is about context uh you know summarization and passage to the next player so let me share this chart here and i i want to just show some things that kind of make sense of all of that a little bit more than than all the possible uh interpretations and comparisons that we can make here

The Daily AI Show
Who Is Winning The AI Model Wars?

So I think someone else has to put that up on the screen for us.

The Daily AI Show
Who Is Winning The AI Model Wars?

So here we have the AI model performance benchmark comparison across various metrics.

The Daily AI Show
Who Is Winning The AI Model Wars?

And you see Gemini 3 in deep think mode is here at the top left with this batch of winning scores, largely in the area of intelligence and reasoning.

The Daily AI Show
Who Is Winning The AI Model Wars?

Humanity's last exam, significantly above

The Daily AI Show
Who Is Winning The AI Model Wars?

Gemini 3 Pro in the DeepThink mode.

The Daily AI Show
Who Is Winning The AI Model Wars?

And the next player is way down here at 30.7%, which was GPT-5 Pro.

The Daily AI Show
Who Is Winning The AI Model Wars?

Look at Arc AGI 2, which is a pure kind of reasoning test for intelligence.

The Daily AI Show
Who Is Winning The AI Model Wars?

As recently as just a few months ago, all of the models struggled to get 3% on Arc AGI 2.

The Daily AI Show
Who Is Winning The AI Model Wars?

And now Gemini 3 DeepThink does 45%, Opus 4.5 at 37.6, and the other players, including GPT, way down in the teens.

The Daily AI Show
Who Is Winning The AI Model Wars?

So Gemini, I think the takeaway here is that Gemini is the smartest model.

The Daily AI Show
Who Is Winning The AI Model Wars?

Now look at Opus 4.5, pretty darn smart still, you know, pushing the number close up to the numbers on these top reasoning tests.

The Daily AI Show
Who Is Winning The AI Model Wars?

But in terms, excuse me, in terms of coding, this is where it shines.

The Daily AI Show
Who Is Winning The AI Model Wars?

So Opus 4.5, you know, we were all excited about the SONNET 4.5.

The Daily AI Show
Who Is Winning The AI Model Wars?

Well, Opus 4.5 is really the new, you know, top of the charts offering from Anthropic and it's

The Daily AI Show
Who Is Winning The AI Model Wars?

focused on coding to a large degree.

The Daily AI Show
Who Is Winning The AI Model Wars?

And then for multimodal work, it's not necessary to apply Gemini 3's deep thinking, but you can see that the top scores are achieved for all of these multimodal tests by Gemini 3 Pro.

The Daily AI Show
Who Is Winning The AI Model Wars?

And that kind of breaks down the players here.