Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing
22012 total appearances

Appearances Over Time

Podcast Appearances

The Daily AI Show
Who Is Winning The AI Model Wars?

So they're making it affordable, and it is the top model.

The Daily AI Show
Who Is Winning The AI Model Wars?

a state-of-the-art model when it comes to coding benchmarks benchmarks okay but then open ai came out with codex max using 5.1 and that one introduced this com context compaction technology that allows for breakthroughs in the context continuity of authentic work using

The Daily AI Show
Who Is Winning The AI Model Wars?

Basically, there's no end to how long this thing can go on reasoning without losing track of what it's doing.

The Daily AI Show
Who Is Winning The AI Model Wars?

And that because of this compaction technology that OpenAI introduced.

The Daily AI Show
Who Is Winning The AI Model Wars?

And then yesterday, we mentioned that another new model, I can't keep them straight in my head, also used, or Gemini actually demonstrated as part of their

The Daily AI Show
Who Is Winning The AI Model Wars?

new technology they they have a context condensation or they use some other term that is about context uh you know summarization and passage to the next player so let me share this chart here and i i want to just show some things that kind of make sense of all of that a little bit more than than all the possible uh interpretations and comparisons that we can make here

The Daily AI Show
Who Is Winning The AI Model Wars?

So I think someone else has to put that up on the screen for us.

The Daily AI Show
Who Is Winning The AI Model Wars?

So here we have the AI model performance benchmark comparison across various metrics.

The Daily AI Show
Who Is Winning The AI Model Wars?

And you see Gemini 3 in deep think mode is here at the top left with this batch of winning scores, largely in the area of intelligence and reasoning.

The Daily AI Show
Who Is Winning The AI Model Wars?

Humanity's last exam, significantly above

The Daily AI Show
Who Is Winning The AI Model Wars?

Gemini 3 Pro in the DeepThink mode.

The Daily AI Show
Who Is Winning The AI Model Wars?

And the next player is way down here at 30.7%, which was GPT-5 Pro.

The Daily AI Show
Who Is Winning The AI Model Wars?

Look at Arc AGI 2, which is a pure kind of reasoning test for intelligence.

The Daily AI Show
Who Is Winning The AI Model Wars?

As recently as just a few months ago, all of the models struggled to get 3% on Arc AGI 2.

The Daily AI Show
Who Is Winning The AI Model Wars?

And now Gemini 3 DeepThink does 45%, Opus 4.5 at 37.6, and the other players, including GPT, way down in the teens.

The Daily AI Show
Who Is Winning The AI Model Wars?

So Gemini, I think the takeaway here is that Gemini is the smartest model.

The Daily AI Show
Who Is Winning The AI Model Wars?

Now look at Opus 4.5, pretty darn smart still, you know, pushing the number close up to the numbers on these top reasoning tests.

The Daily AI Show
Who Is Winning The AI Model Wars?

But in terms, excuse me, in terms of coding, this is where it shines.

The Daily AI Show
Who Is Winning The AI Model Wars?

So Opus 4.5, you know, we were all excited about the SONNET 4.5.

The Daily AI Show
Who Is Winning The AI Model Wars?

Well, Opus 4.5 is really the new, you know, top of the charts offering from Anthropic and it's