Nathaniel Whittemore

Speaker
24919 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 4
Confidence: High

Podcast Appearances

The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

Specifically, while Claude Opus 4.8 is now slightly above GPT-5.5 in terms of its intelligence index score, Claude achieves that score while using about 80 or 90% more tokens, meaning it's significantly less token efficient, which places both Opus 4.7 and 4.8 outside of the most attractive quadrant.

The release of Gemini 3.5 Flash saw a lot of this discourse around it as well.

While the overall intelligence was much higher on Gemini 3.5 Flash than 3 Flash, the cost to run the tests was more than five times as much as 3 Flash, moving 3.5 from just at the edge of the most attractive quadrant to firmly outside of it.
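The quadrant framing here can be sketched in a few lines of Python. This is only an illustration: the cutoffs, the scores, and the cost figures below are hypothetical placeholders, not values from the episode or from any published index. The only detail taken from the discussion is the shape of the move, where intelligence goes up while cost rises more than fivefold.

```python
def quadrant(intelligence: float, cost: float,
             intelligence_cutoff: float, cost_cutoff: float) -> str:
    """Place a model on an intelligence-vs-cost chart.

    The cutoffs are hypothetical; a real chart would define its own.
    """
    smart = intelligence >= intelligence_cutoff
    cheap = cost <= cost_cutoff
    if smart and cheap:
        return "most attractive"
    if smart:
        return "capable but costly"
    if cheap:
        return "cheap but weaker"
    return "least attractive"


# Hypothetical numbers: the older model sits right at the edge of the
# attractive quadrant, the newer one is smarter but over 5x the cost.
INTELLIGENCE_CUTOFF = 50.0
COST_CUTOFF = 100.0

older = quadrant(50.0, 100.0, INTELLIGENCE_CUTOFF, COST_CUTOFF)
newer = quadrant(60.0, 520.0, INTELLIGENCE_CUTOFF, COST_CUTOFF)

print(older)
print(newer)
```

Under these made-up numbers the older model lands in the attractive quadrant and the newer one falls out of it on cost alone, even though its score improved.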

All of this is finding its way into the popular discourse as well.

For example, YouTuber and AI entrepreneur Theo recently tweeted, Meanwhile, perception of token efficiency is also part of why Codex has become so much more popular among developers.

Biniyam wrote,

Codex has gotten noticeably better at token efficiency lately.

Same tasks that used to eat up a ton of tokens now feel way more reasonable.

Fundamental analysis on X wrote, GPT-5.5 and Opus 4.8 sit around one point apart on the intelligence index, 60.2 versus 61.4.

Their token pricing is almost a match, $5 input on both, $30 versus $25 output.

So why is there a 40% gap in the cost of running the full index?

And the answer, of course, as we just saw, is that the Opus models burned way more tokens to complete the index.

Fundy writes, That's the whole game now.

Per-token pricing is the rate and tokens to completion is the actual invoice.

A model can win on price per token and lose badly on price per task, because the reasoning trace, the restatement, the overthinking is the multiplier nobody printed on the spec sheet.
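The rate-versus-invoice point can be made concrete with a small Python sketch. The per-million-token prices mirror the ones quoted above ($5 input on both, $30 versus $25 output), but the model labels and the token counts are hypothetical, chosen only so that the model with the cheaper output rate emits 80% more output tokens on the same task.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Price in dollars per one million tokens."""
    input_per_million: float
    output_per_million: float


def cost_per_task(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of one task: the rate times the tokens actually used."""
    total = (input_tokens * pricing.input_per_million
             + output_tokens * pricing.output_per_million)
    return total / 1_000_000


# Prices as quoted in the discussion; labels "a" and "b" are placeholders.
model_a = Pricing(input_per_million=5.0, output_per_million=30.0)
model_b = Pricing(input_per_million=5.0, output_per_million=25.0)

# Hypothetical token counts for the same task: model_b is cheaper per
# output token but needs 80% more output tokens to finish.
cost_a = cost_per_task(model_a, input_tokens=10_000, output_tokens=20_000)
cost_b = cost_per_task(model_b, input_tokens=10_000, output_tokens=36_000)

print(f"model_a: ${cost_a:.2f} per task")
print(f"model_b: ${cost_b:.2f} per task")
print(f"model_b costs {cost_b / cost_a - 1:.0%} more per task")
```

With these assumed token counts, the model with the lower output price still ends up noticeably more expensive per task, which is the multiplier effect the quote describes. The exact gap depends entirely on the token counts plugged in.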

This is why the cheapest per-token model is routinely the most expensive per outcome.

Researchers have a name for it: the overthinking tax.

Smaller, cheaper models that ramble can cost more in total than a pricier model that's terse and converges fast.

The buyer side implication is the part the market hasn't priced in yet.

A, the flagship layer now competes on token efficiency, not just capability.
