Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Nathaniel Whittemore

πŸ‘€ Speaker
25560 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 9
Confidence: High

Appearances Over Time

Podcast Appearances

The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

And it won't just be the big labs.

1190.208 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

You're also going to see the agent labs and even app layer companies experiment with their own models, their own harnesses, and their own routing systems in order to get better token efficiency, which is exactly what I meant when I said that every AI business model is now to some extent a token efficiency play.

1191.852 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

We saw this with Cursor's Composer 2.5, which completes coding tasks in the range of the state-of-the-art from both Cloud and OpenAI, but with a radically higher efficiency.

1207.37 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

Interestingly, we also just got something from legal AI firm Harvey along the same lines.

1216.321 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

This week, Harvey tweeted, We partnered with Fireworks AI to train open-source models for legal.

1221.067 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

Here's what we found.

1226.394 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

One...

1227.715 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

Hybrid legal agents can beat frontier models on quality and cost by routing selectively to a frontier advisor.

1228.396 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

We tested a hybrid setup where GLM 5.1 served as the primary worker routing tasks to Opus 4.7 as an advisor when needed.

1234.225 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

GLM invoked Opus sparingly, just 0.83 times per task on average.

1241.597 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

The hybrid setup beat Opus on both quality and cost.

1246.004 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

They also found that post-training can push open models to frontier-level legal performance.

1249.169 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

With a little bit of post-training on Kimi's K2.6 model, they were able to move Kimi ahead of Opus on their legal agent benchmark and to do so for 11 times cheaper than Opus alone.

1254.136 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

Writes Patrick Oyo, this is the multi-model routing thesis proved in production on one of the hardest benchmarks in enterprise AI.

1264.029 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

The insight isn't that open source beat frontier.

1270.178 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

It's that smart routing beat brute force.

1272.962 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

Using the most expensive model for every task is not a quality strategy.

1275.325 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

It's a laziness tax.

1279.032 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

The teams building routing layers that send each task to the right model at the right cost are now demonstrably ahead on both dimensions simultaneously.

1280.395 View full episode β†’
The AI Daily Brief: Artificial Intelligence News and Analysis
How Companies Are Becoming AI Token Efficient

Inference optimization just became a first-class competitive advantage.

1287.428 View full episode β†’