Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Andy Halliday

👤 Speaker
7659 total appearances

Appearances Over Time

Podcast Appearances

The Daily AI Show
OpenAI’s Secret Training Playbook

And the first bit of news is that there's a company called H Company that has just set a new desktop computer use benchmark.

The Daily AI Show
OpenAI’s Secret Training Playbook

outperforming GPT 5.4 and Opus 4.6.

The Daily AI Show
OpenAI’s Secret Training Playbook

And remember, computer use was originated by Claude.

The Daily AI Show
OpenAI’s Secret Training Playbook

Claude, computer use was the initial announcement of this very important agentic capability, which is, okay, you need the agent that's working with you, that AI, to be able to manipulate your computer.

The Daily AI Show
OpenAI’s Secret Training Playbook

Well, what's tremendous about this piece of news is that it's only a 10 billion active parameter device.

The Daily AI Show
OpenAI’s Secret Training Playbook

$35 billion architecture.

The Daily AI Show
OpenAI’s Secret Training Playbook

It's very small.

The Daily AI Show
OpenAI’s Secret Training Playbook

And it beats GPT-5.4, which scored 75% on this OS World verified benchmark for computer use.

The Daily AI Show
OpenAI’s Secret Training Playbook

It beats it by several points.

The Daily AI Show
OpenAI’s Secret Training Playbook

So 78.5%.

The Daily AI Show
OpenAI’s Secret Training Playbook

nine we'll call it 79 78.9 79 call it against 75 for gpt 5.4 so it's better at that and it is activating only 10 billion parameters to do that in a 35 billion thing now 35 billion is actually practical on a local machine

The Daily AI Show
OpenAI’s Secret Training Playbook

like you have to have a pretty big ram you know you know and you know high-end processor to do it but you can run this computer use thing so you can imagine now that being distilled and brought down smaller and smaller until we have extremely proficient computer using agents that are local to our machines and it's an open weight model it's open source um

The Daily AI Show
OpenAI’s Secret Training Playbook

That's an incredible improvement there.

The Daily AI Show
OpenAI’s Secret Training Playbook

Now I want to just jump over on the same subject, agents and agentic systems.

The Daily AI Show
OpenAI’s Secret Training Playbook

There's a company over in France called Nou Research, N-O-U-S, like us in French.

The Daily AI Show
OpenAI’s Secret Training Playbook

Nou Research.

The Daily AI Show
OpenAI’s Secret Training Playbook

has the Hermes line of AI models.

The Daily AI Show
OpenAI’s Secret Training Playbook

And we used to talk about them back in 2023.

The Daily AI Show
OpenAI’s Secret Training Playbook

They haven't really hit the news much in recent times, but they've just delivered a local agent that's patterned after OpenClaw and challenges OpenClaw, but it's the first serious self-improving version of a claw.

The Daily AI Show
OpenAI’s Secret Training Playbook

Okay, so most local agents have memory and they execute tasks, but this one introduces a genuinely different kind of architecture, which has a do it, learn from it, and improve loop built into the process.