Nathaniel Whittemore

The new Frontier model, which like I said is called GLM 5.1, achieved a 58.4 on SweeBench Pro, beating GPT 5.4 and Opus 4.6, who scored 57.7 and 57.3 respectively.

1161.908 View full episode →

The AI Daily Brief: Artificial Intelligence News and Analysis

All of AI's New Models and Tools

Z.ai also provided a mixed benchmark that included Terminal Bench 2.0 and NL2 repo as well, which had GLM 5.1 slightly behind the two US leaders but ahead of Gemini 3.1 Pro.

1173.991 View full episode →

The AI Daily Brief: Artificial Intelligence News and Analysis

All of AI's New Models and Tools

Still, if those benchmarks hold, it puts GLM 5.1 in the top echelon of frontier models with a clear separation from QEN 3.6 Plus and KimiKey 2.5.

1184.858 View full episode →

The AI Daily Brief: Artificial Intelligence News and Analysis

All of AI's New Models and Tools

And indeed, what most people are clinging onto is the fact that this is a full open source release with commercial licensing.

1194.415 View full episode →

The AI Daily Brief: Artificial Intelligence News and Analysis

All of AI's New Models and Tools

It's a gigantic 754 billion parameter model, so you're not going to be running it locally on a Mac Mini.

1199.644 View full episode →

The AI Daily Brief: Artificial Intelligence News and Analysis

All of AI's New Models and Tools

Still, it gives developers the opportunity to build on top of current generation state-of-the-art models for kind of the first time.

1204.472 View full episode →

The AI Daily Brief: Artificial Intelligence News and Analysis

All of AI's New Models and Tools

We've been tracking the apparent shift in Chinese lab strategy away from open source recently, but this release suggests that leading Chinese labs are at least still somewhat willing to give away their best performing models.

1210.171 View full episode →

The AI Daily Brief: Artificial Intelligence News and Analysis

All of AI's New Models and Tools

In terms of performance, ZAI provided a few impressive examples in agents and coding.

1219.451 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment