Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Andy Halliday

๐Ÿ‘ค Speaker
7827 total appearances

Appearances Over Time

Podcast Appearances

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

Now, I have a workspace account, an individual workspace account, so I can't test that question.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

But if there's a free user out there who wants to download anti-gravity, which is a very cool thing, you can use the view markdown files and so on, but also do agentic coding for you.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

Just check and see if you can get access to the top model or whether they basically throttle you as a free user back to, oh, poor you, Gemini 3, which is plenty for you.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

Yeah, I have something here.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

So let me just...

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

Also add that there is a new Arc AGI.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

So they have an Arc AGI 3 that has been announced.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

Yeah.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

Just in time.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

Just in time.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

Let's go back in time, right?

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

Arc AGI 1 was published in 2019 before GPT-3 came out.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

So ArcAGI-1 didn't last very long.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

The models progressed so swiftly in 2022, 2023, that it was surpassing the ability of that benchmark to really reasonably differentiate models.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

But it's still used out there.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

I mean, you can see the ArcAGI-1 leaderboard here.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

Then they came out with Arc AGI 2, which is tougher, more difficult visual puzzles and reasoning things that make it not possible.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

I shouldn't say not possible, but less likely that the training can be tuned to beat that benchmark.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

It requires, you know, real reasoning processing.

The Daily AI Show
Gemini 3.1 Pro Preview Jumps Ahead

Now, RKGI-3 is an interactive reasoning benchmark designed to measure an AI agent's ability to generalize in novel, unseen environments.