Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Andy Halliday

๐Ÿ‘ค Speaker
8321 total appearances

Appearances Over Time

Podcast Appearances

The Daily AI Show
Spotify Engineers Stopped Writing Code

surpassed everybody else in the International Math Olympiad.

The Daily AI Show
Spotify Engineers Stopped Writing Code

So in pure math reasoning.

The Daily AI Show
Spotify Engineers Stopped Writing Code

But it doesn't stop there.

The Daily AI Show
Spotify Engineers Stopped Writing Code

So research problems don't come with clean answers.

The Daily AI Show
Spotify Engineers Stopped Writing Code

So you have to go through proofs and refutations and all these other things.

The Daily AI Show
Spotify Engineers Stopped Writing Code

And that's what this proof process is.

The Daily AI Show
Spotify Engineers Stopped Writing Code

So it's not just coming up with the answer.

The Daily AI Show
Spotify Engineers Stopped Writing Code

It's you've got to demonstrate the logic and the conclusions that you arrived at.

The Daily AI Show
Spotify Engineers Stopped Writing Code

And so Google built a research agent called Aletheia.

The Daily AI Show
Spotify Engineers Stopped Writing Code

and this is the first I've ever heard of it, on top of Google DeepThink.

The Daily AI Show
Spotify Engineers Stopped Writing Code

And it generates proofs, checks them with a natural language verifier, revises weak steps in the process, and restarts if the logic fails.

The Daily AI Show
Spotify Engineers Stopped Writing Code

And it will also come back and say, oh, I can't solve this problem.

The Daily AI Show
Spotify Engineers Stopped Writing Code

But what it's done is it's gotten to 90% on the International Math Olympiad Proof Bench Advanced program.

The Daily AI Show
Spotify Engineers Stopped Writing Code

So it's not just one benchmark that's being mastered.

The Daily AI Show
Spotify Engineers Stopped Writing Code

This model, particularly Google DeepThink, is really kind of hitting all the marks when it comes to advanced scientific mathematics and logic research by AI.

The Daily AI Show
Spotify Engineers Stopped Writing Code

Well,

The Daily AI Show
Spotify Engineers Stopped Writing Code

I think that you're right.

The Daily AI Show
Spotify Engineers Stopped Writing Code

It has to be giving people pause, people who have had the most advanced intelligence as demonstrated by their ability in mathematics particularly.

The Daily AI Show
Spotify Engineers Stopped Writing Code

Because that's โ€“ it's mind-boggling to me how โ€“ I mean, I have trouble with arithmetic in my head.

The Daily AI Show
Spotify Engineers Stopped Writing Code

Yeah.