9976 total appearances

Podcast Appearances

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

A vast knowledge of many different existing results and techniques that it can be trained on.

Many more than most mathematicians can keep in their head.

So it's willing to like systematically explore answers, mix and match different approaches and see what works.

Also, a lot of these systems will use a formal proof verifier, so they can try a bunch of stuff and see what works.
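
To make the "try a bunch of stuff and see what works" idea concrete, here is a minimal Python sketch of a propose-and-verify loop. Everything in it is an illustrative assumption rather than anything described in the episode: `propose_candidates` is a hypothetical stand-in for a model generating attempts, `verify` is a hypothetical stand-in for a formal proof verifier, and the toy problem (writing a number as a sum of two squares) is chosen only because it can be checked mechanically.

```python
from itertools import product
from typing import Iterable, Optional, Tuple

Candidate = Tuple[int, int]


def propose_candidates(limit: int) -> Iterable[Candidate]:
    """Hypothetical stand-in for the model: systematically enumerate guesses."""
    return product(range(limit + 1), repeat=2)


def verify(candidate: Candidate, target: int) -> bool:
    """Hypothetical stand-in for a formal verifier: an exact, mechanical check."""
    a, b = candidate
    return a * a + b * b == target


def search(target: int, limit: int) -> Optional[Candidate]:
    """Control loop: keep trying candidates until the verifier accepts one."""
    for candidate in propose_candidates(limit):
        if verify(candidate, target):
            return candidate
    return None  # nothing in the search space passed verification


if __name__ == "__main__":
    print(search(target=25, limit=5))
```

The point of the design is that the proposer is allowed to be wrong most of the time, because nothing counts as an answer until the checker accepts it.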

So it's been really big for mathematics.

So what Thomas Bloom is saying is that this result is not some brand new capability that we didn't know AI had.

He's saying it's on the trajectory of the existing types of results we've been seeing in recent years.

It falls in that sweet spot where AI-enabled math tools really work well.

Now, I want to be fair.

There are two things about OpenAI's result that do separate it from this recent explosion in AI-augmented, computer-aided math work.

Number one, this was done, at least they claim, just purely with an LLM prompt, right?

The tools that mathematicians are using tend to use modular architectures with many different types of models hooked together.

You'll have an LLM.

You'll have a formal proof verifier, usually using a formal verification language like Lean, which LLMs can speak very well.

You'll have some sort of complicated control logic.

You'll have specialized training of the LLMs on very specific types of math techniques that are relevant to the fields you're looking at.
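
For a sense of what a formal verification language like Lean, mentioned above, looks like in practice, here is a tiny Lean 4 sketch. The theorem names are made up for illustration and the statements are deliberately trivial. The only thing it is meant to show is that each claim is either accepted or rejected by the checker, which is what lets a surrounding system keep the attempts that pass.

```lean
-- Accepted: both sides compute to the same value, so `rfl` closes the goal.
theorem two_plus_two : 2 + 2 = 4 := rfl

-- Accepted: adding zero on the right reduces by the definition of addition.
theorem add_zero_example (n : Nat) : n + 0 = n := rfl

-- Rejected if uncommented: the checker will not accept a false statement.
-- theorem bad : 2 + 2 = 5 := rfl
```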

This was not that.

There was no elaborate scaffolding.

It was actually just a prompt to a reasoning machine that just talked for like 150 pages, and in there they found an answer.

So that is different about this result.