Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing
9724 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 5
Confidence: High

Appearances Over Time

Podcast Appearances

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

Here's the best.

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

Here's the most possible point.

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

So they didn't replace Erdos' conjecture with a better conjecture.

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

They didn't prove Erdos' conjecture, but they provided a counterexample construction that showed the thing he thought was the right answer couldn't possibly be right.

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

Now, how did they do this?

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

Well, they used a reasoning LLM.

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

So that's an LLM that has been tuned to essentially talk out loud, to sort of think out loud and wander with its thoughts.

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

We first saw the first reasoning models back in 2024 with O1 and the O models, a deep sequence reasoning model as well.

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

Basically, reasoning models are a way of taking an LLM, which are static and have no memory,

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

and having them approximate something like more dynamic computation with memory, because it can sort of, as it rambles, right?

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

It's looking at everything it said so far when it produces the new token.

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

So it can, if you're rambling, you're thinking out loud, you can use all of that thinking in producing the new token.

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

So it's like you have some memory and this wandering can be somewhat dynamic.

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

You can get some basic like iterative or looping type thinking in it, right?

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

So they use the reasoning model.

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

And what they did is, I don't know how many times they prompted it or on what questions they prompted it, but on one of the times they prompted it about this particular problem, the model spit out a very long transcript of an answer.

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

And a team of expert mathematicians poured over this answer, and in this long chain of thought transcript, they identified in there...

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

the core idea that became the counterexample.

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

So these mathematicians then pulled that counterexample idea out of this transcript, they polished it, they wrote it properly, they elaborated it, and put it into a short, concise, much more human-readable paper, and that's what OpenAI actually posted.

Deep Questions with Cal Newport
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check

So the LLM did not post this sort of elegant