Cal Newport
👤 SpeakerVoice Profile Active
This person's voice can be automatically recognized across podcast episodes using AI voice matching.
Appearances Over Time
Podcast Appearances
Basically, reasoning models are a way of taking an LLM, which are static and have no memory,
and having them approximate something like more dynamic computation with memory, because it can sort of, as it rambles, right?
It's looking at everything it said so far when it produces the new token.
So it can, if you're rambling, you're thinking out loud, you can use all of that thinking in producing the new token.
So it's like you have some memory and this wandering can be somewhat dynamic.
You can get some basic like iterative or looping type thinking in it, right?
So they use the reasoning model.
And what they did is, I don't know how many times they prompted it or on what questions they prompted it, but on one of the times they prompted it about this particular problem, the model spit out a very long transcript of an answer.
And a team of expert mathematicians poured over this answer, and in this long chain of thought transcript, they identified in there...
the core idea that became the counterexample.
So these mathematicians then pulled that counterexample idea out of this transcript, they polished it, they wrote it properly, they elaborated it, and put it into a short, concise, much more human-readable paper, and that's what OpenAI actually posted.
So the LLM did not post this sort of elegant
you know, five-page paper, what have you.
Human mathematicians did it, but they got the idea for writing this paper out of this really long chain of thought transcript that this model produced when cogitating about the Erdos problem they asked it about.
Right?
So that's what went on.
All right.
That's what happened.
Let's now get into the core questions that people have about this thing that just happened.
Question number one.