Jeremiah
๐ค SpeakerAppearances Over Time
Podcast Appearances
If you like it, you can subscribe at astralcodex10.substack.com.
Welcome to the Astral Codex X podcast for the 26th of February, 2026.
Title, Next Token Predictor is an AI's job, not its species.
This is an audio version of Astral Codex X, Scott Alexander's Substack.
If you like it, you can subscribe at astralcodex10.substack.com.
1.
In The Argument, link in post, Kelsey Piper gives a good description of the ways that AIs are more than just next token predictors or stochastic parrots.
For example, they also use fine-tuning and RLHF.
But commenters, while appreciating the subtleties she introduces, object that they're still just extra layers on top of a machine that basically runs on next-token prediction.
Here's a comment.
Quote, No, it's just next token prediction on a biased set.
If you fine-tune on a set of recipes, it will be more likely to predict recipes.
If you fine-tune on answers that have been selected by humans to be typical of a helpful assistant, it will be more likely to predict text characteristics of a helpful assistant.
Next token prediction is a structural property, the structural property, of what these models are.
It can't be changed by fine-tuning.
Scott writes, I want to approach this from a different direction.
I think overemphasizing next token prediction is a confusion of levels.
On the levels where AI is a next token predictor, you are also a next token, technically next sense datum predictor.
On the levels where you're not a next token predictor, AI isn't one either.
Putting all the levels in graphic form, this is a chart that shows two sequences of steps, one for human and one for LLM, with the comparable levels next to each other.