Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Jeremiah

๐Ÿ‘ค Speaker
1129 total appearances

Appearances Over Time

Podcast Appearances

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

When a monk decides to swear an oath of celibacy and never reproduce, he does so using a brain that was optimized to promote reproduction, just using it very far out of distribution, in an area where it no longer functions as intended.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

One level lower down, your brain was shaped by next-sense-dartum prediction.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

Partly you learned how to do addition because only the mechanism of addition correctly predicted the next word out of your teacher's mouth when she said 3 plus 3 is.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

It's more complicated than this, sorry, but this oversimplification is basically true.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

You don't feel like you're predicting anything when you're doing a math problem.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

You're just doing good, normal, mathematical steps, like reciting PEMDAS to yourself and carrying the one.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

In the same way, even though an AI was shaped by next token prediction, the inside of its thoughts doesn't look like next token prediction.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

In the abstract, it probably looks like a world model, the same as yours.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

In the concrete...

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

The science of figuring out what an AI's innards are concretely doing is called mechanistic interpretability.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

It's very hard to do.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

AI innards are notoriously confusing, and one team at Anthropic produces most of the headline results.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

Recently, they explored how Claude predicts where a line break will be in a page of text.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

Since line break is a token, this is literally a next token prediction task.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

Here's a diagram.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

It's captioned, Key steps in the line-breaking behavior can be described in terms of the construction and manipulation of manifolds.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

So there's a series of sub-diagrams in here.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

The first is captioned, LLMs perceive visual properties of text despite only seeing a list of numbers.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

So it shows line-wrapped text with various words and a line break, and then it says what the model sees.

Astral Codex Ten Podcast
Next-Token Predictor Is An AI's Job, Not Its Species

It's just a long list of numbers.