Nicholas Andresen

LessWrong (Curated & Popular)

"How AI Is Learning to Think in Secret" by Nicholas Andresen

How AI Is Learning to Think in Secret, by Nicholas Andresen. Published on January 6, 2026. On Thinkish, Neuralese, and the End of Readable Reasoning. In September 2025, researchers published the internal monologue of OpenAI's GPT-o3 as it decided to lie about scientific data.

This is what it thought.

There's an image here.

Description

Pardon?

This looks like someone had a stroke during a meeting they didn't want to be in, but their hand kept taking notes.

That transcript comes from a recent paper published by researchers at Apollo Research and OpenAI on catching AI systems scheming.

To understand what's happening here, and why one of the most sophisticated AI systems in the world is babbling about synergy-customizing illusions, it first helps to know how we ended up being able to read AI thinking in the first place.

That story starts, of all places, on 4chan.

In late 2020, anonymous posters on 4chan started describing a prompting trick that would change the course of AI development.

It was almost embarrassingly simple.

Instead of just asking GPT-3 for an answer, ask it to show its work before giving its final answer.

Suddenly, it started solving math problems that had stumped it moments before.
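
As a minimal sketch of that trick, assuming nothing about any particular model API: the only thing that changes is the wording of the prompt. The function names and the exact instruction text here are illustrative, not the phrasing the original posters used.

```python
def direct_prompt(question: str) -> str:
    """Ask for the answer only."""
    return f"Q: {question}\nA:"


def show_work_prompt(question: str) -> str:
    """Ask the model to write out its steps before the final answer."""
    return (
        f"Q: {question}\n"
        "Show your work step by step, then give the final answer.\n"
        "A:"
    )


question = "What is 8734 times 6892?"
print(direct_prompt(question))
print(show_work_prompt(question))
```

The two prompts carry the same question; the second simply gives the model room to write intermediate steps before it commits to an answer.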

To see why, try multiplying 8734 by 6892 in your head.

If you're like me, you start fine.

8734 times 2 is 17468.

Then you need to hold on to that while computing 8734 times 90, which is, let's see, 9 times 4 is 36.

Carry the 3.

Wait, what was that first number?
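
Written down, the same multiplication is easy, because each intermediate result stays on the page instead of in your head. Here is a rough sketch of the pencil-and-paper method; the helper name `partial_products` is made up for this illustration.

```python
def partial_products(a: int, b: int) -> list[int]:
    """Multiply a by each digit of b, scaled by place value, lowest place first."""
    products = []
    place = 1
    while b > 0:
        b, digit = divmod(b, 10)   # peel off the lowest digit of b
        products.append(a * digit * place)
        place *= 10
    return products


steps = partial_products(8734, 6892)
for step in steps:
    print(step)      # 17468, 786060, 6987200, 52404000
print(sum(steps))    # 60194728
```

Each printed line is one number you would otherwise have to keep in memory while computing the next one.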
