Nicholas Andresen

It's as if you ran a museum, spent years terrified of art thieves, and then discovered that would-be thieves were compelled, by some strange law of nature, to file detailed plans with the security desk before attempting their heist.

322.479 View full episode →

LessWrong (Curated & Popular)

"How AI Is Learning to Think in Secret" by Nicholas Andresen

But something strange is happening to chain-of-thought reasoning.

335.176 View full episode →

LessWrong (Curated & Popular)

"How AI Is Learning to Think in Secret" by Nicholas Andresen

Remember that screenshot we started with?

338.78 View full episode →

LessWrong (Curated & Popular)

"How AI Is Learning to Think in Secret" by Nicholas Andresen

Glean, disclaim, disclaim.

341.848 View full episode →

LessWrong (Curated & Popular)

"How AI Is Learning to Think in Secret" by Nicholas Andresen

Synergy customizing illusions.

344.294 View full episode →

LessWrong (Curated & Popular)

"How AI Is Learning to Think in Secret" by Nicholas Andresen

Online, people have started calling this kind of thing thinkish.

347.082 View full episode →

LessWrong (Curated & Popular)

"How AI Is Learning to Think in Secret" by Nicholas Andresen

There's a whole emerging vocabulary, watchers apparently means human overseers, fudge means sabotage, cunninger means circumventing constraints.

351.233 View full episode →

LessWrong (Curated & Popular)

"How AI Is Learning to Think in Secret" by Nicholas Andresen

Some of the other words, overshadows, illusions, seem to mean different things in different contexts, and some combinations resist interpretation entirely.

360.444 View full episode →

LessWrong (Curated & Popular)

"How AI Is Learning to Think in Secret" by Nicholas Andresen

Weirdly, thinkish reminds me of home.

369.975 View full episode →

LessWrong (Curated & Popular)

"How AI Is Learning to Think in Secret" by Nicholas Andresen

I grew up near Gibraltar, a tiny British territory dangling off the southern tip of Spain, where a Spanish-English blend called Vlanito is spoken.

373.053 View full episode →

LessWrong (Curated & Popular)

"How AI Is Learning to Think in Secret" by Nicholas Andresen

Here's an example.

381.843 View full episode →

LessWrong (Curated & Popular)

"How AI Is Learning to Think in Secret" by Nicholas Andresen

Levete el brolicu its raining cats and dogs.

383.532 View full episode →

LessWrong (Curated & Popular)

"How AI Is Learning to Think in Secret" by Nicholas Andresen

To a Lanito speaker, this is completely normal, take the umbrella, it's pouring.

387.217 View full episode →

LessWrong (Curated & Popular)

"How AI Is Learning to Think in Secret" by Nicholas Andresen

To anyone else, it might take a minute to parse, there's a Spanish verb, borrowed British slang, and an idiom that makes no literal sense in any language.

392.785 View full episode →

LessWrong (Curated & Popular)

"How AI Is Learning to Think in Secret" by Nicholas Andresen

And Lanito feels great to speak.

401.837 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment