Rob Wiblin

👤 Speaker

3881 total appearances

Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 1

Confidence: Medium

Appearances Over Time

Podcast Appearances

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

That may sound a little odd, but they're probably right about that.

1104.433 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

The thing is that other things become the primary bottleneck.

1106.575 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

To know whether automated AI R&D is on the way or beginning to kick off,

1110.08 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

We're apparently now relying on these general impressions from anthropic stuff, that this thing is powerful, but it doesn't yet seem good enough to replace many of us yet.

1114.675 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

But I think we can apply some common sense to the big picture here.

1125.229 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

Mythos has given us AI advances that we previously thought would take six months in just three months.

1128.653 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

That naturally brings forward the point at which we're going to be able to automate the development of AI models by three months.

1134.02 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

And if it's a sign that AI advances are now going to continue at twice the pace that they were before, then that effectively halves the time that we have to prepare for that point.

1139.667 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

I don't know whether that means 10 years becomes five years or four years becomes two years, but the direction of the effect and the size of the effect is clear enough.

1148.179 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

Before we wrap up, I want to draw your attention to a recurring theme in these reports that really stood out to me.

1155.81 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

This is the first time that an AI company has published 300 pages about a model that it's decided not to release, despite the fact that it might earn them tens of billions of dollars if they did, maybe hundreds of billions of dollars.

1161.318 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

It's also the first time that Anthropic decided to delay giving its own staff access to one of its models.

1172.329 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

With every previous Claude, their practice has just been to let staff use it as soon as it's judge-ready during training.

1177.515 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

But with Mythos, they were worried enough about it being misaligned and causing havoc or sabotage on their own systems.

1182.76 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

that they held it back and ran a 24-hour alignment test before letting any employees use it.

1187.305 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

But according to them, that wasn't enough.

1192.473 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

The retrospective on that found that the 24-hour window did not pressure test the model sufficiently and that the most concerning behaviors only became evident later through much more extended use.

1194.916 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

One of their lead researchers, Sam Bowman, he commented this week that working with this model has been a wild ride.

1204.411 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

We've come a long way on safety, but we still expect the next capability jump of this scale to be a huge challenge.

1208.697 View full episode →

80,000 Hours Podcast

How scary is Claude Mythos? 303 pages in 21 minutes

The system card says directly that their current methods could easily be inadequate to prevent catastrophic misaligned actions in significantly more advanced systems.

1214.626 View full episode →

← Previous Page 64 of 195 Next →

Report any issue

Rob Wiblin

Voice Profile Active

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment