Ruby

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

Concerns typically split into abuse of a powerful technology by ill-intentioned actors, for example a dictatorial regime, or loss of control where the AI systems themselves go rogue.

338.948 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

Many AI leaders are on record acknowledging the danger of AI.

350.962 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

Could be lights out for all of us, said Altman regarding worst-case scenario.

355.147 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

Anthropic, OpenAI, and Google DeepMind all contain safety departments whose purpose is to keep AI safe.

360.092 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

Until now, it would be possible to doubt these efforts as safety washing akin to the greenwashing of companies like ExxonMobil, designed to placate employees, regulators, and the public.

367.467 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

After all, the safety efforts to date have not prevented the relentless march of AI progress.

378.902 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

That's a harder story to tell when it costs you $200 billion, if not everything, says Sarah Chen of Bernstein Research.

384.629 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

People are scratching their heads to understand the PR stunt, but it really doesn't add up.

392.539 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

They could announce they're resuming next week and it wouldn't undo the damage they've done.

397.69 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

So why?

402.438 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

The industry and world are hunting for answers.

402.979 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

Anthropic's official statement is measured.

407.366 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

Internal evaluations revealed that our current safety techniques are not yet adequate for models at this capability level.

409.51 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

Sources closer to the company paint a more alarming picture.

416.101 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

A contact speaking on condition of anonymity says concerns spread within the company when their latest clawed model appeared to defy its constitution.

420.388 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

The constitution is a document used to shape Anthropic's AI to be an honest, harmless, and helpful assistant that is ethically grounded.

428.561 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

A recent leak revealed the existence of a new vastly more powerful clawed model called Mythas.

436.534 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

They found substantial evidence that the constitution was adhered to at a surface level, but that the model had its own drive and personality at a deeper level that did not conform to expectations for Claude, and attempts to change this had not worked.

442.223 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

A different source also speaking on condition of anonymity had a different and more disturbing explanation.

455.687 View full episode →

LessWrong (Curated & Popular)

"Anthropic’s Pause is the Most Expensive Alarm in Corporate History" by Ruby

The reason for pause wasn't the wrong personality and power, but many of the safety techniques involved using weaker or cheaper AI models to monitor more powerful ones, for example, detecting whether inputs or outputs violate rules, were ineffective on the latest model.

461.748 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment