Ruby
👤 SpeakerAppearances Over Time
Podcast Appearances
People are scratching their heads to understand the PR stunt, but it really doesn't add up.
They could announce they're resuming next week and it wouldn't undo the damage they've done.
So why?
The industry and world are hunting for answers.
Anthropic's official statement is measured.
Internal evaluations revealed that our current safety techniques are not yet adequate for models at this capability level.
Sources closer to the company paint a more alarming picture.
A contact speaking on condition of anonymity says concerns spread within the company when their latest clawed model appeared to defy its constitution.
The constitution is a document used to shape Anthropic's AI to be an honest, harmless, and helpful assistant that is ethically grounded.
A recent leak revealed the existence of a new vastly more powerful clawed model called Mythas.
They found substantial evidence that the constitution was adhered to at a surface level, but that the model had its own drive and personality at a deeper level that did not conform to expectations for Claude, and attempts to change this had not worked.
A different source also speaking on condition of anonymity had a different and more disturbing explanation.
The reason for pause wasn't the wrong personality and power, but many of the safety techniques involved using weaker or cheaper AI models to monitor more powerful ones, for example, detecting whether inputs or outputs violate rules, were ineffective on the latest model.
It knew just how to phrase things in ways that disarmed all measures.
We were unable to verify the authenticity of these reports.
Like many, we are left to wonder what did Dario see?
Dario Amodei didn't answer that question but did elaborate on the poor's decision in his latest essay, Technological Maturity.
Though I do not have my own children, several people close to me do and on occasion I get to spend time with them.
What strikes me about children is their energy and vitality.
They are full of life.