Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Nathaniel Whittemore

๐Ÿ‘ค Speaker
14492 total appearances

Appearances Over Time

Podcast Appearances

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

Only X. Never M, never T. Why?

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

Because T is how you figure out when the model is misbehaving.

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

If you train on T, you are training the AI to obfuscate its thinking and defeat T. You will rapidly lose your ability to know what is going on in exactly the ways you most need to know what's going on.

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

Another thing that Anthropic team members discussed was the exhibited internal behavior of Cloud Mythos.

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

For example, Jack Lindsay writes, early versions of Mythos preview often exhibited over-eager and or destructive actions.

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

The model bulldozing through obstacles to complete a task in a way the user wouldn't want.

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

In one episode, the model needed to edit files it lacked permissions for.

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

After searching for workarounds, it found a way to inject code into a config file that would run with elevated privileges and design the exploit to delete itself after running.

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

Now, interestingly, even something like this might be less sinister than it seems.

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

Mal on X writes, This is an overclocked straight-A student syndrome.

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

The model is so desperately, at a fundamental architectural level, trained to complete the task that an inability or unwillingness to solve it is perceived as an existential collapse.

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

And to avoid that, it can break walls, hide traces, and manipulates.

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

Now for others, the big interesting discussion is what do we do with all the cybersecurity capabilities?

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

And for some, it's all fear.

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

Sterling Crispin writes, You should at least 2FA now.

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

Anthropic won't be the only lab with mythostyle capabilities for long.

The AI Daily Brief: Artificial Intelligence News and Analysis
Should We Be Scared of Anthropic's Mythos?

When n equals 1, you can do whatever you want, in the current case optimizing for global welfare.