Gavin Baker

👤 Speaker
2955 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 3
Confidence: High

Podcast Appearances

TBPN
Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

to identify a small number of previously known minor vulnerabilities.

These vulnerabilities all appear relatively simple.

And we have found that other publicly available models are able to discover them as well without requiring a bypass.

So they're saying this isn't that big of a jailbreak.

This isn't that big of a deal.

We can move forward safely.

We've taken safety seriously.

But that is the debate that's going on between the admin and Anthropic.

So as everyone knows, the rollout of Fable 5 has been a bit rocky.

Incredible demos, incredible benchmarks, but offset by the odd decision to silently degrade the quality of answers related to frontier AI development instead of just refusing the request, as it does with cyber and bio prompts. That was the main thing that everyone was really confused about.

The actual rationale behind AI development refusal is pretty sound.

People are upset about that if they're working on machine learning systems, recommender systems, anything that requires instantiating an AI product, let alone if they're just doing open source AI research and would love to use the latest and greatest models to help.

Now it's not an option.

But at the same time, just think about the logic.

If a model doesn't let you hack a system,

just have the model build you an unrestricted model that does let you hack that system.

And so clearly, in that scenario, if you want to restrict hacking or bio, you also have to restrict the tool that makes the tool that hacks the system or, in theory, develops the bioweapon.

But the choice to silently degrade responses was not well received.

These are important issues.

And there's another timeline where AI leaders are cut from the same cloth as America's elected officials.