Gavin Baker

👤 Speaker

2955 total appearances

Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 3

Confidence: High

Appearances Over Time

Podcast Appearances

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

to identify a small number of previously known minor vulnerabilities.

457.546 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

These vulnerabilities all appear relatively simple.

461.372 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

And we have found that other publicly available models are able to discover them as well without requiring a bypass.

464.157 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

So they're saying this isn't that big of a jailbreak.

471.348 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

This isn't that big of a deal.

474.133 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

We can move forward safely.

475.174 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

We've taken safety seriously.

476.537 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

But that is the debate that's going on between the admin and anthropic.

478.46 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

So as everyone knows, the rollout of Fable 5 has been a bit rocky.

481.745 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

Incredible demos, incredible benchmarks, but offset by the odd decision to silently degrade the quality of the answers related to frontier AI development instead of just refusing the request like cyber and bio prompts, which that was the main thing that everyone was really confused about.

486.395 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

The actual rationale behind AI development refusal is pretty sound.

502.147 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

People are upset about that if they're working on machine learning systems, recommender systems, anything that requires instantiating an AI product, let alone if you're just, hey, I'm just doing open source AI research and I'd love to use the latest and greatest models to help me.

506.734 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

Now it's not an option.

523.741 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

But at the same time, just think about the logic.

525.103 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

If a model doesn't let you hack a system,

526.926 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

just have the model build you an unrestricted model that does let you hack that system.

529.45 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

And so clearly in that scenario, it makes sense to restrict, if you want to restrict hacking or bio, you also have to restrict the tool that makes the tool that hacks the system or develops the bioweapon in theory.

533.82 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

And so, but the choice to silently degrade responses was not well received.

545.566 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

These are important issues.

550.993 View full episode →

TBPN

Anthropic Drama, Meta Now Tokenminning, Fox's $22B Roku Deal | Diet TBPN

And there's another timeline where AI leaders are cut from the same cloth as America's elected officials.