Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Matt Freeman

๐Ÿ‘ค Speaker
372 total appearances

Appearances Over Time

Podcast Appearances

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

I have the, like, saw the tweets in passing level of awareness of this.

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

So, I mean, I agree with you that I didn't love it when I saw...

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

that we're basically taking like a core document that governs their approach to scaling and deployment and making it weaker.

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

I mean, I think just facially that that's a bad look, right?

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

Easy, easy answer from me there.

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

But as to like, as for like, what are the actual changes between the previous version and this version?

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

Honestly, I'm trying to read this 20-page document while talking to you guys, and it's not going well, so I'm going to stop that.

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

I was trying to get an overall sense of what did they actually change concretely, and I'm unable to figure that out on air while talking to you.

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

It may be the case that it's not so bad.

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

If you look at the fine print, I admit I have a tendency to want to believe the best of people.

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

I don't know.

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

I mean, do you, do you, can you by any chance summarize like what exactly did they change with the new RSP?

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

Yeah.

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

Let's see if, let's see if Claude can summarize.

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

anthropic is releasing the third version of its rsp reflecting on two plus years of experience key structural change separates what anthropic can realistically achieve on its own from what it believes the broader industry needs to do collectively acknowledging that higher safety levels may be impossible for one company to implement unilaterally basically what you said steven

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

introduces three new mechanisms, a published safety frontier safety roadmap with publicly graded goals, a periodic risk reports with external expert review, and an industry-wide capabilities to mitigations map, all aimed at increasing transparency and accountability.

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

So my interpretation, tell me if you think I'm being overly generous, is they basically said, hey, we're not going to hold to the RSP, but we're going to introduce three new transparency mechanisms that will at least hopefully give some feedback loop where other people can see where things are going and, you know, be mad at us if we're doing the wrong thing, something like that.

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

So just to be a little technical here on the RSP, you know,

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

It's fairly specific where basically they test, you know, any given model on purely on capabilities and they kind of see what it can do.

The Bayesian Conspiracy
257 โ€“ Pentagon Comes For Claude

And they're like, you know, how useful is this as a tool for building chemical weapons, for building biological weapons, for doing, you know, high leverage, extremely damaging acts.