Holden Karnofsky

LessWrong (Curated & Popular)

"Responsible Scaling Policy v3" by HoldenKarnofsky

Responsible Scaling Policy v3 by Holden Karnofsky Published on February 24, 2026

All views are my own, not Anthropic's.

After that is an FAQ section.

Heading: How it's going: the good and the bad

My most recent 80k interview elaborates on this viewpoint.

Subheading.

Goal 1.

Forcing functions for improved risk mitigations.

Subheading.

A partial success story.

But I can't be confident that it will work out.

Time will tell.

Subheading.

How much pressure there is depends on the general level of concern for AI risks.

Heading: A revised, but not overturned, vision for RSPs.

But I think they can make us safer.

Heading.

Q&A.

Subheading.

On the move away from implied unilateral commitments.
