Holden Karnofsky

Speaker
33 total appearances

Podcast Appearances

LessWrong (Curated & Popular)
"Responsible Scaling Policy v3" by HoldenKarnofsky

"Responsible Scaling Policy v3" by Holden Karnofsky. Published on February 24, 2026.

All views are my own, not Anthropic's.

After that is an FAQ section.

Heading How it's going The good and the bad

My most recent ATK interview elaborates on this viewpoint.

Subheading.

Goal 1.

Forcing functions for improved risk mitigations.

Subheading.

A partial success story.

But I can't be confident that it will work out.

Time will tell.

Subheading.

How much pressure there is depends on the general level of concern for AI risks.

Heading A revised, but not overturned, vision for RSPs.

But I think they can make us safer.

Heading.

Q&A.

Subheading.

On the move away from implied unilateral commitments.
