Ryan Kidd
And I don't know, if it's happening during a transition period for U.S.
government, it could be even wilder.
So I would say my median bet is on 2033-ish, but I really care a lot about the impacts of AI. Front-load your concern to pre-2033 scenarios.
And I think that MATS mentors, we haven't surveyed them, but I think if we were to poll them, we'd get something similar.
Yeah, it's a good question.
I actually think there is plenty of room for this, and here's why.
The mainline meta-strategy that the AI safety community seems to be pursuing on the whole, and here we're talking in terms of funding and sheer numbers of people and resources deployed, not necessarily in terms of LessWrong posts written or something, is this AI control strategy, where basically you build what is perhaps better called an alignment MVP,
which is a term coined by Jan Leike, former head of Superalignment at OpenAI, now co-lead of Alignment Science at Anthropic.
An alignment MVP is an AI system that is a minimum viable product for differentially accelerating the pace of alignment research over capabilities research, such that we get the right outcome. So basically, you're getting AIs to do your homework. And there's been a lot of debate on this. There's a very strong camp in the direction of: this will just never work, because as soon as an AI system is strong enough to be useful, it's dangerous. I think Claude Code shows this is not the case, at least for software engineering. But people who think that aligning AI systems requires serious research taste would probably say that Claude Code is nowhere near there, and that AI systems generally are nowhere near that level of research taste.
Now, all of the things that you're mentioning that pay off only in 2063 scenarios, presumably they only pay off over that long a time period.
Not necessarily because of like, I don't know, human challenge trials or something.
Maybe that makes a difference if you're interested in, I don't know, making humans more intelligent with genetic engineering or some of the crazier things that are being tossed around.
But if you're mainly interested in something like, oh, this thing is going to take decades of technical work, then maybe you can compress those decades into a really short period.