Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Ryan Kidd

πŸ‘€ Speaker
958 total appearances

Appearances Over Time

Podcast Appearances

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

I'd say Marius Halpan as well to some extent with his deception evals work.

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

Yeah, like, and probably dozens of people.

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

I'm just like sharing some of the more

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

the names that come more easily to mind, but just many, many people have come through maths with this.

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

We're super open to individuals who have this kind of archetype.

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

And note, a connector, right?

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

They have empirical skills, they have theoretical skills.

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

So they could probably succeed in a bunch of different ways, right?

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

But they're uniquely spec'd out to connect those two things.

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

Now, there are some mentors and projects that are much more suited to this kind of thing than others.

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

People like Richard Ngo, historically Evan Hubinger.

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

I think actually Evan Hubinger has been like probably the most dominant connector driving force at maths over our time.

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

But he's not a mentor in the next program, unfortunately.

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

He doesn't have time.

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

But yeah.

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

There's many different opportunities at Mass for this kind of thing.

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

I think even in some of the interp streams as well, it's very possible to enter an interpretability stream and bring it with it like some model of the kind of theory-based interpretability mechanism or strategy that you want to pursue and then see that executed on.

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

That's happened several times.

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

I have many takes here.

Future of Life Institute Podcast
Can AI Do Our Alignment Homework? (with Ryan Kidd)

So obviously I advocate a portfolio and that's, that's has historically sponsored a bunch of projects.