Ryan Kidd
Yeah, I think the first MATS cohorts were a little bit more directionless than the later cohorts.
Definitely, I think safety research really kicked into gear after we had ChatGPT.
Not to say that was the only cause, but there were a lot of things happening around that time.
And I think that...
Definitely larger, more capable models have enabled certain types of essential safety research you could not do with smaller models.
We're talking like interpretability on models that actually have coherent concepts embedded in them.
Though I'll say there's probably plenty of work still to be done on GPT-2 Small.
But linear probes and whatnot at a high level can target some of our frontier models.
You know, Qwen, these Chinese models are particularly good for that.
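A linear probe of the kind mentioned here is just a supervised linear classifier trained on a model's hidden activations to test whether a concept is linearly encoded. A minimal sketch, using synthetic activations as a stand-in for real residual-stream states from an open-weights model (loading an actual Qwen or Llama checkpoint is out of scope here):

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in for hidden activations: in real probing work these
# would be residual-stream states extracted from an open-weights model.
# Here we fabricate data where a binary concept is linearly encoded
# along one direction, plus Gaussian noise.
d_model = 64
n = 2000
concept_dir = rng.normal(size=d_model)
concept_dir /= np.linalg.norm(concept_dir)
labels = rng.integers(0, 2, size=n)
acts = rng.normal(size=(n, d_model)) \
    + np.outer(2.0 * labels - 1.0, concept_dir) * 2.0

# The probe itself: logistic regression on activations,
# trained with plain batch gradient descent.
w = np.zeros(d_model)
b = 0.0
lr = 0.1
for _ in range(300):
    p = 1.0 / (1.0 + np.exp(-(acts @ w + b)))  # predicted P(label=1)
    grad = p - labels                           # dLoss/dlogit
    w -= lr * (acts.T @ grad) / n
    b -= lr * grad.mean()

# High accuracy suggests the concept is linearly readable
# from these activations.
acc = ((acts @ w + b > 0) == labels.astype(bool)).mean()
print(f"probe accuracy: {acc:.2f}")
```

The same recipe applies to real activations: cache hidden states for labeled prompts, then fit the probe on those vectors instead of synthetic ones.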
Certain types of debate, like we had the first interesting empirical debate paper only after models were good enough to debate.
And there's many, many other such examples.
Like all the control literature I think just could not have happened as well.
Sorry if that's too much.
I mean, yeah, for plenty of interpretability research, people aren't using the frontier models.
You don't have access to them.
I mean, sure, people in the labs are, but at MATS there are tons of really excellent papers that keep getting produced, and from many other sources, right?
EleutherAI, FAR AI, et cetera.
They're doing world-class interpretability research on sub-frontier models.
Because today's sub-frontier model, today's Qwen or DeepSeek or Llama or whatever, is like yesterday's frontier model in terms of capabilities.
We're at that point where these models are all above the waterline for doing really excellent research.