David Duvenaud

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

But I think that's like some of the most basic groundwork that needs to be done at this point is like clarify what we're even talking about.

6493.635 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

Yeah.

6525.239 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

So actually, I had the exact same thought.

6526 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

And that's why that leads me to one of the projects that I'm working on, like the actual technical projects that I'm working on, which is me and a few people, including Alec Radford, who's like one of the creators of GPT, who's now sort of like unemployed and just doing fun research projects, is trying to train a historical LLM, like a LLM that's only trained up on data up to like, let's say, 1930 and then like maybe 40, 1950.

6527.702 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

And the idea being that

6549.369 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

As you said, it's hard to operationalize these questions like, I don't know, what fraction of humans are employed?

6551.412 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

It might not really matter or be the right question to ask.

6557.36 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

What we'd rather ask is something more like, what is the future newspaper headline?

6559.703 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

Or given a leader, what's their Wikipedia page or something like that?

6562.928 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

It's more like freeform sort of things.

6566.813 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

And the cool thing is that

6569.176 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

LLMs, you can query them to predict this sort of thing, right?

6571.139 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

Like, write me a newspaper headline from 2030 or whatever.

6575.464 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

I mean, they're not going to do a good job unless they have a lot of scaffolding and specific training.

6577.647 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

But we can validate that kind of scaffolding on historical data using these historical LLMs.

6581.552 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

So the idea is you train a model only on data up to 1930, then you ask it to predict the likelihood that it would give to a headline in 1940 or some other free-form text.

6587.279 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

And you can evaluate their likelihoods on this text

6597.05 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

in the past and then you can also use the same scaffolding on a model train up to 2025 and then ask it to predict like headlines in 2035 and get a rough idea of like or you can iterate on your scaffolding by seeing how well it does on like past data

6599.253 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

So that's been the huge slap so far is like constantly finding different sources of unintentional data poisoning and like mislabeled data and things like that.

6640.276 View full episode →

80,000 Hours Podcast

Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

So, I mean, their elements can help you because there's sort of like a chicken and egg.

6646.562 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment