David Duvenaud

Speaker
1059 total appearances

Podcast Appearances

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

But I think that's like some of the most basic groundwork that needs to be done at this point is like clarify what we're even talking about.

Yeah.

So actually, I had the exact same thought.

And that leads me to one of the projects that I'm working on, like the actual technical projects that I'm working on, which is me and a few people, including Alec Radford, who's like one of the creators of GPT, who's now sort of like unemployed and just doing fun research projects, trying to train a historical LLM, like an LLM that's only trained on data up to, let's say, 1930, and then maybe 1940, 1950.

And the idea being that, as you said, it's hard to operationalize these questions. Like, I don't know, what fraction of humans are employed?

It might not really matter or be the right question to ask.

What we'd rather ask is something more like, what is the future newspaper headline?

Or given a leader, what's their Wikipedia page or something like that?

It's more like freeform sort of things.

And the cool thing is that LLMs, you can query them to predict this sort of thing, right?

Like, write me a newspaper headline from 2030 or whatever.

I mean, they're not going to do a good job unless they have a lot of scaffolding and specific training.

But we can validate that kind of scaffolding on historical data using these historical LLMs.

So the idea is you train a model only on data up to 1930, then you ask what likelihood it would give to a headline from 1940 or some other free-form text.

And you can evaluate their likelihoods on this text in the past, and then you can also use the same scaffolding on a model trained up to 2025 and ask it to predict headlines in 2035 and get a rough idea. Or you can iterate on your scaffolding by seeing how well it does on past data.
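
As a rough illustration of the evaluation described above (fit a model only on text from before a cutoff, then score later free-form text by the likelihood the model assigns to it), here is a minimal sketch. It is not the project's method: a smoothed character-bigram model stands in for the historical LLM, and the corpus, the headlines, and the names train_bigram and avg_log_likelihood are all hypothetical.

```python
import math
from collections import Counter


def train_bigram(text):
    """Fit a character-bigram model on pre-cutoff text (toy stand-in for an LLM)."""
    pair_counts = Counter(zip(text, text[1:]))   # counts of (previous char, next char)
    context_counts = Counter(text[:-1])          # how often each char is a left context
    vocab = set(text)
    return pair_counts, context_counts, vocab


def avg_log_likelihood(model, text):
    """Average per-character log-likelihood of `text` under the bigram model.

    Uses add-one smoothing, treating all unseen characters as a single
    unknown symbol so that unseen bigrams get a small nonzero probability.
    """
    pair_counts, context_counts, vocab = model
    v = len(vocab) + 1  # +1 for the unknown symbol
    total = 0.0
    for prev, nxt in zip(text, text[1:]):
        p = (pair_counts[(prev, nxt)] + 1) / (context_counts[prev] + v)
        total += math.log(p)
    return total / max(len(text) - 1, 1)


# Hypothetical "pre-cutoff" corpus and two candidate "post-cutoff" texts.
pre_cutoff = (
    "the market fell sharply today. "
    "the president spoke to the nation. "
    "the market rose again."
)
model = train_bigram(pre_cutoff)

plausible = avg_log_likelihood(model, "the president spoke today")
gibberish = avg_log_likelihood(model, "zqxj vvkw qzzx")
print(f"plausible headline: {plausible:.3f}")
print(f"gibberish headline: {gibberish:.3f}")
```

Higher (less negative) scores mean the held-out text is less surprising to the model. The same scoring function could be rerun after each change to the prompting or scaffolding, which is the "iterate on past data" loop the quote describes.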

So that's been the huge slog so far, is like constantly finding different sources of unintentional data poisoning and like mislabeled data and things like that.

So, I mean, the LLMs can help you because there's sort of like a chicken and egg.