Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

David Duvenaud

๐Ÿ‘ค Speaker
1059 total appearances

Appearances Over Time

Podcast Appearances

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

Like once you have an element that has a rough idea of like what sort of thing happened in what time, then when I see some like reference to like genetic engineering and like some like 1930s data, it's like, oh, that no one used that phrase at this point.

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

And then you can use that to like help clean the data more.

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

But it's like I think this is like an Achilles heel of this approach.

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

Yeah, it's also.

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

actually another technical problem of data poisoning just through the questions you ask.

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

So if you are just doing metaculous style, like is there going to be a war between India and Pakistan this year?

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

It's actually hard because when you tune your scaffolding to go back, most of the questions you ask about, you're asking because something happened, right?

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

So it's like, imagine a future person comes back and asks me if I'm worried about, I don't know, Lithuania invading Canada.

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

I'd be like, well, I wasn't until you asked me, right?

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

Yeah, so it's easy to sort of like unintentionally poison your, or rather incentivize your model to be the opposite of the nothing ever happens guy, to just be like, yes, whatever you're asking, like there was a 1% chance it happened.

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

How do you avoid that?

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

Well, so then, I mean, you try to, I guess I'll say that's one nice thing about the open-ended just generate text approach, because then you have to normalize over all possible newspaper headlines.

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

So that actually already guards against this sort of validation poisoning problem.

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

But then that has its own problem because the likelihood is very sensitive to styles.

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

Maybe there's a new nickname for the president in the future, and if one model guesses it or thinks it's plausible, another one doesn't, and that ends up dominating the likelihood.

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

So there's a bunch of interesting technical problems here, and I am a technical person, and that's like...

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

actually might be greatest fear is that I just end up nerd sniping myself and spending time on like fun technical problems instead of the problems that matter.

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

Exactly.

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

Oh, absolutely.

80,000 Hours Podcast
Why 'Aligned AI' Could Still Kill Democracy | David Duvenaud, ex-Anthropic team lead

I mean, I think everyone agrees that sort of going forward, history is just happening faster.