
Daniel Kokotajlo

👤 Person
608 total appearances

Appearances Over Time

Podcast Appearances

Dwarkesh Podcast
2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

Classical liberalism has just been a helpful way to navigate the world when we're under this kind of epistemic hell of one thing changing... you know, people who have... yeah.

Anyways, maybe one of you can actually flesh out that thought.

Better react to it if you disagree.

Hear, hear.

I agree.

So far, these systems, as they become smarter, seem to be more reliable agents that are more likely to do the thing I expect them to do.

Why does... I think in your scenario, in at least one of the stories, you have two different stories, one with a slowdown, where we more aggressively... I'll let you characterize it.

But in one half of the scenario, why does the story end in humanity getting disempowered and the thing just having its own crazy values and taking over?

Yeah, so...

It seems like this community is very interested in solving this problem at a technical level: making sure AIs don't lie to us, or maybe that they lie to us only in exactly the scenarios where we would want them to lie to us, or something.

Whereas, you know, as you were saying, humans have these exact same problems.

They reward hack.

They are unreliable.

They obviously do cheat and lie.

And the way we've solved it with humans is just checks and balances, and decentralization.

You could, like, lie to your boss and keep lying to your boss, but over time it's just not going to work out for you... or you become president or something.

Yeah, exactly.

One or the other.

So if you believe in this extremely fast takeoff, then if a lab is one month ahead, that's the endgame and this thing takes over.

But even then, I know I'm combining so many different topics.