Daniel Kokotajlo

👤 Person
608 total appearances

Podcast Appearances

Dwarkesh Podcast
2027 Intelligence Explosion: Month-by-Month Model — Scott Alexander & Daniel Kokotajlo

And there is this – with lying, there is this thing where it's just really hard to keep an inconsistent false world model alive.

working with the people around you, and that's why psychopaths often get caught.

And so if you have all these AIs that are deployed to the economy and they're all working towards this big conspiracy, I feel like one of them who's siloed or loses internet access and has to confabulate a story will just get caught, and then you're like, wait, what the fuck?

And then, you know, you catch it before it's, like, taken over the world.

So it is the case that certain things that people would have considered egregious misalignment in the past are happening.

But also certain things which people who are especially worried about misalignment said would be impossible to solve have just been solved in the normal course of getting more capabilities.

Like Eliezer had that thing about can you even specify what you want the AI to do without the AI totally misunderstanding you and then just converting the universe to paperclips.

And now, just by the nature of GPT-4 having to understand natural language, it totally has a common sense understanding of what you're trying to make it do, right?

So I think this sort of like trend cuts both ways, basically.

It seems like in the whole scenario, a big part of why certain things happen is because of this race with China.

And if you read the scenarios, basically the difference between the one where things go well and the one where things don't go well is whether we decide to slow down despite that risk.

I guess the question I really want to know the answer to is like, one, it just seems like you're saying, well, it's a mistake to try to race against China or to race intensely against China.

it leads to nationalization and it leads to us not prioritizing alignment.

Maybe I should have asked you that at the beginning of the conversation.

Let's talk about geopolitics next.

So describe to me how you foresee the relationship between the government and the AI labs to proceed, how you expect that relationship in China to proceed, and how you expect the relationship between the US and China to proceed.

Three simple questions.

Yes, no, yes, no, yes, no.