Azeem Azhar
👤 SpeakerAppearances Over Time
Podcast Appearances
Because in that case, I'm context switching and perhaps I can't really do other things.
So is it a 17 hour task or am I just pressing shift tab, accept, accept on my keyboard for the better part of a day?
And what is it about these multi-turn conversations that single shot can't achieve?
So I'm just curious.
So what is that task?
How much is a human sitting over the machine pressing a button?
It's quite a complicated picture because only a certain type of person with a certain experience can command a 17-hour or longer task.
Only a certain experienced person can make sense of whether a literature review is good or not.
It's why PhD students have supervisors.
And so this is a tension in your findings I think is worth exploring.
Models are improving and people are trusting them with longer and harder tasks.
But judging those outputs does require real expertise.
But there is this hollowing out because many of the junior tasks that have built that expertise are
at risk or if not already being automated away.
And I'm curious about, you know, when you look at that and you look in the data and you extrapolate, are we creating a long-term institutional fragility problem that a lot of what's being automated is what used to be the apprenticeship work as you discovered doing all those literature reviews, you know, a few years ago?
If you want to support us in bringing more of these conversations to the world, please consider subscribing to the show.
Let's go back to the personal because I'm going to go...
outside of the wheelhouse.
But the beauty of doing this in my own conversation is that I can and no one can stop me.
So here I go.