Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing
Podcast Image

80,000 Hours Podcast

"AI doesn't work" – the story behind the stat that misled millions

28 Apr 2026

Transcription

Chapter 1: What misleading statistic about AI pilots contributed to market sell-offs?

0.031 - 8.582 Rob Wiblin

If you were following technology news in August last year, you almost certainly heard about this MIT study showing that 95% of generative AI pilots at companies were failing.

0

Chapter 2: What does the MIT report actually reveal about AI pilot success rates?

9.343 - 16.331 Rob Wiblin

This result was big enough to contribute to a NASDAQ sell-off. People worked hard to come up with sophisticated explanations for how something this crazy could be true.

0

Chapter 3: What defines success in AI projects according to the study?

16.892 - 29.768 Rob Wiblin

And it was repeated by Forbes, Axios, The Hill, the Harvard Business Review, and dozens of others, becoming a staple of elite opinion and one of the most enduring and widely cited statistics in the AI is overhyped backlash.

0

Chapter 4: How does the sample size affect the credibility of the AI study?

29.748 - 30.209 Rob Wiblin

The problem?

0

Chapter 5: Why was the MIT report not available when it went viral?

30.79 - 33.254 Rob Wiblin

The study behind these headlines is incredibly weak.

0

Chapter 6: What conflicts of interest are present in the MIT AI study?

33.795 - 51.283 Rob Wiblin

Worse than you could imagine. And that headline is also a completely incorrect description of what it found, even taking the study entirely on its own terms. The story behind this study will demonstrate that whenever you see a juicy headline, even one with an attractive conclusion, and even one rewarding to come from MIT, it might just be complete nonsense.

0

Chapter 7: What is the real lesson learned from the AI pilot statistics?

53.052 - 60.764 Rob Wiblin

The most important thing to know is that this report did not show that 95% of generative AI pilots at companies are failing, as almost all journalists claimed.

0

61.585 - 82.7 Rob Wiblin

Rather, the report found that of all the organisations surveyed, 60% had investigated custom enterprise AI tools, 20% had gotten to the point of actually doing some pilot project with them, and 5% of the total had gone on to successfully deploy those tools in production. 80% of companies simply never piloted any custom task-specific generative AI.

0

83.32 - 92.429 Rob Wiblin

Saying that 95% of them were failing is like saying 95% of Tinder users have failing marriages when 80% of the people you're talking about have never even gone on a date in the first place.

0

93.35 - 105.823 Rob Wiblin

Moreover, according to their own survey, the primary reason why pilots didn't progress to deployment wasn't that they were going badly, but just the very familiar and generic organizational unwillingness to adopt new tools.

0

106.36 - 126.041 Rob Wiblin

Now, the media is definitely at fault for putting a completely wrong number in their headlines, but the report also makes this mistake about its own graph, referring to a 95% failure rate for enterprise AI solutions. Wrong. These results actually show that among the 20% who did in fact pilot a custom AI tool, about 25% or a quarter were successful.

126.962 - 141.081 Unknown

It's not really a low strike rate for a pilot project, and it's a success rate five times higher than what everyone was told about. And in reality, a 25% success rate is actually very impressive. Once you appreciate the bar, a project had to clear to count as a success in this study.

141.843 - 157.846 Rob Wiblin

To qualify as a success, an AI application had to show a marked and sustained productivity or profit and loss impact within six months. They don't define marked or sustained, but a marked and sustained improvement in profitability or productivity within six months is obviously a high bar for any new project to clear.

158.408 - 174.669 Rob Wiblin

It's widely understood that enterprise tech deployments often take years to show bottom line impacts, even if they're going quite well. And notice that by this standard, an AI project that merely breaks even, that's a failure. A project that has benefited the company in some way that hasn't yet markedly affected productivity or profits? Failure.

175.412 - 186.876 Rob Wiblin

And a project that's on track to be profitable next year but isn't yet? Equally, a failure. Do you get the sense that maybe these authors would prefer to find that these projects aren't working? Well, we'll come back to that.

Comments

There are no comments yet.

Please log in to write the first comment.