80,000 Hours Podcast

"AI doesn't work" – the story behind the stat that misled millions

28 Apr 2026

10 min

1951 words

2 speakers

28 Apr 2026

Audio

Description

You might have heard that 95% of corporate AI pilots are failing. It was a widely cited AI statistic in 2025, repeated by media outlets and commentators everywhere. It helped trigger a Nasdaq selloff and became a pillar of the "AI is overhyped" case. The problem: 95% fail is 100% wrong. The real finding, once you read the underlying MIT report carefully, points in roughly the opposite direction:80% of surveyed companies had never piloted a custom AI tool at all.Among the companies that deployed pilots, a quarter reported success — according to an extremely high bar set by the researchers — within six months. Over 90% of staff at all surveyed companies were using tools like ChatGPT regularly for their work.None of that made the headlines. Nor did the fact that the study’s authors are all developing or selling the "agentic AI framework" technology the report recommends as the solution to this supposed epidemic of failing AI. Host Rob Wiblin breaks down how an opaque, conflicted, barely scrutinised report carrying the MIT label managed to move markets and shape global opinions on AI’s real-world utility.Learn more, video, and full transcript: https://80k.info/mit-ai-study This episode was recorded on February 13, 2026.Chapters:The AI myth that moved global markets (00:00)The math was totally wrong (00:52)The bar for success was insanely high (01:46)The study ignores its own best finding (03:28)The sample was tiny (04:49)The report wasn’t even available when it went viral (05:54)The hidden conflicts of interest (06:58)The real lesson (09:28)Video and audio editing: Dominic Armstrong, Milo McGuire, Luke Monsour, and Simon MonsourCamera operator: Dominic ArmstrongProduction: Nick Stockton, Elizabeth Cox, and Katy Moore

Chapters

1. What misleading statistic about AI pilots contributed to market sell-offs? 2. What does the MIT report actually reveal about AI pilot success rates? 3. What defines success in AI projects according to the study? 4. How does the sample size affect the credibility of the AI study? 5. Why was the MIT report not available when it went viral? 6. What conflicts of interest are present in the MIT AI study? 7. What is the real lesson learned from the AI pilot statistics?

Featured

Rob Wiblin

Unknown

Topics

NASDAQ MIT Harvard Business Review Nanda

Transcription

Chapter 1: What misleading statistic about AI pilots contributed to market sell-offs?

0.031 - 8.582 Rob Wiblin

If you were following technology news in August last year, you almost certainly heard about this MIT study showing that 95% of generative AI pilots at companies were failing.

Chapter 2: What does the MIT report actually reveal about AI pilot success rates?

9.343 - 16.331 Rob Wiblin

This result was big enough to contribute to a NASDAQ sell-off. People worked hard to come up with sophisticated explanations for how something this crazy could be true.

Chapter 3: What defines success in AI projects according to the study?

16.892 - 29.768 Rob Wiblin

And it was repeated by Forbes, Axios, The Hill, the Harvard Business Review, and dozens of others, becoming a staple of elite opinion and one of the most enduring and widely cited statistics in the AI is overhyped backlash.

Chapter 4: How does the sample size affect the credibility of the AI study?

29.748 - 30.209 Rob Wiblin

The problem?

Chapter 5: Why was the MIT report not available when it went viral?

30.79 - 33.254 Rob Wiblin

The study behind these headlines is incredibly weak.

Chapter 6: What conflicts of interest are present in the MIT AI study?

33.795 - 51.283 Rob Wiblin

Worse than you could imagine. And that headline is also a completely incorrect description of what it found, even taking the study entirely on its own terms. The story behind this study will demonstrate that whenever you see a juicy headline, even one with an attractive conclusion, and even one rewarding to come from MIT, it might just be complete nonsense.

Chapter 7: What is the real lesson learned from the AI pilot statistics?

53.052 - 60.764 Rob Wiblin

The most important thing to know is that this report did not show that 95% of generative AI pilots at companies are failing, as almost all journalists claimed.

61.585 - 82.7 Rob Wiblin

Rather, the report found that of all the organisations surveyed, 60% had investigated custom enterprise AI tools, 20% had gotten to the point of actually doing some pilot project with them, and 5% of the total had gone on to successfully deploy those tools in production. 80% of companies simply never piloted any custom task-specific generative AI.

83.32 - 92.429 Rob Wiblin

Saying that 95% of them were failing is like saying 95% of Tinder users have failing marriages when 80% of the people you're talking about have never even gone on a date in the first place.

93.35 - 105.823 Rob Wiblin

Moreover, according to their own survey, the primary reason why pilots didn't progress to deployment wasn't that they were going badly, but just the very familiar and generic organizational unwillingness to adopt new tools.

106.36 - 126.041 Rob Wiblin

Now, the media is definitely at fault for putting a completely wrong number in their headlines, but the report also makes this mistake about its own graph, referring to a 95% failure rate for enterprise AI solutions. Wrong. These results actually show that among the 20% who did in fact pilot a custom AI tool, about 25% or a quarter were successful.

126.962 - 141.081 Unknown

It's not really a low strike rate for a pilot project, and it's a success rate five times higher than what everyone was told about. And in reality, a 25% success rate is actually very impressive. Once you appreciate the bar, a project had to clear to count as a success in this study.

141.843 - 157.846 Rob Wiblin

To qualify as a success, an AI application had to show a marked and sustained productivity or profit and loss impact within six months. They don't define marked or sustained, but a marked and sustained improvement in profitability or productivity within six months is obviously a high bar for any new project to clear.

158.408 - 174.669 Rob Wiblin

It's widely understood that enterprise tech deployments often take years to show bottom line impacts, even if they're going quite well. And notice that by this standard, an AI project that merely breaks even, that's a failure. A project that has benefited the company in some way that hasn't yet markedly affected productivity or profits? Failure.

175.412 - 186.876 Rob Wiblin

And a project that's on track to be profitable next year but isn't yet? Equally, a failure. Do you get the sense that maybe these authors would prefer to find that these projects aren't working? Well, we'll come back to that.

80,000 Hours Podcast

"AI doesn't work" – the story behind the stat that misled millions

Chapter 1: What misleading statistic about AI pilots contributed to market sell-offs?

Chapter 2: What does the MIT report actually reveal about AI pilot success rates?

Chapter 3: What defines success in AI projects according to the study?

Chapter 4: How does the sample size affect the credibility of the AI study?

Chapter 5: Why was the MIT report not available when it went viral?

Chapter 6: What conflicts of interest are present in the MIT AI study?

Chapter 7: What is the real lesson learned from the AI pilot statistics?

Sign in to Audioscrape

Share this moment

80,000 Hours Podcast

"AI doesn't work" – the story behind the stat that misled millions

Chapter 1: What misleading statistic about AI pilots contributed to market sell-offs?

Chapter 2: What does the MIT report actually reveal about AI pilot success rates?

Chapter 3: What defines success in AI projects according to the study?

Chapter 4: How does the sample size affect the credibility of the AI study?

Chapter 5: Why was the MIT report not available when it went viral?

Chapter 6: What conflicts of interest are present in the MIT AI study?

Chapter 7: What is the real lesson learned from the AI pilot statistics?

Want to see the complete chapter?

Sign in to Audioscrape

Share this moment