AIandBlockchain
Arxiv. Why Smart Prompts Don’t Always Work: The Limits of In-Context Learning
09 Jun 2025
Have you ever wondered how large language models like GPT or Gemini can instantly understand what you want — with just a couple of example lines? No fine-tuning. No retraining. Just... understanding. That’s the magic of in-context learning, and in this episode, we go deep beneath the surface to uncover the mechanics — not just the tricks.🔍 Guided by a research paper from Google DeepMind, we explore:Why in-context learning works (and when it doesn’t)How prompts and prefixes actually influence model behaviorWhat soft prompts are, and why they might outperform plain textThe fundamental limits of prompting as a technique📚 The paper, "Understanding Prompt Tuning and In-Context Learning via Meta-Learning", reveals that prompts aren’t just about choosing the right words — they work because the model updates its internal task representation based on the input context. In other words, it performs a form of Bayesian inference on the fly — no weight changes needed.But here’s the catch:This only works if the task was already present in the training dataAnd if it’s a single, well-defined task, not a mixture of multiple🎯 Here’s the twist: even powerful soft prompts, which modify the model’s internal activations directly, can’t overcome these theoretical limits. If you need a model to handle a totally new or composite task, you’ll likely need weight tuning — via LoRA or full fine-tuning.💡 One mind-blowing result? An untrained transformer model, with the right soft prefix, came surprisingly close to optimal performance. This suggests that the architecture alone holds innate context processing capabilities. 🤯📈 Why this matters for you — whether you're building products or researching AI:Learn when prompting is enough — and when it’s notUnderstand the theoretical boundaries that no amount of tokens can bypassConsider the emerging potential to transfer soft prompts across different models — a future “knowledge layer” for AI?🎧 Don’t miss this episode if you work with LLMs, build AI tools, or just want to understand why these models "get it" — and where that understanding hits its limit.👇 Tell us:What surprised you the most? Are you using soft prompting in your own work?Key Takeaways:In-context learning is Bayesian inference over context, not memorizationSoft prompts can manipulate internal model states more effectively than hard tokensPrompting hits a wall on mixed or novel tasks — weight tuning is needed thereSEO Tags:Niche: #incontextlearning, #softprompting, #metatraining, #bayesianinferencePopular: #AI, #neuralnetworks, #machinelearning, #GPT, #LLMLong-tail: #promptinglimitations, #incontextvsweighttuning, #contextbasedlearningTrending: #transformers2025, #GoogleDeepMind, #LoRARead more: https://arxiv.org/abs/2505.17010
No persons identified in this episode.
This episode hasn't been transcribed yet
Help us prioritize this episode for transcription by upvoting it.
Popular episodes get transcribed faster
Other recent transcribed episodes
Transcribed and ready to explore now
Eric Larsen on the emergence and potential of AI in healthcare
10 Dec 2025
McKinsey on Healthcare
Reducing Burnout and Boosting Revenue in ASCs
10 Dec 2025
Becker’s Healthcare -- Spine and Orthopedic Podcast
Dr. Erich G. Anderer, Chief of the Division of Neurosurgery and Surgical Director of Perioperative Services at NYU Langone Hospital–Brooklyn
09 Dec 2025
Becker’s Healthcare -- Spine and Orthopedic Podcast
Dr. Nolan Wessell, Assistant Professor and Well-being Co-Director, Department of Orthopedic Surgery, Division of Spine Surgery, University of Colorado School of Medicine
08 Dec 2025
Becker’s Healthcare -- Spine and Orthopedic Podcast
NPR News: 12-08-2025 2AM EST
08 Dec 2025
NPR News Now
NPR News: 12-08-2025 1AM EST
08 Dec 2025
NPR News Now