Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing
Podcast Image

Duarte O.Carmo's articles

#62 From NutriBench to Taralli: How far can you take a prompt?

22 Dec 2025

Description

I walk through how I used the NutriBench dataset and DSPy to rigorously evaluate and improve Taralli’s calorie estimation, moving beyond “vibes” with a clear Accuracy@10% metric. After testing multiple prompts, optimizers, and models, a few-shot setup with Gemini 3 Flash delivered about a 15% accuracy boost over the previous Gemini 2.5 Flash baseline, and I also shipped an offline, on-device LLM mode plus UI updates. I close with practical lessons: hill-climb with data, consider fine-tuning or RAG, and remember that good few-shot examples often win. Relevant links: Original article Taralli: home-cooked food tracking without the B.S. NutriBench paper (arXiv) NutriBench dataset viewer (Hugging Face) WWEIA overview (USDA) FAO/WHO GIFT individual food consumption NutriBench dataset (Hugging Face) NutriBench prompts (gist) Notebook with the process (gist) DSPy: BootstrapFewShotWithRandomSearch DSPy: MIPROv2 DSPy: GEPA overview Example prediction (gist) GEPA prompt example (gist) OpenRouter

Audio
Featured in this Episode

No persons identified in this episode.

Transcription

This episode hasn't been transcribed yet

Help us prioritize this episode for transcription by upvoting it.

0 upvotes
🗳️ Sign in to Upvote

Popular episodes get transcribed faster

Comments

There are no comments yet.

Please log in to write the first comment.