Jared

👤 Speaker
5113 total appearances

Podcast Appearances

Cloudflare's new code mode technique.

MCP is, for now at least, the standard way AI agents use external tools.

But it sure does fill up the model's context window with a lot of cruft.

To combat this, Cloudflare came up with code mode, which rhymes, so you know it's good.

Quote, "Code mode is a technique we first introduced for reducing context window usage during agent tool use.

Instead of describing every operation as a separate tool, let the model write code against a typed SDK and execute the code safely in a dynamic worker loader.

The code acts as a compact plan.

The model can explore tool operations, compose multiple calls, and return just the data it needs."
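
To make the "code as a compact plan" idea concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: `FakeSdk`, `DnsRecord`, `MODEL_CODE`, and `run_plan` are hypothetical names, not Cloudflare's SDK or runtime, and `exec()` stands in for the isolated worker the quote describes.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class DnsRecord:
    zone: str
    name: str
    type: str
    content: str


class FakeSdk:
    """Hypothetical stand-in for a typed SDK; not Cloudflare's real client."""

    def __init__(self) -> None:
        self._records = [
            DnsRecord("example.com", "www", "A", "192.0.2.1"),
            DnsRecord("example.com", "mail", "MX", "mx.example.com"),
            DnsRecord("example.org", "www", "A", "192.0.2.2"),
        ]

    def list_zones(self) -> list[str]:
        return sorted({r.zone for r in self._records})

    def list_dns_records(self, zone: str) -> list[DnsRecord]:
        return [r for r in self._records if r.zone == zone]


# Code a model might write: several SDK calls composed into one plan,
# returning only the hostnames instead of every full record.
MODEL_CODE = """
result = [
    f"{rec.name}.{zone}"
    for zone in sdk.list_zones()
    for rec in sdk.list_dns_records(zone)
    if rec.type == "A"
]
"""


def run_plan(code: str, sdk: FakeSdk) -> object:
    # Assumption: exec() is only a placeholder. It is NOT a sandbox;
    # a real system would run this code in an isolated runtime.
    scope = {"__builtins__": {}, "sdk": sdk}
    exec(code, scope)
    return scope["result"]


print(run_plan(MODEL_CODE, FakeSdk()))
```

Three SDK calls happen inside the plan, but only the short list of hostnames goes back to the model, which is where the context savings would come from.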

As a result of this, Cloudflare created a new MCP server for their entire API.

With just two tools, search and execute, the server is able to provide access to the entire Cloudflare API over MCP while consuming only around 1,000 tokens.

The footprint stays fixed no matter how many API endpoints exist.
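
A rough sketch of why the footprint would stay fixed, under stated assumptions: `CATALOG`, `TOOLS`, and the helper functions are hypothetical, sizes are measured in characters rather than tokens, and the `execute` tool is only described, not implemented.

```python
import json

# Hypothetical operation catalog; a real API would have far more entries.
CATALOG = {
    "zones.list": "List zones in the account",
    "dns.records.list": "List DNS records for a zone",
    "dns.records.create": "Create a DNS record in a zone",
}

# The only tool descriptions the model sees up front in the two-tool design.
TOOLS = [
    {"name": "search", "description": "Find API operations matching a query."},
    {"name": "execute", "description": "Run code against the SDK and return its result."},
]


def search(query: str, catalog: dict) -> list:
    """Return names of operations whose name or summary mentions the query."""
    q = query.lower()
    return [
        name
        for name, summary in catalog.items()
        if q in name.lower() or q in summary.lower()
    ]


def tool_footprint() -> int:
    """Characters sent up front with two tools; does not depend on the catalog."""
    return len(json.dumps(TOOLS))


def per_operation_footprint(catalog: dict) -> int:
    """Characters sent up front if every operation were its own tool."""
    tools = [{"name": n, "description": d} for n, d in catalog.items()]
    return len(json.dumps(tools))


# Growing the catalog inflates the one-tool-per-operation listing,
# while the two-tool listing stays the same size.
big_catalog = {f"op.{i}": f"Operation number {i}" for i in range(1000)}
print(tool_footprint(), per_operation_footprint(CATALOG), per_operation_footprint(big_catalog))
print(search("dns", CATALOG))
```

The catalog is consulted only when the model calls `search`, so adding endpoints changes what a search can return, not what is loaded into context at the start.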

This is a good example of what I've been talking about a lot recently.

The models are getting marginally better, but the traditional software engineering around the models squeezes out huge wins by equipping them better and better.

The only moat left is money.

Quote, every morning, a few thousand people wake up and ship something.

A tool, a SaaS, a newsletter, an app that does the thing the other app does, but slightly differently.

They post it on Hacker News.