Andy Halliday
So yeah, Opus 4.5 matches Sonnet 4.5.
So Sonnet is the next step down in the model lineup at Anthropic.
It matches Sonnet 4.5 using 76% fewer tokens.
So one quarter of the tokens.
At full throttle, with the thinking effort turned up, it beats Sonnet by 4.3 percentage points while using 50% fewer tokens, half the tokens.
So there's real efficiency built into 4.5.
And I'm not sure what steps they took.
There's no explanation yet, but I'm sure people will be studying this.
How did they get such an incredible improvement in the number of tokens consumed in the process of thinking and still beat the scores?
Yeah, there's some tuning there that is not completely revealed yet, but I'm sure, you know, Carl probably read the system card and there's a lot of information available there.
But again, this just came out; we're only talking about it today because it came out yesterday.
Yeah, let me explain how prompt caching works.
So if you're in a coding session or just in a conversation with a model and you have something that is like a guidebook for what you're talking about in that conversation, you put that into the prompt.
And then what that does is it becomes a part of the prompt that gets sent with each additional query.
And so you're basically consuming a lot of tokens repeatedly that you don't need to.
Prompt caching takes something that you set as a context for this whole conversation and puts it in memory, basically.
So you're not feeding it through every single time as new context for the inference run.
So prompt caching basically economizes on the number of tokens that are being used in every turn of the conversation, by a special system that I don't fully understand. Like, how do they actually manage this when they're feeding all of these tokens in through the system? I'm not clear about how that works, but the idea is very fundamental: it keeps that context live without you having to feed it in as a prompt each time. Yeah.
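[Editor's note: as a rough sketch of what's being described, here is how prompt caching looks with the Anthropic Python SDK. The model name and file path are placeholders, and the exact cache lifetime and billing details are not covered in the episode; the key idea is marking the large, unchanging context with cache_control so it isn't reprocessed from scratch on every turn.]

```python
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

# The big "guidebook" context for the conversation (placeholder file name).
guidebook = open("project_guidebook.md").read()

response = client.messages.create(
    model="claude-opus-4-5",  # illustrative model name
    max_tokens=1024,
    system=[
        {
            "type": "text",
            "text": guidebook,
            # Mark this block as cacheable so later turns that send the same
            # prefix can reuse the server-side cached version instead of
            # paying to process all of those tokens again.
            "cache_control": {"type": "ephemeral"},
        }
    ],
    messages=[
        {"role": "user", "content": "Summarize the key design constraints."}
    ],
)
print(response.content[0].text)
```

On later turns in the same conversation, requests that start with that same cached prefix can hit the cache rather than re-sending the guidebook for full processing, which is where the per-turn token savings described above come from.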