Kwasi Ankomah
And this is because we've now got more models that are able to reason.
So we want to use those models as, let's call it, a planner kind of agent that might delegate to some sub-agents.
So that has become very token intensive.
So that's one thing.
The second thing is that because of that,
speed has become more important, right?
So the way we actually talked about it is: if you just do one call to the LLM, one inference, as we discussed, and that call is slow, maybe three to four seconds, you think, okay, I'll take the hit, right?
It comes back from the model and I get my answer.
Now, because agents do so many calls, that delay compounds across every step.
And we've seen this, right?
Like, in the applications I build, we do 10 to 20 calls.
And if each of those 10 to 20 calls takes around three seconds, suddenly you're sitting there waiting quite a long time.
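The arithmetic behind that wait can be sketched as a back-of-envelope model. This is just an illustration of sequential call latency; `total_latency` is a hypothetical helper, not any real API:

```python
def total_latency(num_calls: int, per_call_s: float) -> float:
    """Sequential agent loop: each step waits for the previous call,
    so total latency is simply calls x per-call latency."""
    return num_calls * per_call_s

# One interactive call at ~3 s feels acceptable to a user...
single = total_latency(1, 3.0)   # 3.0 seconds

# ...but an agent making 10 to 20 sequential calls turns that
# into a 30-to-60-second wait for the same user.
low = total_latency(10, 3.0)     # 30.0 seconds
high = total_latency(20, 3.0)    # 60.0 seconds
```

The growth is linear in the number of calls, but from the user's point of view the experience degrades sharply once a single interaction crosses tens of seconds.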
And I can give you an example of like a coding agent, right?
Say a coding agent takes the prompt and thinks about it.
It says, okay, I'm going to hand this to the planner agent.
That planner agent then goes to a sandbox, which runs the code and then feeds the result back for analysis.
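The loop just described can be sketched in a few lines. All the names here (`plan`, `run_in_sandbox`, `analyze`) are hypothetical stand-ins for what would be separate LLM calls and an isolated execution environment, not any real framework:

```python
import contextlib
import io


def plan(prompt: str) -> str:
    """Hypothetical planner agent: in reality an LLM call that
    decides what code to run. Here it returns a fixed snippet."""
    return "print(2 + 2)"


def run_in_sandbox(code: str) -> str:
    """Hypothetical sandbox: executes code and captures its output.
    A real sandbox would run this in an isolated process or container."""
    buf = io.StringIO()
    with contextlib.redirect_stdout(buf):
        exec(code)
    return buf.getvalue().strip()


def analyze(output: str) -> str:
    """Hypothetical analysis step: in reality another LLM call that
    inspects the sandbox output and decides what to do next."""
    return f"sandbox returned: {output}"


result = analyze(run_in_sandbox(plan("add two and two")))
```

Each of those three steps is a round trip in the real system, which is exactly why the per-call latency discussed above multiplies so quickly.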
Yeah.