Kwasi Ankomah
They swapped that out for, like, a Llama 8B.
You don't need 600 billion parameters.
You don't need it.
And that does two things, right, Corey?
Firstly, it speeds up your application because we can run that at a ludicrous tokens per second.
And probably the most important, the cost of inference reduces dramatically, right?
Because you are no longer paying top dollar for that premium model.
You're actually swapping it for something where the difference in cost is, essentially, you're talking 40, 50 times cheaper. So, like I was telling you, if you take that slice of cost that was costing you a million and you just swap that out, it really reduces it. So I think that's kind of the second point: we say that the inference matters, but just the fact that we can swap these models in and out and give you that choice on your hardware starts to really build that power and efficiency story that I like to talk about.
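The savings the speaker describes are simple arithmetic; a quick sketch, using the speaker's own illustrative figures (the $1M slice and the conservative end of the "40-50 times cheaper" range, not measured numbers):

```python
# Illustrative only: numbers are the speaker's examples, not real pricing.
premium_cost = 1_000_000          # annual $ for the workload slice on the premium model
cheaper_factor = 40               # conservative end of "40-50 times cheaper"

swapped_cost = premium_cost / cheaper_factor
savings = premium_cost - swapped_cost

print(swapped_cost)   # 25000.0  -- same slice on the smaller model
print(savings)        # 975000.0 -- roughly 97.5% of the cost removed
```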
Yeah.
Yeah.
It's so cool.
It's a fair question.
You've got two ways of doing this.
If you are working with us on a cloud basis, like anything, you can just pick the model.
But if you actually have this hardware yourself, you can load up what we call essentially a bundle.
So you can choose what you need for your use case.
So then it's there.
Then, at that point, Ryan, you just pick it like you would any other model.
Everything to the end user, once you've done that, is just an OpenAI-compatible API.
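Because the endpoint looks like the OpenAI API to the end user, swapping the underlying model is just changing one string in the request body. A minimal stdlib sketch of that idea (the model names are hypothetical placeholders, not the provider's actual catalog):

```python
# Sketch of the "it's just an OpenAI-compatible API" point: the chat request
# payload is identical whichever model is loaded; only the "model" field changes.
# Model names below are made-up placeholders, not real identifiers.

def build_chat_request(model: str, prompt: str) -> dict:
    """Build an OpenAI-style chat completion request body."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }

premium = build_chat_request("premium-600b-model", "Summarize this ticket.")
swapped = build_chat_request("llama-8b-instruct", "Summarize this ticket.")

# Everything except the model string is unchanged for the caller.
assert premium["messages"] == swapped["messages"]
print(premium["model"], "->", swapped["model"])
```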