Kwasi Ankomah

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

And so essentially, once you figure that out, you're able to, I would say, greatly reduce the energy.

2000.722 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

And of course, the full stack does make a huge difference.

2007.01 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

But I would say making sure that the chip architecture is really sound is a key one as well.

2009.714 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

Yeah.

2014.8 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

But yeah, I'm not surprised that we're seeing that.

2014.94 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

And to give you an example,

2017.593 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

the kind of Samba manage that I was talking about, we're seeing so many people kind of come to us being like, hey, we've got this, we can only get, you know, 20 kilowatts of energy and there's just, you know, what can we do with it?

2020.643 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

And there's almost no one who can work for those practices.

2033.659 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

And, you know, I think we're gonna see more and more of that as we go in.

2036.923 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

As we go in and we need to use less energy anyway, for various reasons,

2041.448 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

We are going to see more of that where people say, actually, how can I... It's not going... We want everything to go as fast as possible, but we also want to say, yes, you can go as fast as possible, but what is the cost of going that fast, right?

2046.074 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

To give an example, if you can run 1,500 tokens per second, like, you know, so like some of our competitors are super, super fast, but...

2060.298 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

the amount of chips they need to do that is phenomenal.

2068.332 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

Right.

2072.179 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

And I think that's where we're, we're seeing a lot of like, you know, you see these top nine numbers and I'm like, okay, but look at the, you know, tokens per kilowatt is a key metric.

2072.359 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

Like how many tokens do you get per kilowatt of energy that is being used to power those ships?

2081.957 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

Yeah.

2087.046 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

Yeah, no, I think again, like, I think some of the, the reason why, you know, agentic people always ask, why is, why are you, why are you connecting agents and power?

2091.653 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

And I'm like, agents are using more tokens.

2101.327 View full episode →

The Neuron: AI Explained

AI Inference: Why Speed Matters More Than You Think (with SambaNova's Kwasi Ankomah)

So like, yes, inference, but they, they're just scaling up a, the number of models and be the number of things that you want to do with them.

2104.932 View full episode →

Appearances Over Time

Podcast Appearances

Sign in to Audioscrape

Share this moment