Kari Briski

👤 Person
381 total appearances

Podcast Appearances

The Neuron: AI Explained
NVIDIA’s Kari Briski on How to Use NVIDIA Nemotron Open-Source AI

And I think that's why, you know, you kind of introduced us and we have many size models of Nemotron because you do need larger models to inform and train the smaller models.

And so you do and larger models are a little bit more robust and they can kind of generalize a little bit more even in a domain and they have a bigger capacity to learn a new domain.

So I think that there's room for all kinds of models.

Yeah, so there was a little bit of thought that went into, it seems kind of natural that we did a small, medium, large, but there was a little bit more thought that went into it.

For our nano, we really wanted it to fit into smaller GPU formats, right?

So that anyone that's either renting, like what I'd say, an older hardware architecture on the cloud or running on a laptop that has a GPU in it, we want to be able to run that with enough memory and with the efficiency that you need and still have a really great reasoning model.

And so that was the Nano.

And so it's a 9 billion parameter model.

At floating point 16, you really only need like an 18 gig memory requirement, right?

And then again, you can use our tools to quantize it down even further and make it even smaller and more efficient.
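
The 18 GB figure follows from simple arithmetic: 9 billion parameters at 2 bytes each in FP16. Below is a minimal sketch of that estimate. It assumes weights only (no KV cache, activations, or runtime overhead) and decimal gigabytes. The function name and the bytes-per-parameter table are illustrative, not part of any NVIDIA tool.

```python
# Rough weight-memory estimate by numeric precision.
# Assumption: weights only; real deployments also need memory for the
# KV cache, activations, and framework overhead.
BYTES_PER_PARAM = {"fp16": 2.0, "fp8": 1.0, "int4": 0.5}


def weight_memory_gb(num_params: float, precision: str) -> float:
    """Return the approximate weight footprint in decimal gigabytes."""
    return num_params * BYTES_PER_PARAM[precision] / 1e9


for precision in ("fp16", "fp8", "int4"):
    print(f"{precision}: {weight_memory_gb(9e9, precision)} GB")
```

Under these assumptions, halving the bytes per parameter halves the footprint, which is why quantizing further makes the model fit on smaller GPUs.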

Yeah, so that's kind of what we did for the Nano.

For the Super, we were thinking we wanted to keep it within a single Hopper GPU, data center GPU.

And you can easily do that.

You can run it in FP8 and that's easy.

For Ultra, we actually had this debate with our engineering team because a lot of these, again, a lot of the really great large models are larger than a node.

And let me just explain what a node is.

If you don't know, it's eight GPUs in a box.

So most of these really large models are multi-node.

So you have to take up more than eight GPUs just to run one model.
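
To make the "larger than a node" point concrete, here is a rough sketch of how many GPUs and nodes a model's weights would occupy. The eight-GPUs-per-node figure comes from the transcript. The 80 GB per-GPU memory and the 1-trillion-parameter model are hypothetical inputs chosen for illustration, and the result is a lower bound because it counts weights only.

```python
import math

GPUS_PER_NODE = 8  # "eight GPUs in a box", per the transcript


def gpus_needed(weight_gb: float, gpu_memory_gb: float) -> int:
    """Lower bound on GPUs required to hold the weights alone."""
    return math.ceil(weight_gb / gpu_memory_gb)


def nodes_needed(num_gpus: int) -> int:
    """Number of eight-GPU nodes required to supply that many GPUs."""
    return math.ceil(num_gpus / GPUS_PER_NODE)


# Hypothetical example: 1 trillion parameters at FP16 (2 bytes each)
# on GPUs assumed to have 80 GB of memory each.
weight_gb = 1e12 * 2 / 1e9          # 2000 GB of weights
gpus = gpus_needed(weight_gb, 80)   # 25 GPUs
print(gpus, nodes_needed(gpus))     # more than one node
```

With these assumed numbers the weights alone span more than eight GPUs, so the model cannot be served from a single node.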