Kari Briski

👤 Person
381 total appearances

Podcast Appearances

The Neuron: AI Explained
NVIDIA’s Kari Briski on How to Use NVIDIA Nemotron Open-Source AI

Yeah, so I mentioned this idea of distillation, the algorithms to distill models in.

By the way, there's lots of different ways to distill models.

There's always kind of new algorithms.

But you do need the larger model to teach the smaller model.
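As a rough illustration of a larger model "teaching" a smaller one, here is a minimal sketch of one common distillation objective: the student is penalized for diverging from the teacher's temperature-softened output distribution. This is only one of the many distillation approaches alluded to, not necessarily the one used for Nemotron; the function names, the temperature value, and the example logits are all assumptions made for illustration.

```python
import math

def softmax(logits, temperature=1.0):
    """Convert raw logits to probabilities, softened by a temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    Hypothetical helper: a textbook soft-label distillation loss,
    not a specific library's API.
    """
    p = softmax(teacher_logits, temperature)  # teacher's soft targets
    q = softmax(student_logits, temperature)  # student's predictions
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2

# Made-up logits: the loss shrinks as the student matches the teacher.
teacher = [1.0, 2.0, 3.0]
far_student = [3.0, 2.0, 1.0]
close_student = [1.1, 2.0, 2.9]
print(distillation_loss(teacher, far_student) > distillation_loss(teacher, close_student))
```

In practice this term is usually minimized with gradient descent and often mixed with an ordinary loss on ground-truth labels.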

You can also take a larger model, and I mentioned it during neural architecture search.

And so you're basically, it's like pulling apart the Legos and kind of figuring out which pieces really matter to the design and then putting it back together and using 30% less pieces.

And so that's basically what you do when you do neural architecture search.

And so you distill it down to a smaller model, but then you retrain it back to the same accuracy.

So there is a bit of like retraining back into it.

And so the thing is, there's really great success with small models or these SLMs.

You do give up a little bit though, just a little bit, as I mentioned earlier, like a little bit of robustness, a little bit of capacity to learn, but they are great for specializing for particular tasks.

So again, I'm talking about systems of models.

If you have a model that just needs to do a routing, so, you know, someone asks a question and I need to route to whatever the best tool is or whatever the best model that is to answer this question, you don't have to understand the world.

You just have to understand your environment and what its tasks are.

And you can be a really great router.

But with some reasoning behind this, you're making great decisions based on that routing instead of a rules-based routing.
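To make the contrast between rules-based routing and model-based routing concrete, here is a minimal sketch. The tool names, the keyword rules, the prompt wording, and the `small_model` callable are all hypothetical placeholders; in a real system `small_model` would be a call to whatever small language model you deploy.

```python
from typing import Callable, Dict

# Hypothetical tool registry: the router only needs to know its own
# environment and tasks, not the whole world.
TOOLS: Dict[str, str] = {
    "calculator": "arithmetic and numeric questions",
    "search": "questions about current events or facts",
    "code_model": "programming questions",
}

def rules_based_route(question: str) -> str:
    """Brittle baseline: hand-written keyword rules."""
    q = question.lower()
    if any(ch.isdigit() for ch in q):
        return "calculator"
    if "code" in q or "python" in q:
        return "code_model"
    return "search"

def model_route(question: str,
                small_model: Callable[[str], str],
                default: str = "search") -> str:
    """Ask a small model to pick a tool; fall back if the answer is invalid."""
    tool_lines = "\n".join(f"- {name}: {desc}" for name, desc in TOOLS.items())
    prompt = (
        "Pick the best tool for the question. Answer with the tool name only.\n"
        f"{tool_lines}\nQuestion: {question}\nTool:"
    )
    choice = small_model(prompt).strip().lower()
    return choice if choice in TOOLS else default

# Stand-in for a real SLM call, used here only so the sketch runs.
def fake_slm(prompt: str) -> str:
    return " code_model\n"

print(model_route("Why does my loop never terminate?", fake_slm))
```

Validating the model's answer against the known tool list keeps a malformed response from routing the request somewhere that does not exist.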

So, yeah, SLMs have had that purpose.

But you do need that larger model to distill down into it.

And, uh, where are you running that?