Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Mohammad Norouzi

πŸ‘€ Speaker
359 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 1
Confidence: Medium

Appearances Over Time

Podcast Appearances

The a16z Show
AI, Design, and the Power of Open Models

So throughout training, we always measure text accuracy and we update very detailed changes to the model and data and see how that resolves in performance.

The a16z Show
AI, Design, and the Power of Open Models

So I would say

The a16z Show
AI, Design, and the Power of Open Models

A lot of it is really listing all the possible changes and very carefully tuning each element of the model and see what happens.

The a16z Show
AI, Design, and the Power of Open Models

Obviously, we try to gather as much data as possible.

The a16z Show
AI, Design, and the Power of Open Models

One of the standard recipes in the industry is that

The a16z Show
AI, Design, and the Power of Open Models

We take images and we turn them to text using visual language models.

The a16z Show
AI, Design, and the Power of Open Models

The very first models we were training three, four years ago would be based on the alt text that you can find on the internet.

The a16z Show
AI, Design, and the Power of Open Models

That is, each image on the internet may have an alt text field associated with it, which describes what's in the image.

The a16z Show
AI, Design, and the Power of Open Models

But the problem is the alt text is often very short or inaccurate.

The a16z Show
AI, Design, and the Power of Open Models

And what we do now is we train models to go from image to text.

The a16z Show
AI, Design, and the Power of Open Models

And in this case, image to text with detailed bounding box information, detailed element information.

The a16z Show
AI, Design, and the Power of Open Models

If we hear about text and we really want to make sure all the text in the image is correctly described,

The a16z Show
AI, Design, and the Power of Open Models

And then we go from text to image backwards.

The a16z Show
AI, Design, and the Power of Open Models

It's kind of interesting.

The a16z Show
AI, Design, and the Power of Open Models

We gather all the images from the internet.

The a16z Show
AI, Design, and the Power of Open Models

Some of them may have alt text, some of them may not have alt text.

The a16z Show
AI, Design, and the Power of Open Models

And then we use AI to go from image to text.

The a16z Show
AI, Design, and the Power of Open Models

And then we train another AI model to go from text to image.

The a16z Show
AI, Design, and the Power of Open Models

So that's one of the key recipes that results in very good models.

The a16z Show
AI, Design, and the Power of Open Models

It's a very good question.