Mohammad Norouzi

👤 Speaker

359 total appearances

Podcast Appearances

The a16z Show

AI, Design, and the Power of Open Models

So throughout training, we always measure text accuracy, and we make very detailed changes to the model and data and see how that results in performance.

472.962
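The speaker does not say how text accuracy is computed. As a minimal sketch, assuming the text rendered in a generated image has already been read back by some OCR step (not shown here), one plausible score is a normalized character edit distance between the requested text and the recognized text. The function names `edit_distance` and `text_accuracy` are hypothetical and chosen only for illustration.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum inserts, deletes, and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete from a
                            curr[j - 1] + 1,      # insert into a
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def text_accuracy(requested: str, recognized: str) -> float:
    """Score in [0, 1]; 1.0 means the recognized text matches the request.

    Assumption: case and repeated whitespace are ignored before comparing.
    """
    req = " ".join(requested.lower().split())
    rec = " ".join(recognized.lower().split())
    if not req and not rec:
        return 1.0
    return 1.0 - edit_distance(req, rec) / max(len(req), len(rec))
```

Tracking a score like this on a fixed prompt set after each change to the model or data is one way to see whether the change helped or hurt text rendering.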

So I would say

485.974

A lot of it is really listing all the possible changes, very carefully tuning each element of the model, and seeing what happens.

487.297

Obviously, we try to gather as much data as possible.

497.012

One of the standard recipes in the industry is that

501.859

We take images and we turn them to text using visual language models.

506.635

The very first models we were training three, four years ago would be based on the alt text that you can find on the internet.

511.941

That is, each image on the internet may have an alt text field associated with it, which describes what's in the image.

518.208

But the problem is the alt text is often very short or inaccurate.

525.917

And what we do now is we train models to go from image to text.

530.682

And in this case, image to text with detailed bounding box information, detailed element information.

535.992

If we care about text and we really want to make sure all the text in the image is correctly described,

542.024

And then we go from text to image backwards.

548.637

It's kind of interesting.

551.001

We gather all the images from the internet.

551.803

Some of them may have alt text, some of them may not have alt text.

553.526

And then we use AI to go from image to text.

556.651

And then we train another AI model to go from text to image.

559.897

So that's one of the key recipes that results in very good models.

562.822
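The recipe described above (gather web images, caption them with an image-to-text model, then train a text-to-image model on the resulting pairs) can be sketched as a data-preparation step. This is only an illustration under stated assumptions: `WebImage`, `Element`, `DetailedCaption`, `caption_to_prompt`, `build_training_pairs`, and `stub_captioner` are all hypothetical names, the captioner is a stand-in that returns fixed output, and the actual models and training loop are not shown.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Tuple


@dataclass
class WebImage:
    url: str
    alt_text: Optional[str] = None  # often short, inaccurate, or missing


@dataclass
class Element:
    label: str                               # e.g. "sign", "logo"
    box: Tuple[float, float, float, float]   # assumed (x_min, y_min, x_max, y_max)
    text: Optional[str] = None               # text visible in the element, if any


@dataclass
class DetailedCaption:
    description: str
    elements: List[Element] = field(default_factory=list)


# A captioner is any callable that maps an image to a detailed caption.
Captioner = Callable[[WebImage], DetailedCaption]


def caption_to_prompt(caption: DetailedCaption) -> str:
    """Flatten a detailed caption into one training prompt string."""
    parts = [caption.description]
    for el in caption.elements:
        part = f"{el.label} at {el.box}"
        if el.text:
            part += f' reading "{el.text}"'
        parts.append(part)
    return "; ".join(parts)


def build_training_pairs(images: List[WebImage],
                         captioner: Captioner) -> List[Tuple[str, str]]:
    """Image to text: caption every image, ignoring its alt text.

    The returned (prompt, image_url) pairs are what a text-to-image
    model would then be trained on (training itself is out of scope).
    """
    return [(caption_to_prompt(captioner(img)), img.url) for img in images]


def stub_captioner(image: WebImage) -> DetailedCaption:
    # Stand-in for a trained image-to-text model; output is hard-coded.
    return DetailedCaption(
        description="A storefront with a hand-painted sign",
        elements=[Element("sign", (0.1, 0.2, 0.9, 0.4), text="OPEN")],
    )
```

In this sketch, images with and without alt text are treated the same way, since the generated caption replaces the alt text as the training prompt.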

It's a very good question.

588.052