Yoko Li

Speaker
68 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 1
Confidence: Medium

Podcast Appearances

The a16z Show
AI, Design, and the Power of Open Models

One thing we kept wondering about with this release is how the open source model is so small.

It's 9.3 billion parameters.

Like previously, a SOTA is probably like 80 billion parameters.

It's like 9x the difference.

How did you do it?

I think you already kind of touched on this.

The new open source model is very exciting in that it unlocked a lot of new use cases.

It's very photorealistic.

I think it can generate up to 2K with a smaller model too.

Obviously, there's very precise layout control as well.

Do you want to talk about some of the net new use cases that are unlocked by this model?

One of the things that stood out to us, which is what the community has been chatting about, is how there's new ways of processing data as you're training the model, which is like you kind of let the model learn what is a bounding box and how to do the layering and color palettes.

Do you want to talk more about some of the innovations you had during the training process?

What made this model so good with these differentiating features?

I saw a lot of JSON prompting in your technical blog, which is very unique.

And as I was trying the model, it seemed like it was translating the text prompt into a JSON representation with implicit structure.

Right.

Do you think JSON is a representation for image models going forward, or do you think there's another representation there?

And for people who want control or like consistency, I think that'd be key.

Yeah.
