Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

John Collison

πŸ‘€ Speaker
587 total appearances

Appearances Over Time

Podcast Appearances

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

Okay, so then let's say we build this thing.

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

Just cameras, camera encoder, pixels go in, trajectory go out.

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

The quality is sufficient to, you know, drive in the normal case.

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

It's not sufficient to deal with the long tail of, you know, the edge cases and hit the high bar of superhuman safety that we require.

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

So then you start asking the question, what else do you need?

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

Yes.

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

And if all you did was kind of observing how other people drive when you trained the system, maybe observing just passively how people drive and how they interact, maybe also driving the car yourself and then using imitative learning to train it.

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

Mind that that's not enough.

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

You have to do something in closed loop.

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

You have to do things like RLFT, which is also, you know, parallel to what we see outside.

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

RLFT?

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

RLFT.

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

Reinforcement Learning Based Finetuning.

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

Okay, yes.

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

So similar to the reinforcement learning with human feedback in the LLM world, right?

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

You want to do maybe closed-loop proper, closed-loop driving where you explore all kinds of different situations and then you give it a reward signal to keep it in distribution.

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

For that, then, you need a realistic simulator.

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

Right?

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

You also, you know, if you want to have a good RL system, you need to have an opinion for the reward function.

The a16z Show
From Models to Mobility: Building Waymo with Dmitri Dolgov

This is where the credit comes in, right?