Dmitri Dolgov
๐ค SpeakerAppearances Over Time
Podcast Appearances
You know, could you, knowing what you know now, could you have a successful Waymo in market in 2015?
Or was there some enabling technology?
No.
LLMs are good at text or tokens specifically, and obviously perform best at domains that have some kind of single corpus of text they can work on, like coding, where it's very helpful that everything was just kind of textual already.
And part of the success has been creating textual representations for domains so that we can then, you know, put LLMs against them.
Can you describe how you...
encode the world that you're seeing?
I mean, are you just building a 3D map, like a 3D bitmap, essentially?
Yeah, I think of a simple view of N10 being, you know, pixels go in and car actions come out, which is maybe a bit of an oversimplification, but yeah.
I mean, there's something to it.
You're saying you can take an off-the-shelf model, which has nothing to do with driving to start with, and you'll get these good results.
That's right.
In the nominal case.
Yeah, you should not try it on the streets, but it works.
It's like a talking horse.
It's impressive that it's talking.
That's very interesting on the simulating point.
It's just very hard to simulate for an end-to-end model because it's easier to deal in intermediate representations rather than coming up with a pixel-perfect view of the world.
Yeah, yeah, yeah.
What...