David Sacks
I'm not sure when this idea that you feed the whole model into a context window to train itself and build a new model is going to happen.
But I think there's probably a lot of different architectural paths that could be walked here.
One of which is this idea that you could make much smaller models and then create networks of smaller models that work together, where you ultimately have less energy or less cost per token produced out of an aggregation of models than you did with one single large model.
I've said this probably three or four times now.
There's a lot of work and a lot of opportunity ahead in kind of re-architecting models and re-architecting how models work together to solve problems.
My guess is there's a lot of leadership that he can bring to exploring those paths.
And all it takes is a minor breakthrough and your cost per token drops in half.
That's a tremendous efficiency gain that seems very much on the horizon, because some of the early papers, I think I shared one from MIT a few weeks ago, indicate that there's a lot of room to run here in terms of re-architecting models and deployment of models.