Dwarkesh Patel
π€ SpeakerVoice Profile Active
This person's voice can be automatically recognized across podcast episodes using AI voice matching.
Appearances Over Time
Podcast Appearances
They also added a Redis cache to simulate stale data since that's how real e-commerce sites actually work.
These are the kinds of things that you might not have naively thought to do, but that Labelbox can anticipate.
These details really matter.
Small tweaks are often the difference between cool demos and agents that can actually operate in the real world.
So whether it's correcting traces that you already produced or building an entirely new suite of environments, Labelbox can help you turn your RL projects into working systems.
Reach out at labelbox.com slash dvarkash.
All right, back to Richard.
This alternative paradigm that you're imagining.
The experiential paradigm.
Yes.
Yeah.
I guess maybe what I meant to say is human level general continual learning agent.
Yeah.
What is the reward function?
Is it just predicting the world?
Is it then having a specific effect on it?
What would the general reward function be?
I see.
I guess this AI would be deployed...
to like lots of people would want it to be doing lots of different kinds of things.