Grant Harvey
Podcast Appearances
Yeah, I love it.
Guilty of that for sure.
Oh, well, this leads me to ask another question, which is, do RL environments eventually replace benchmarks, or like, at least in terms of agentic settings?
Like, what's your take there?
So you're benchmarking it and you're saying, hey, this is what we're seeing and this is where you really need some help.
And then that's where you kind of... You need some law and some creativity.
Yeah.
Basically, are you trying to make yourselves irrelevant by making the perfect model, or is there always going to be a harder challenge for you that just scale will require you to solve?
Ooh.
We saw that, and I wanted to make sure it was correct, because that's good.
What then is your, I guess, your timeline for when we'll see agents that can handle, I mean, most knowledge work without human supervision?
I mean, you're basically setting the pace here, it feels like.
Wow.
Yeah, I wouldn't be surprised.
Well, on that point, have you tried Opus 4.5?
Do you think it's the same step function change that everyone else thinks?
That's what I feel like is going to happen.
But I mean, there's a lot of user experience stuff that needs to get solved for regular people to really.
To say nothing of the different tastes, how taste plays into judging all of that.
Could you ever make an RL environment that's like clusters of taste? Like, people who like these, you know, six books will like this type of writing? Like, could you ever, like, train for taste? Yeah, and I mean, I think like