Menu
Sign In Search Podcasts Libraries Charts People & Topics Add Podcast API Blog Pricing

Grant Harvey

πŸ‘€ Speaker
See mentions of this person in podcasts
6730 total appearances
Voice ID

Voice Profile Active

This person's voice can be automatically recognized across podcast episodes using AI voice matching.

Voice samples: 2
Confidence: High

Appearances Over Time

Podcast Appearances

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

Yeah.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

That makes sense.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

What about harnesses, I guess?

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

Because we were hearing a lot about harnesses in the context of agents, and we're wondering... And benchmarking.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

Yeah, if it's worth companies building their own versus something off the shelf.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

I mean, do you have any insight into that and your perspective there?

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

So why are eval sets so hard to create?

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

I was going to say, you should hire me then because I feel like I'm a very lazy, clever person.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

So I feel like I would try to find the easiest way to do stuff.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

Technically it passes, yeah.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

It did what you told it.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

I know which one you're talking about.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

I like that.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

What's your recommendation for someone who, you know, needs is maybe starting to build their own evals and assess things like what what's what do you do or what's your perspective on the best way to eval or write evals, I guess?

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

Yeah, I love it.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

Guilty of that for sure.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

Oh, well, this leads me to ask another question, which is, do RL environments eventually replace benchmarks or like in terms of agentic settings?

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

Like, what's your take there?

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

So you're benchmarking it and you're saying, hey, this is what we're seeing and this is where you really need some help.

The Neuron: AI Explained
Inside the Secret Labs Where AI Learns to Work

And then that's where you kind of... You need some law and some creativity.