Ryan Sean Adams
π€ SpeakerAppearances Over Time
Podcast Appearances
Somebody will write a bot that says, hey, Claude, OpenClaw, go hack me some contracts.
And then that thing will actually have the capacity to do that because the people that
didn't really think too hard about their security because they didn't need to think too hard about their security, those people will not have a good day.
Let's talk about EVM Bench.
This is the paper, the tool?
You guys call it a tool?
Can you define harness?
I'm pretty sure that's a technical term that I think coders will be aware of, but I'm not.
I see.
So like the harness is kind of like a bootloader to get it started.
But then eventually data will take over.
Data and experience will take over the actual internal like operations of the machine.
Yeah, exactly.
Okay, so what does the tool actually do?
Is the tool the thing, like the agent doing the exploiting or doing the patching?
Or is it just the benchmarking?
Like, talk to me about the actual utility here.
That's the actual benchmark is like you guys have established like you guys can actually measure the thing effectively.
We already got all the data.
Yeah.