Alpin Yukseloglu
π€ SpeakerAppearances Over Time
Podcast Appearances
I think
Part of their thesis, and I think part of where the world is clearly going, is that there's more software that is being generated than can be possibly reviewed by humans.
And formal verification is one way to quickly check whether a component of software is actually doing what it says it's doing.
And then obviously in the context of security, it can, especially if the spec is written correctly, it can be a step function change.
Now, it's not a silver bullet in the sense that you still have to write the spec for the formal verification.
So there's still surface for bugs to get in there.
But, you know, you can make the case that actually the surface for bugs in writing a formal verification spec might be lower than writing the code to start with.
And definitely with time, I think all of the best models, all of the best software will probably end up being formally verified.
And if you take the vantage point of an agent and you have two options to choose from, one of them is formally verified and one of them is not.
And the formally verified one might just gain preference just because it has all these nice properties.
The top level goal that we have is to help the model labs develop the crypto capabilities of their models.
And I think that security is one component of that and maybe an increasingly urgent component of that.
But there's so much that EVM Bench does not touch.
So there are other ecosystems and stacks.
There is a protocol layer, which we talked about, where maybe arguably it's more important from a security standpoint that the Ethereum protocol is secure rather than any specific solidity contract.
There are out of protocol components, like how do you land a transaction on chain?
How do you deal with the mempool?
How do you deal with sort of the non-deterministic parts of crypto?
And then obviously there are components that are even farther on the verifiable and intelligence-bound trajectory, like for example, around cryptography and around zero-knowledge proofs, etc.,
And I think all of these are extremely fertile soil for future work.