Burke Holland
When we say agentically, we mean the agent is calling tools.
It's running terminal commands.
It's looking stuff up on the internet.
It's acting like a developer.
But the problem was, you know, even for us, from where I was sitting, I could see that it would get you to a demo or a prototype.
But then it would get stuck on an error, and then you had a problem, because it doesn't understand the error, and neither do you, because you didn't write that code.
And that's a problem.
You can't really sit in both spaces.
You can't have an agent that is half right half the time.
You need it to be like all right all the time, ideally.
And so in November or December, I forget which, Anthropic releases this model called Opus. They've had Opus for a while, but they release Opus 4.5.
And my job is to test these things out, like, what does this tool do?
How does this model work?
I'm testing out Opus 4.5 over the Christmas break.
And the first thing that I did, I was building native Windows tools.
I tried to build a tool that would allow me to resize windows on Windows to a specific width and height.
I'm just testing models inside of Visual Studio Code with Copilot.
And it just one-shots it.
Just one-shots it.