Nufar Gaspar
๐ค SpeakerAppearances Over Time
Podcast Appearances
If you want to be extra good, you can give it permission to post draft on Slack or DM to yourself for approval before sending.
Moving on to the next layer, the worst thing that happens with your agent OS isn't that it fails, it's that it works confidently and wrongly, and you ship the output before you noticed.
And verification is knowing what to check, and every agent job has its own very quick test.
If you draft emails, you need to basically tone match or you need to check that the facts are correct and so on.
If you're doing data analysis, you have to check the numbers.
So it's very specific to the different tasks or skills that you give the agents.
I encourage you to do at least three to five checks.
In many cases, it's under a minute to run and it will save you a lot of grief after.
And they do get faster with practice.
In the first week, it might be slow and it might get a little bit frustrating, but the more time goes by and the more you trust your agent on low stakes, you can verify just the high stakes one.
And verification is not just about the individual output.
You also need to improve the system over time.
So periodically, I want you to do a retrospective with your agents, audit the system and figure out which parts are underserving you.
And maybe there are skills that are never being called, or maybe there are context files that become stale.
Maybe some agents need updated instructions.
So I want you to, similarly to how you would re-evaluate your employees or your company periodically, you will do that also with your agentic systems.
You can do that either indirectly by auditing each and every agent that you build on top of the system, or you can go directly to audit the OS layers themselves.
And the great thing about the tools is that they let you just do that directly from the tool.
You can literally ask them what is not being used and so on.
And the reason why I'm talking about this audit discipline in the AgentOS program, it's because it's something that without it, your OS has a shelf life of maybe eight weeks before everything goes stale.