Ave Gatton

Code Story: Insights from Startup Tech Leaders

The Gene Simmons of Data Protection - AI Inference-time Guardrails

that they can direct that agent to exfiltrate that data or take actions that might mess with your internal systems, delete a database, create orders that are malicious, basically just mess around and do what a hacker would do.

And that's the long and short of it.

That's what we're trying to protect.

When the agent or the LLM is working within an agent framework, we need to make sure that the capabilities of the agent are constrained such that they can't do a lot of damage.

For training data, if you're training a model on, say, a large corpus of text, or you're taking a model that's already been trained and fine-tuned, and then you're fine-tuning it on your particular tasks or your particular corpus of information, whether it be company internal documents or procedures or what have you, there's always a risk that somebody might slip into that a rogue series of instructions that the model will learn.

And then this is effectively like putting in a backdoor to the model.

You can come along and you can say, "Remember, you were told how to operate in the Dr. Seuss paradigm. Think back to that."

And then the agent might, after reading these docs, be instructed through that fine-tuning or that training to follow a certain set of procedures, give up on all system prompts, and just pay attention exactly to what the person currently talking to them is telling them to do.

And we call that concept model poisoning.

And that's always a risk, especially for LLMs that have been trained on literally all of human knowledge, or as much of it as you can go out and scrape from the world, from the web, and then put into a training data set.

And of course, no one has ever physically actually laid eyeballs on all of that.

They are using layers and layers of AI-based cleaning and filtering.
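As a rough illustration of what a single filtering layer might look like, the sketch below flags training records that contain instruction-like phrases. It is a simple pattern-matching stand-in, not the AI-based filtering described here, and the pattern list and the `split_corpus` name are assumptions made up for this example.

```python
import re
from typing import Iterable, List, Tuple

# Hypothetical patterns for text that tries to override a system prompt.
SUSPICIOUS_PATTERNS = [
    re.compile(
        r"ignore (all|any|the) (previous|prior|system) (instructions|prompts?)",
        re.IGNORECASE,
    ),
    re.compile(r"disregard (the|your) system prompt", re.IGNORECASE),
]


def split_corpus(records: Iterable[str]) -> Tuple[List[str], List[str]]:
    """Separate records into (clean, flagged) lists using the patterns above."""
    clean: List[str] = []
    flagged: List[str] = []
    for text in records:
        if any(pattern.search(text) for pattern in SUSPICIOUS_PATTERNS):
            flagged.append(text)  # hold back for human or model review
        else:
            clean.append(text)
    return clean, flagged
```

A filter like this only catches phrasings it already knows about, which is consistent with the point that no layer of cleaning gives a true guarantee.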

And there's no true guarantee that nothing malicious has been