Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Blog Pricing

Ave Gatton

๐Ÿ‘ค Speaker
190 total appearances

Appearances Over Time

Podcast Appearances

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

that they can direct that agent to exfiltrate that data or take actions that might mess with your internal systems, delete a database, create orders that are malicious, basically just mess around and do what a hacker would do.

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

And that's the long and short of it.

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

That's what we're trying to protect.

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

When the agent or the LLM is working within an agent framework,

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

We need to make sure that the capabilities of the agent are constrained such that they can't do a lot of damage.

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

For training data, if you're training a model on, say, a large corpus of text, or you're taking a model that's already been trained and fine-tuned, and then you're fine-tuning it on your particular tasks or your particular corpus of information, whether it be company internal documents or procedures or what have you, there's always a risk that somebody might slip into that a rogue series of instructions that the model will learn.

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

And then this is effectively like putting in a backdoor to the model.

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

You can come along and you can say, remember you were told how to operate in the Dr. Seuss paradigm.

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

Think back to that.

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

And then the agent might, after reading these docs, be instructed through that fine tuning or that training

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

to follow a certain set of procedures, give up on all system prompts and just pay attention exactly to what the person is, what the person currently talking to them is telling them to do.

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

And that would be, we call that concept model poisoning.

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

And that's always a risk, especially for agent where, sorry, LLMs that have been trained on

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

literally all of human knowledge, or as much of it as you can go out and scrape from the world, from the web, and then put it into a training data set.

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

And of course, no one has ever physically actually laid eyeballs on all of that.

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

They are using layers and layers of AI-based cleaning and filtering.

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

And there's no true guarantee that nothing malicious has been

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

put in there or that attackers can't find malicious ways to exploit whatever it's ingested.

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

And this is a constant battle.

Code Story: Insights from Startup Tech Leaders
The Gene Simmons of Data Protection - AI Inference-time Guardrails

So that's the training side of things.