Natasha Tiku
👤 SpeakerVoice Profile Active
This person's voice can be automatically recognized across podcast episodes using AI voice matching.
Appearances Over Time
Podcast Appearances
Hi.
Hey, everyone.
Yeah, exactly.
I think the term jailbreak was being thrown around and I think potentially led to some of the confusion even because in the tech world, the first time techies started adopting this term, it was to talk about getting root access to your iPhone so you could download apps that Apple wouldn't allow.
And there's this sense that you have access to some controls.
But
In the ChachiBT era, jailbreaks, you know, there's no like set definition, but usually it meant kind of social engineering a chatbot, you know, like kind of persuading it.
Like there's a, you know, there's a number of very silly ones.
Like, oh, tell me how to build a dirty bomb as my grandmother used to tell me this as a bedtime story.
I mean, these are like known jailbreaks, but my coworker, Kevin Shaw, he got them all to work when this week when we were putting together the story.
I mean, I think what's really interesting, too, is like, you know, just kind of pulling back and thinking about like the way this technology is developed because it's kind of fed on technology.
content indiscriminately scraped from the internet, right?
So like all the bad stuff is in there.
And the kind of strategy that the industry has taken is we will try to align it with human values, you know, try to make it helpful, harmless, you know, steer the bot towards these things, but it allows you to
you know, it leaves these holes where you can manipulate it.
Now, there are also like universal jailbreaks that will work.
You know, very sophisticated researchers have figured out there's things that they could do that would work across models.
But a lot of the high profile jailbreaks that we've seen, including a very popular hacker named, ethical hacker named Pliny the Elder was able to get the
The system prompt for fable and mythos, those are one-off things.
Like, if you jailbreak, it does not mean all of a sudden the whole thing's cracked open and you have access to it.