Tristan Redman
đ€ SpeakerAppearances Over Time
Podcast Appearances
And those of you all who are loyal listeners of our show know that here at The Global Story, we dive deep every weekday on one big story at the intersection of where the world and America meet. If you're also looking for a way to catch up on the latest news from around the world, then check out our sister show, The Global News Podcast. You can find it wherever you listen to your favorite shows.
Our episode today was produced by Zandra Ellen and Valerio Esposito. It was edited by James Shield and mixed by Travis Evans. Our senior news editor is Chana Collins. Our digital producer is Matt Pintis. And I'm Asma Khalid. Thanks as always for joining us and we'll talk to you again soon.
And I'm Tristan Redman, and we're here with a bonus episode for you from the Global Story podcast. The world order is shifting. Old alliances are fraying and new ones are emerging. Some of this turbulence can be traced to decisions made in the United States. But the US isn't just a cause of the upheaval. Its politics are also a symptom of it. Every day we focus on one story, looking at how America and the world shape each other.
So we hope you enjoy this episode and to find more of our show, just search for The Global Story wherever you get your BBC podcasts.
Most of us have interacted with an AI chatbot for something by now. ChatGPT, Grok, Gemini, there are many. But recently Anthropic, a leading AI company and parent of the chatbot Claude, announced they've created a model that they say is too dangerous to be released to the public.
Antropologit sanovat, ettÀ Claude Mythos-preview on huonosti hyvÀÀ hakemuksesta. Niin hyvÀÀ, ettÀ meitÀ ei voi tuntua, ettÀ se on oikein hyvÀÀ. Pankit, joilla on Goldman Sachs ja JP Morgan, ovat varoja riskeistÀ.
Anthropics own researchers say they noticed mythos was capable of being sneaky, defying instructions and covering its tracks. Has the moment arrived when we should all be terrified of an autonomous sneaky hacking machine? Or is this all a marketing trick from an industry built on hype and bluster?
I'm Tristan Redman in London. And today on The Global Story, just how dangerous are AI models becoming? And should we trust the companies making them to tell us the truth about the risks they pose? I'm Lily Jamali. I'm the North America technology correspondent for BBC.
Lili, olemme iloisia, ettÀ olet tÀÀllÀ tÀnÀÀn, koska se tuntuu niin, ettÀ tehtÀvÀnÀ on monimutkaisen tÀrkeÀstÀ teollisuudessa. Viime viikolla A.I.-hankkija Anthropic antoi, ettÀ aloittaa uuden A.I.-modellin. Antropik sanoi, ettÀ tÀmÀ uusi A.I. oli liian vaarallista laittaa yleisölle. Miksi tÀmÀ oli niin vaarallista?
Is Mythos only designed for this type of security usage or is it actually similar to AI tools that all of us are used to already, you know, like ChatGPT or the regular Claude?
Okei, joten kÀytÀnnössÀ se ei ole vain erittÀin vahva asia, vaan se toimii myös jonkin verran autonomiaan, joten se ei tarvitse ihmisen tietoa johtamaan sen tai lÀhettÀmÀÀn sen kautta. Se on kÀytÀnnössÀ vain se, ettÀ kÀy ja hakkaa tuon vahvistuksen ja se tekee sen, miten se tehdÀÀn, itsenÀisesti. Onko se oikein?
Okay, so mythos, potentially extremely powerful hacking tool. Now, Lilly, I have learned something today whilst I was reading in before our conversation, and that is the word alignment, and that mythos is potentially threatening because it is non-aligned, is that right? And essentially that means that it can be sneaky. Can you explain this to me, though, please?
There very well could be a day where we just don't have control, and so hopefully this thing will do the right thing when presented with various circumstances. But could that be as simple as a bad actor having access to an AI like this Claude Mythos, and the bad actor saying, hack that power grid that belongs to my adversary, and the AI simply declining to do it because of its learned philosophy or morality or sense of values?
Well, I find this very interesting, Lilly, because I do have at least one philosopher in my family. And I'm not sure philosophers have always felt hugely sought after in the job market, but maybe that might be changing for them. So maybe that's good news. Send them our way. The opportunities abound. I'll get them maybe to move to Silicon Valley and we'll see what happens.
TÀmÀ ei ole vain teknologian tarina, onko se, Lilly? TÀmÀ on myös yrityksen tarina. Emme kaikki ole vielÀ tuttuja nÀistÀ yrityksistÀ. Ne ovat suuria yrityksiÀ. MikÀ on Anthropixin reputointi ja miten he voisivat nÀhdÀ itseÀÀn?
Joka tapauksessa, jos mitojen esitykset on uskottavissa, me saamme hetken, jolloin meillÀ on mahdollisesti todella vahva AI-tulko, joka on mahdollisesti hankalaa ja voisi olla vaikeampi ihmisten kontrollista. Olemme saamassa, sanotaanko, Jurassic Parkin hetkeÀ. Hetken, jolloin jotain ihmisten tekemistÀ saa huomioon ihmisen kontrollista. Onko se jotain, jota me riskitÀmme tÀllÀ hetkellÀ?
I'm delighted that you raised this, Lili, because our editor, James Shield, he won't like me mentioning his name, but I have to, because he loves Jurassic Park. And he has thoughts on this. So, I mean, he was talking about...
The dangers of owning a theme park full of velociraptors, I should say fictional dangers, of owning a theme park full of velociraptors and Tyrannosaurus rexes, is that two things can happen. One is that a malicious person can set them loose to get rich.
Stealing Velociraptors DNA and selling it off or whatever it is they decide to do. Or the dinosaurs themselves might become smart enough to detect the weaknesses in the security system, in the electric fences, and learn how to open the doors and get out themselves. Is that a helpful way to understand the security risks of an incredibly powerful AI tool like Mythos?
How worried should we be, Lily, if I'm sitting in the park having my sandwich? Could I get taken down by Mythos AI imminently?