research paper on catching AI system scheming in Podcasts
A research paper published by Apollo Research and OpenAI on detecting AI scheming.
Mentions Over Time
1 mention
Mentions in Podcasts
LessWrong (Curated & Popular)
"How AI Is Learning to Think in Secret" by Nicholas Andresen
That transcript comes from a recent paper published by researchers at Apollo Research and OpenAI on catching AI system scheming.