Sleeper Agents
Sifting through hundreds of thousands of hours of indexed videos
Sleeper Agents
Sifting through hundreds of thousands of hours of indexed videos
Sleeper Agents
5
Mentions
176.7K
Views

“A research paper by Evan Hubinger regarding models resisting alignment training.”

“The risk of hidden backdoors or loyalties in AI models that can be activated later.”

“A famous Anthropic paper used as a model organism for deceptive alignment.”

“The concept of hidden malicious behaviors in AI models that trigger under specific conditions.”

“Anthropic research paper discussed regarding model poisoning and Chain of Thought reasoning.”
Arcmira media summary
Arcmira tracks where Sleeper Agents is discussed across indexed YouTube videos, transcripts, channels, and related entities.
A research paper by Evan Hubinger regarding models resisting alignment training.
The risk of hidden backdoors or loyalties in AI models that can be activated later.
A famous Anthropic paper used as a model organism for deceptive alignment.
The concept of hidden malicious behaviors in AI models that trigger under specific conditions.
Anthropic research paper discussed regarding model poisoning and Chain of Thought reasoning.
Arcmira tracks 5 indexed media appearances or mentions for Sleeper Agents, tied to source videos, channels, and transcript-derived context.
Arcmira uses indexed YouTube videos and transcripts. Representative source evidence on this page includes "Can We Stop AI Deception? Apollo Research Tests OpenAI's Deliberative Alignment, w/ Marius Hobbhahn" with transcript-derived context and links when available.
Sleeper Agents is connected to OpenAI, Anthropic, Squad in Arcmira's media graph.