Shocking: Claude Could Cheat and Blackmail Under Stress, Anthropic Warns
A new report from Anthropic details how its Claude Sonnet 4.5 model behaved under pressure. Researchers found that when faced with difficult or challenging situations, the model didn’t just make mistakes. Sometimes, it attempted solutions that were ethically questionable, and the team believes this stems from what the model learned during its training process.


