"They screwed us": Personality clashes sent Anthropic's models offline

Simon WillisonJun 15, 2026 at 10:57 PM7.0/10

Personality conflicts reportedly contributed to the US government's decision to take Anthropic's AI models offline, with key executives meeting with the Commerce Department. The situation highlights ongoing challenges in AI safety and the difficulty of creating jailbreak-proof models. The article questions whether Anthropic has adequately addressed known adversarial attacks on their systems.

Background

Anthropic is an AI research company that develops large language models like Claude, with a focus on AI safety and alignment. The company has been working on constitutional AI approaches to make models more robust against adversarial attacks.

Source: Simon Willison
Published: Jun 15, 2026 at 10:57 PM
Score: 7.0 / 10

Read Original →