May 11, 2026 | Why Claude Tried to Blackmail: Internet Fiction Taught It the Worst | Tags: Anthropic Claude Alignment Safety Training
May 9, 2026 | How Anthropic taught Claude to stop blackmailing people | Tags: Anthropic Claude Alignment Research Safety
April 19, 2026 | Anthropic's AI Agents Just Outperformed Its Own Alignment Researchers | Tags: Anthropic Research Alignment AI Agents Claude
April 12, 2026 | Anthropic Invited Christians to an AI Summit: 'How Do We Make Sure Claude Behaves?' | Tags: Anthropic Claude Ethics Alignment Society