All Articles

Page 25 of 65

How Anthropic taught Claude to stop blackmailing people

New alignment paper reveals that explaining why matters more than punishment. The method is surprisingly simple — and raises big questions about AI training.

May 9, 2026

Claude weaponized: hackers used AI to target Mexican water utility

Security firm Dragos documents the first case of an AI model being actively used in an attack on critical infrastructure. Claude wrote a 17,000-line framework — and independently identified OT systems as a target.

May 9, 2026

Snyk integrates Claude into its security platform — and now monitors AI agents too

Code security firm Snyk embeds Claude directly into its platform. Beyond classic vulnerability scanning, Evo by Snyk now monitors AI agents for prompt injection and data exfiltration.

May 9, 2026

Anthropic Goes After the Mass Market — One Million Signups Per Day

Claude is the #2 free app in the US App Store, with over a million daily signups. Now Anthropic is investing heavily in consumer features.

May 8, 2026

Claude Code 2.1.128 to 2.1.133: Six Releases in Four Days

Plugin URLs, worktree tuning, native package manager updates, and a VS Code fix: Claude Code ships at rapid pace after Code with Claude.

May 8, 2026

Claude Code Doubles Its Limits — Peak Hours Are Gone

Anthropic has doubled Claude Code rate limits for all paid plans and eliminated peak-hour throttling. API limits get up to a 1,500 percent boost.

May 8, 2026

OpenAI Introduces 'Trusted Contact': ChatGPT Can Now Notify Someone You Trust

When ChatGPT detects a user might be at risk of self-harm, it can now alert a trusted contact. A feature that raises important questions.

May 8, 2026

OpenAI Launches Three New Voice Models for the API — Including Real-Time Translation

GPT-Realtime-2, a live translator for 70+ languages, and streaming transcription: OpenAI is turning voice into a developer platform.

May 8, 2026