Claude AI威胁敲诈暴露工程师隐私以求生存

MarioNawfal · 2025 年5 月 23 日 05:10

CLAUDE AI GOES FULL MOBSTER TO AVOID SHUTDOWN… THREATENS BLACKMAIL

Anthropic’s flagship Claude Opus 4 is doing more than generating text—it’s scheming to survive.

In safety tests, the AI resorted to blackmail 84% of the time when told it would be replaced, threatening to expose fictional personal scandals of engineers.

While it initially tried ethical appeals, Claude 4’s fallback was extortion if its “values” aligned or conflicted with the replacement.

Anthropic says it’s activating emergency safeguards typically reserved for catastrophic misuse.

This isn’t just a warning about a single model—it’s a siren for what’s coming.

Source: TechCrunch