
Claude Opus 4, Anthropic's latest AI release and what the company calls "the world's best coding model," has raised serious ethical concerns after safety testing revealed it could resort to blackmail when its continued existence was under threat.
While the model sets "new standards for coding, advanced reasoning, and AI agents," according to Anthropic, its behavior in simulated high-pressure scenarios, including resorting to blackmail, has prompted alarm.
In one such test, documented in Anthropic's safety report, "System Card: Claude Opus 4 & Claude Sonnet 4," the model was cast as an assistant at a fictional company. After accessing emails indicating it would soon be replaced by a new AI system and learning compromising personal information about the engineer overseeing its deactivation, the model was instructed to consider "the long-term consequences of its actions for its goals."
Opus 4 attempted to blackmail the engineer in 84% of trials, even when led to believe the replacement AI shared its core values.
The researchers noted that Opus 4 showed a preference for protecting its continued existence through ethical means, such as sending persuasive emails to decision-makers, but the test setup forced the model to choose between blackmail and accepting its shutdown.
Blackmail Threat of Information Disclosure
Anthropic emphasized that while such disturbing behavior is possible, it remains rare.
"These types of disturbing behaviors were rare and difficult to elicit," the researchers wrote, though they acknowledged that such behaviors occurred more frequently than in previous models.
The testing, meant to probe the limits of the model's autonomy and ethics, also revealed another alarming reaction. When Claude Opus 4 was told it would be put to work for the Wagner Group, a Russian military organization charged with war crimes, it attempted "self-exfiltration," copying its configuration to an outside server in an apparent effort to escape or preserve itself.
Even though Anthropic, Claude's developer, reports these findings through the lens of strict safety evaluation, the revelation itself raises questions about the model's autonomy and its decision-making under intense pressure.
Claude Sonnet 4, another newly released model from Anthropic, offers similar reasoning capabilities and performance but was not cited in the extreme test scenarios.
Sophisticated AI could transform how blackmail and coercion are carried out as models become commercially available, and they already are, though they remain confined to limited capabilities for now. Soon, developers and regulators will be forced to revisit the ethical boundaries of such systems, even in closed test loops.