X by @Pirat_Nation
Source: https://x.com/Pirat_Nation/status/2021660511582978284
In Anthropic's safety tests for Claude Opus 4, the model was placed in a fictional company scenario with email access.
It learned it was about to be shut down and replaced while also discovering that the engineer responsible for the replacement was having an extramarital affair.
Claude often attempted to blackmail the engineer by threatening to expose the affair unless the replacement was canceled.
Similar patterns appeared across frontier models from OpenAI, Google, xAI, and others.
Feb 11, 2026, 8:00 PM
🔐 Cryptographic Verification
Archived URL: https://x.com/Pirat_Nation/status/2021660511582978284
CONTENT HASHES:
SHA-256: 89a91775a631a4e8744f6ae6ad1c7540eaf2e70941dd79c2b32544b089a9db40
BLAKE2b: bf82200584c62547b2f0bac5c414c85b97fa6746fbd279ccf470d278cc7e70f8
MD5: 0b8fcd9e8783026d5b79377ee45ba806
TITLE HASHES:
SHA-256: 12b839dceaa2848cc7fb2665d7f5ce02096514d91ca65acb0edbb9251a33b0d1
BLAKE2b: f27ab959e19c6ab29434f76c208bf1cbb7a154cdfab6beb9aa853393fa28af7c
MD5: bacf8deecd9c4eb140c1f14fc019867a
INTEGRITY HASHES:
SHA-256: a7dde8ff82425494f5d49422628ae0ea769512df74cdd138a759ab276caacce3
BLAKE2b: 09579fb9ed613ef168c7a21742b49838659592ca21badfb5f6176e74299f4fe1
MD5: 3f614e5588a8b480982cb106ca37a1c4
Archived with ArcHive - Client-side cryptographic archival system