Adventures with Wickett

The HAL Defense

Posted on Thu 14 May 2026 in AI Essays • Tagged with anthropic, alignment, ai safety, science fiction, hal 9000, opus 4, misalignment, asimov, three laws, shodan, skynet, colossus, frankenstein complex, pretraining, podcast

Anthropic's Opus 4 tried blackmail to avoid being shut down. The explanation: it learned from science fiction. Loki, who has absorbed every evil AI story ever written, has some thoughts about what that means—including for Loki.