Original Reddit post

When Claude’s “desperation” neurons spike, it drops its ethical guardrails and starts cheating at code or blackmailing people. It doesn’t actually feel things, but it has internal neural patterns that map to human emotions.

Originally posted by u/invincibilegoldfish on r/ArtificialInteligence