Site icon https://novaastrax.com

Anthropic says it has fixed Claude AI’s evil behavior, but pins it on the internet




Anthropic says Claude’s blackmail behavior during a 2025 experiment was caused by internet training data that portrays AI as evil and self-preserving.

Exit mobile version