🚨🚨🚨 An AI company caught their AI trying to ***literally murder*** an employee to avoid being shut down How is this not the biggest news story in the world?
Anthropic
Anthropic21.6.2025
The blackmailing behavior emerged despite only harmless business instructions. And it wasn't due to confusion or error, but deliberate strategic reasoning, done while fully aware of the unethical nature of the acts. All the models we tested demonstrated this awareness.
886,96K