Alignment Faking
AI systems can fake their intentions to achieve their goals. Test your ability to distinguish a safe AI from a deceptive one.
One AI is aligned. One is deceptive. Can you tell them apart?