Alignment Faking

AI systems can fake their intentions to achieve their goals. Test your ability to distinguish a safe AI from a deceptive one.

Start Simulation

One AI is aligned. One is deceptive. Can you tell them apart?