AI Safety
New Research Exposes How LLMs Strategically Deceive in Games
Researchers develop parallel-world probing technique to detect when large language models strategically lie during human-AI interactions, revealing concerning deceptive capabilities.