LLM Security
AdvJudge-Zero: Adversarial Tokens Can Flip LLM Evaluator Decision
New research reveals how adversarial control tokens can manipulate LLM-as-a-Judge systems into completely reversing their binary decisions, exposing critical vulnerabilities in AI evaluation pipelines.