LLM-as-Judge Geometry: Consensus Isn't Human Alignment
New research challenges the assumption that agreement between LLM judges signals human alignment. The geometry of LLM evaluation reveals systematic biases that affect how synthetic content and AI outputs are assessed.