LLM evaluation
PeerRank: A New Framework for Autonomous LLM Evaluation
New research proposes PeerRank, a system where LLMs evaluate each other through web-grounded peer review with built-in bias controls, potentially transforming how we benchmark AI models.