LLM research
Benchmark Leakage Trap Exposes Trust Issues in LLM Recommenders
New research shows how benchmark data contamination undermines the reliability of LLM-based recommendation systems, raising questions about the integrity of AI evaluation.