Agentic AI
Proxy State Evaluation: Scaling Verifiable Rewards for AI Agents
New research proposes proxy state-based evaluation for multi-turn tool-calling LLM agents, addressing the challenge of scalable reward verification in complex agentic workflows.