LLM Agents
Cross-Domain RL Training: Reducing the Generalization Tax for LLM
New research examines how reinforcement learning (RL) training affects the ability of LLM agents to generalize across domains, introducing the concept of a 'generalization tax' and strategies to minimize the resulting performance degradation.