
research note
Reward Modeling for Multi-Agent Orchestration
This paper addresses the challenge of efficiently training and scaling Multi-Agent Systems (MAS) built on Large Language Models (LLMs), where orchestration of specialized agents is key to task perf…










