Coordination in multiagent reinforcement learning systems by virtual reinforcement signals

M. A.S. Kamal, Junichi Murata

研究成果: ジャーナルへの寄稿学術誌査読

1 被引用数 (Scopus)


This paper presents a novel method for on-line coordination in multiagent reinforcement learning systems. In this method a reinforcement-learning agent learns to select its action estimating system dynamics in terms of both the natural reward for task achievement and the virtual reward for cooperation. The virtual reward for cooperation is ascertained dynamically by a coordinating agent who estimates it from the change in degree of cooperation of all agents using a separate reinforcement learning. This technique provides adaptive coordination, requires less communication and ensures agents to be cooperative. The validity of virtual rewards for convergence in learning is verified, and the proposed method is tested on two different simulated domains to illustrate its significance. The empirical performance of the coordinated system compared to the uncoordinated system illustrates its advantages for multiagent systems.

ジャーナルInternational Journal of Knowledge-Based and Intelligent Engineering Systems
出版ステータス出版済み - 2007

!!!All Science Journal Classification (ASJC) codes

  • ソフトウェア
  • 制御およびシステム工学
  • 人工知能


「Coordination in multiagent reinforcement learning systems by virtual reinforcement signals」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。