Multiagent planning with trembling-hand perfect equilibrium in multiagent POMDPs

Yuichi Yabu, Makoto Yokoo, Atsushi Iwasaki

研究成果: Chapter in Book/Report/Conference proceedingConference contribution

抄録

Multiagent Partially Observable Markov Decision Processes are a popular model of multiagent systems with uncertainty. Since the computational cost for finding an optimal joint policy is prohibitive, a Joint Equilibrium-based Search for Policies with Nash Equilibrium (JESP-NE) is proposed that finds a locally optimal joint policy in which each policy is a best response to other policies; i.e., the joint policy is a Nash equilibrium. One limitation of JESP-NE is that the quality of the obtained joint policy depends on the predefined default policy. More specifically, when finding a best response, if some observation have zero probabilities, JESP-NE uses this default policy. If the default policy is quite bad, JESP-NE tends to converge to a sub-optimal joint policy. In this paper, we propose a method that finds a locally optimal joint policy based on a concept called Trembling-hand Perfect Equilibrium (TPE). In finding a TPE, we assume that an agent might make a mistake in selecting its action with small probability. Thus, an observation with zero probability in JESP-NE will have non-zero probability. We no longer use the default policy. As a result, JESP-TPE can converge to a better joint policy than the JESP-NE, which we confirm this fact by experimental evaluations.

本文言語英語
ホスト出版物のタイトルAgent Computing and Multi-Agent Systems - 10th Pacific Rim International Conference on Multi-Agents, PRIMA 2007, Revised Papers
ページ13-24
ページ数12
DOI
出版ステータス出版済み - 2009
イベント10th Pacific Rim International Conference on Multi-Agents, PRIMA 2007 - Bangkok, タイ
継続期間: 11 21 200711 23 2007

出版物シリーズ

名前Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
5044 LNAI
ISSN(印刷版)0302-9743
ISSN(電子版)1611-3349

その他

その他10th Pacific Rim International Conference on Multi-Agents, PRIMA 2007
Countryタイ
CityBangkok
Period11/21/0711/23/07

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

フィンガープリント 「Multiagent planning with trembling-hand perfect equilibrium in multiagent POMDPs」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル