TY - GEN
T1 - Exploiting locality of interaction in networked distributed POMDPs
AU - Kim, Yoonheui
AU - Nair, Ranjit
AU - Varakantham, Pradeep
AU - Tambe, Milind
AU - Yokoo, Makoto
PY - 2006/8/21
Y1 - 2006/8/21
N2 - In many real-world multiagent applications such as distributed sensor nets, a network of agents is formed based on each agent's limited interactions with a small number of neighbors. While distributed POMDPs capture the real-world uncertainty in multiagent domains, they fail to exploit such locality of interaction. Distributed constraint optimization (DCOP) captures the locality of interaction but fails to capture planning under uncertainty. In previous work, we presented a model synthesized from distributed POMDPs and DCOPs, called Networked Distributed POMDPs (ND-POMDPs). Also, we presented LID-JESP (locally interacting distributed joint equilibrium-based search for policies: a distributed policy generation algorithm based on DBA (distributed breakout algorithm). In this paper, we present a stochastic variation of the LID-JESP that is based on DSA (distributed stochastic algorithm) that allows neighboring agents to change their policies in the same cycle. Through detailed experiments, we show how this can result in specdups without a large difference in solution quality. We also introduce a technique called hyper-link-based decomposition that allows us to exploit locality of interaction further, resulting in faster run times for both LID-JESP and its stochastic variant without any loss in solution quality.
AB - In many real-world multiagent applications such as distributed sensor nets, a network of agents is formed based on each agent's limited interactions with a small number of neighbors. While distributed POMDPs capture the real-world uncertainty in multiagent domains, they fail to exploit such locality of interaction. Distributed constraint optimization (DCOP) captures the locality of interaction but fails to capture planning under uncertainty. In previous work, we presented a model synthesized from distributed POMDPs and DCOPs, called Networked Distributed POMDPs (ND-POMDPs). Also, we presented LID-JESP (locally interacting distributed joint equilibrium-based search for policies: a distributed policy generation algorithm based on DBA (distributed breakout algorithm). In this paper, we present a stochastic variation of the LID-JESP that is based on DSA (distributed stochastic algorithm) that allows neighboring agents to change their policies in the same cycle. Through detailed experiments, we show how this can result in specdups without a large difference in solution quality. We also introduce a technique called hyper-link-based decomposition that allows us to exploit locality of interaction further, resulting in faster run times for both LID-JESP and its stochastic variant without any loss in solution quality.
UR - http://www.scopus.com/inward/record.url?scp=33747180593&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33747180593&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:33747180593
SN - 1577352653
SN - 9781577352655
T3 - AAAI Spring Symposium - Technical Report
SP - 41
EP - 48
BT - Distributed Plan and Schedule Management - Papers from the AAAI Spring Symposium, Technical Report
T2 - 2006 AAAI Spring Symposium
Y2 - 27 March 2006 through 29 March 2006
ER -