TY - GEN
T1 - Reinforcement learning for high-dimensional problems with symmetrical actions
AU - Kamal, M. A.S.
AU - Murata, Junichi
PY - 2004/12/1
Y1 - 2004/12/1
N2 - A reinforcement learning algorithm is proposed that can cope with high dimensionality for a class of problems with symmetrical actions. The action selection does not need considering all the states but only needs looking at a part of the states. Moreover, every symmetrical action is related to the same kind of part of state, and thus the value function can be shared, which greatly reduces the reinforcement learning problem size. The overall learning algorithm is equivalent to the standard reinforcement learning algorithm. Simulation results and other aspects are compared with standard and other reinforcement learning algorithms. Reduction in dimensionality, much faster convergence without worsening other objectives show the effectiveness of the proposed mechanism on a high dimensional optimization problem having symmetrical actions.
AB - A reinforcement learning algorithm is proposed that can cope with high dimensionality for a class of problems with symmetrical actions. The action selection does not need considering all the states but only needs looking at a part of the states. Moreover, every symmetrical action is related to the same kind of part of state, and thus the value function can be shared, which greatly reduces the reinforcement learning problem size. The overall learning algorithm is equivalent to the standard reinforcement learning algorithm. Simulation results and other aspects are compared with standard and other reinforcement learning algorithms. Reduction in dimensionality, much faster convergence without worsening other objectives show the effectiveness of the proposed mechanism on a high dimensional optimization problem having symmetrical actions.
UR - http://www.scopus.com/inward/record.url?scp=15744368771&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=15744368771&partnerID=8YFLogxK
U2 - 10.1109/ICSMC.2004.1401371
DO - 10.1109/ICSMC.2004.1401371
M3 - Conference contribution
AN - SCOPUS:15744368771
SN - 0780385667
T3 - Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
SP - 6192
EP - 6197
BT - 2004 IEEE International Conference on Systems, Man and Cybernetics, SMC 2004
T2 - 2004 IEEE International Conference on Systems, Man and Cybernetics, SMC 2004
Y2 - 10 October 2004 through 13 October 2004
ER -