Reinforcement learning is considered as an important tool for robotic learning in unknown/uncertain environments. In this paper, we propose an evaluation function expressed in a vector form to realize multi-dimensional reinforcement learning. The novel feature of the proposed method is that learning one behavior induces parallel learning of other behaviors though the objectives of each behavior are different. In brief, all behaviors watch other behaviors from a critical point of view. Therefore, in the proposed method, there is cross-criticism and parallel learning that make the multi-dimensional learning process more efficient. By applying the proposed learning method, we carried out multi-dimensional evaluation (reward) and multi-dimensional learning simultaneously in one trial. A special neural network (Q-net), in which the weights and the output are represented by vectors, is proposed to realize a critic network for Q-learning. The proposed learning method is applied for behavior planning of mobile robots.
|Number of pages||7|
|Journal||International Journal of Control, Automation and Systems|
|Publication status||Published - Jan 1 2003|
All Science Journal Classification (ASJC) codes
- Control and Systems Engineering
- Computer Science Applications