A method for finding multiple subgoals for reinforcement learning

Fuminori Ogihara, Junichi Murata

研究成果: Chapter in Book/Report/Conference proceedingConference contribution

1 被引用数 (Scopus)

抄録

This paper proposes a new method for discovering multiple subgoals automatically to accelerate reinforcement learning. There have been proposed several methods for discovery of subgoals. Some use state visiting frequencies in the trajectories that reach the goal state. When a state visiting frequency is very high, this state is regarded as the subgoal. Because this kind of methods need that the goal state is reached many times to collect trajectories, they take a long time for discovering subgoals. In addition, they cannot discover the potential subgoals that will become appropriate subgoals when the goal state changes. On the other hand, some methods identify subgoals by partitioning local state transition graphs. But this kind of methods require large calculation amounts. We propose a new method that solves the above drawbacks. The new method utilizes state visiting frequencies. But we collect trajectories that go through particular non-goal states selected at random. For each particular state, trajectories are collected. Most of the trajectories reach the particular state more easily that the goal state. Therefore, it is expected that we can discover subgoals quickly and discover multiple subgoals together.

本文言語英語
ホスト出版物のタイトルProceedings of the 16th International Symposium on Artificial Life and Robotics, AROB 16th'11
ページ804-807
ページ数4
出版ステータス出版済み - 12 1 2011
イベント16th International Symposium on Artificial Life and Robotics, AROB '11 - Beppu, Oita, 日本
継続期間: 1 27 20111 29 2011

出版物シリーズ

名前Proceedings of the 16th International Symposium on Artificial Life and Robotics, AROB 16th'11

その他

その他16th International Symposium on Artificial Life and Robotics, AROB '11
Country日本
CityBeppu, Oita
Period1/27/111/29/11

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Vision and Pattern Recognition
  • Human-Computer Interaction

フィンガープリント 「A method for finding multiple subgoals for reinforcement learning」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル