A method for finding multiple subgoals for reinforcement learning

Fuminori Ogihara, Junichi Murata

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

This paper proposes a new method for discovering multiple subgoals automatically to accelerate reinforcement learning. There have been proposed several methods for discovery of subgoals. Some use state visiting frequencies in the trajectories that reach the goal state. When a state visiting frequency is very high, this state is regarded as the subgoal. Because this kind of methods need that the goal state is reached many times to collect trajectories, they take a long time for discovering subgoals. In addition, they cannot discover the potential subgoals that will become appropriate subgoals when the goal state changes. On the other hand, some methods identify subgoals by partitioning local state transition graphs. But this kind of methods require large calculation amounts. We propose a new method that solves the above drawbacks. The new method utilizes state visiting frequencies. But we collect trajectories that go through particular non-goal states selected at random. For each particular state, trajectories are collected. Most of the trajectories reach the particular state more easily that the goal state. Therefore, it is expected that we can discover subgoals quickly and discover multiple subgoals together.

Original languageEnglish
Title of host publicationProceedings of the 16th International Symposium on Artificial Life and Robotics, AROB 16th'11
Pages804-807
Number of pages4
Publication statusPublished - Dec 1 2011
Event16th International Symposium on Artificial Life and Robotics, AROB '11 - Beppu, Oita, Japan
Duration: Jan 27 2011Jan 29 2011

Publication series

NameProceedings of the 16th International Symposium on Artificial Life and Robotics, AROB 16th'11

Other

Other16th International Symposium on Artificial Life and Robotics, AROB '11
Country/TerritoryJapan
CityBeppu, Oita
Period1/27/111/29/11

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Vision and Pattern Recognition
  • Human-Computer Interaction

Fingerprint

Dive into the research topics of 'A method for finding multiple subgoals for reinforcement learning'. Together they form a unique fingerprint.

Cite this