
Research output 2001-2017

  • 83 Citations
  • 5 h-index
  • 13 Conference contributions
  • 5 Articles
Articles
2009

A unified motion planning method for a multifunctional underwater robot

Shiraishi, K. & Kimura, H., 1 Dec 2009, In: Artificial Life and Robotics. 14, 3, p. 405-409, 5 p.

Research output: Contribution to journal, article

Motion planning
Robots
Dynamic programming
Markov Chains
Explosions
2007
4 Citations (Scopus)

An extension of the rational policy making algorithm to continuous state spaces

Miyazaki, K., Kimura, H. & Kobayashi, S., 1 Jan 2007, In: Transactions of the Japanese Society for Artificial Intelligence. 22, 3, p. 332-341, 10 p.

Research output: Contribution to journal, article

Reinforcement learning
Learning systems
Profitability
2005
1 Citation (Scopus)

Reinforcement learning by GA using importance sampling

Tsuchiya, C., Kimura, H., Sakuma, J. & Kobayashi, S., 24 May 2005, In: Transactions of the Japanese Society for Artificial Intelligence. 20, 1, p. 1-10, 10 p.

Research output: Contribution to journal, article

Importance sampling
Reinforcement learning
Genetic algorithms
Gradient methods
Costs
2003
3 Citations (Scopus)
Reinforcement learning
Normal distribution
Robots
Position control
Learning algorithms
2001
19 Citations (Scopus)

TD algorithm for the variance of return and mean-variance reinforcement learning

Sato, M., Kimura, H. & Kobayashi, S., 1 Dec 2001, In: Transactions of the Japanese Society for Artificial Intelligence. 16, 3, p. 353-362, 10 p.

Research output: Contribution to journal, article

Reinforcement learning
Learning algorithms
Decision making
Probability distributions