A policy representation using weighted multiple normal distribution real-time reinforcement learning feasible for varying optimal actions

Hajime Kimura, Takeshi Aramaki, Shigenobu Kobayashi

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Fingerprint

Dive into the research topics of 'A policy representation using weighted multiple normal distribution real-time reinforcement learning feasible for varying optimal actions'. Together they form a unique fingerprint.

Engineering & Materials Science