TD algorithm for the variance of return and mean-variance reinforcement learning

Makoto Sato, Hajime Kimura, Shibenobu Kobayashi

Research output: Contribution to journalArticlepeer-review

20 Citations (Scopus)

Fingerprint

Dive into the research topics of 'TD algorithm for the variance of return and mean-variance reinforcement learning'. Together they form a unique fingerprint.

Computer Science

Psychology

Neuroscience