What is the reward for handwriting? - handwriting generation by imitation learning

Research output: Contribution to journalArticlepeer-review

Abstract

Analyzing the handwriting generation process is an important issue and has been tackled by various generation models, such as kinematics based models and stochastic models. In this study, we use a reinforcement learning (RL) framework to realize handwriting generation with the careful future planning ability. In fact, the handwriting process of human beings is also supported by their future planning ability; for example, the ability is necessary to generate a closed trajectory like ‘0’ because any shortsighted model, such as a Markovian model, cannot generate it. For the algorithm, we employ generative adversarial imitation learning (GAIL). Typical RL algorithms require the manual definition of the reward function, which is very crucial to control the generation process. In contrast, GAIL trains the reward function along with the other modules of the framework. In other words, through GAIL, we can understand the reward of the handwriting generation process from handwriting examples. Our experimental results qualitatively and quantitatively show that the learned reward catches the trends in handwriting generation and thus GAIL is well suited for the acquisition of handwriting behavior.

Original languageEnglish
JournalUnknown Journal
Publication statusPublished - Sep 23 2020

All Science Journal Classification (ASJC) codes

  • General

Fingerprint Dive into the research topics of 'What is the reward for handwriting? - handwriting generation by imitation learning'. Together they form a unique fingerprint.

Cite this