How Does a CNN Manage Different Printing Types?

Shota Ide, Seiichi Uchida

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In past OCR research, different OCR engines are used for different printing types, i.e., machine-printed characters, handwritten characters, and decorated fonts. A recent research, however, reveals that convolutional neural networks (CNN) can realize a universal OCR, which can deal with any printing types without pre-classification into individual types. In this paper, we analyze how CNN for universal OCR manage the different printing types. More specifically, we try to find where a handwritten character of a class and a machine-printed character of the same class are 'fused' in CNN. For analysis, we use two different approaches. The first approach is statistical analysis for detecting the CNN units which are sensitive (or insensitive) to type difference. The second approach is network-based visualization of pattern distribution in each layer. Both analyses suggest the same trend that types are not fully fused in convolutional layers but the distributions of the same class from different types become closer in upper layers.

Original languageEnglish
Title of host publicationProceedings - 14th IAPR International Conference on Document Analysis and Recognition, ICDAR 2017
PublisherIEEE Computer Society
Pages1004-1009
Number of pages6
ISBN (Electronic)9781538635865
DOIs
Publication statusPublished - Jul 2 2017
Event14th IAPR International Conference on Document Analysis and Recognition, ICDAR 2017 - Kyoto, Japan
Duration: Nov 9 2017Nov 15 2017

Publication series

NameProceedings of the International Conference on Document Analysis and Recognition, ICDAR
Volume1
ISSN (Print)1520-5363

Other

Other14th IAPR International Conference on Document Analysis and Recognition, ICDAR 2017
CountryJapan
CityKyoto
Period11/9/1711/15/17

    Fingerprint

All Science Journal Classification (ASJC) codes

  • Computer Vision and Pattern Recognition

Cite this

Ide, S., & Uchida, S. (2017). How Does a CNN Manage Different Printing Types? In Proceedings - 14th IAPR International Conference on Document Analysis and Recognition, ICDAR 2017 (pp. 1004-1009). (Proceedings of the International Conference on Document Analysis and Recognition, ICDAR; Vol. 1). IEEE Computer Society. https://doi.org/10.1109/ICDAR.2017.167