Syntactic detection and correction of misrecognitions in mathematical OCR

Akio Fujiyoshi, Masakazu Suzuki, Seiichi Uchida

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

3 Citations (Scopus)

Abstract

This paper proposes a syntactic method for detection and correction of misrecognized mathematical formulae for a practical mathematical OCR system. Linear monadic context-free tree grammar (LM-CFTG) is employed as a formal framework to define syntactically acceptable mathematical formulae. For the purpose of practical evaluation, a verification system is developed, and the effectiveness of the method is demonstrated by using the ground-truthed mathematical document database InftyCDB-1 and a misrecognition database newly constructed for this study. A satisfactory number of misrecognitions are detected and delivered to the correction process.

Original language: English
Title of host publication: ICDAR2009 - 10th International Conference on Document Analysis and Recognition
Pages: 1360-1364
Number of pages: 5
Publication status: Published - 2009
Event: ICDAR2009 - 10th International Conference on Document Analysis and Recognition - Barcelona, Spain
Duration: Jul 26 2009 to Jul 29 2009

Publication series

Name: Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
ISSN (Print): 1520-5363

Other

Other: ICDAR2009 - 10th International Conference on Document Analysis and Recognition
Country: Spain
City: Barcelona
Period: 7/26/09 to 7/29/09

All Science Journal Classification (ASJC) codes

  • Computer Vision and Pattern Recognition
