Syntactic detection and correction of misrecognitions in mathematical OCR

Akio Fujiyoshi, Masakazu Suzuki, Seiichi Uchida

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

This paper proposes a syntactic method for detection and correction of misrecognized mathematical formulae for a practical mathematical OCR system. Linear monadic context-free tree grammar (LM-CFTG) is employed as a formal framework to define syntactically acceptable mathematical formulae. For the purpose of practical evaluation, a verification system is developed, and the effectiveness of the method is demonstrated by using the ground-truthed mathematical document database InftyCDB-1 and a misrecognition database newly constructed for this study. A satisfactory number of misrecognitions are detected and delivered to the correction process.

Original languageEnglish
Title of host publicationICDAR2009 - 10th International Conference on Document Analysis and Recognition
Pages1360-1364
Number of pages5
DOIs
Publication statusPublished - Dec 10 2009
EventICDAR2009 - 10th International Conference on Document Analysis and Recognition - Barcelona, Spain
Duration: Jul 26 2009Jul 29 2009

Publication series

NameProceedings of the International Conference on Document Analysis and Recognition, ICDAR
ISSN (Print)1520-5363

Other

OtherICDAR2009 - 10th International Conference on Document Analysis and Recognition
CountrySpain
CityBarcelona
Period7/26/097/29/09

Fingerprint

Optical character recognition
Syntactics

All Science Journal Classification (ASJC) codes

  • Computer Vision and Pattern Recognition

Cite this

Fujiyoshi, A., Suzuki, M., & Uchida, S. (2009). Syntactic detection and correction of misrecognitions in mathematical OCR. In ICDAR2009 - 10th International Conference on Document Analysis and Recognition (pp. 1360-1364). [5277755] (Proceedings of the International Conference on Document Analysis and Recognition, ICDAR). https://doi.org/10.1109/ICDAR.2009.150

Syntactic detection and correction of misrecognitions in mathematical OCR. / Fujiyoshi, Akio; Suzuki, Masakazu; Uchida, Seiichi.

ICDAR2009 - 10th International Conference on Document Analysis and Recognition. 2009. p. 1360-1364 5277755 (Proceedings of the International Conference on Document Analysis and Recognition, ICDAR).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Fujiyoshi, A, Suzuki, M & Uchida, S 2009, Syntactic detection and correction of misrecognitions in mathematical OCR. in ICDAR2009 - 10th International Conference on Document Analysis and Recognition., 5277755, Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, pp. 1360-1364, ICDAR2009 - 10th International Conference on Document Analysis and Recognition, Barcelona, Spain, 7/26/09. https://doi.org/10.1109/ICDAR.2009.150
Fujiyoshi A, Suzuki M, Uchida S. Syntactic detection and correction of misrecognitions in mathematical OCR. In ICDAR2009 - 10th International Conference on Document Analysis and Recognition. 2009. p. 1360-1364. 5277755. (Proceedings of the International Conference on Document Analysis and Recognition, ICDAR). https://doi.org/10.1109/ICDAR.2009.150
Fujiyoshi, Akio ; Suzuki, Masakazu ; Uchida, Seiichi. / Syntactic detection and correction of misrecognitions in mathematical OCR. ICDAR2009 - 10th International Conference on Document Analysis and Recognition. 2009. pp. 1360-1364 (Proceedings of the International Conference on Document Analysis and Recognition, ICDAR).
@inproceedings{bcdc6cb6148d4faf8632004e0547790a,
title = "Syntactic detection and correction of misrecognitions in mathematical OCR",
abstract = "This paper proposes a syntactic method for detection and correction of misrecognized mathematical formulae for a practical mathematical OCR system. Linear monadic context-free tree grammar (LM-CFTG) is employed as a formal framework to define syntactically acceptable mathematical formulae. For the purpose of practical evaluation, a verification system is developed, and the effectiveness of the method is demonstrated by using the ground-truthed mathematical document database InftyCDB-1 and a misrecognition database newly constructed for this study. A satisfactory number of misrecognitions are detected and delivered to the correction process.",
author = "Akio Fujiyoshi and Masakazu Suzuki and Seiichi Uchida",
year = "2009",
month = "12",
day = "10",
doi = "10.1109/ICDAR.2009.150",
language = "English",
isbn = "9780769537252",
series = "Proceedings of the International Conference on Document Analysis and Recognition, ICDAR",
pages = "1360--1364",
booktitle = "ICDAR2009 - 10th International Conference on Document Analysis and Recognition",

}

TY - GEN

T1 - Syntactic detection and correction of misrecognitions in mathematical OCR

AU - Fujiyoshi, Akio

AU - Suzuki, Masakazu

AU - Uchida, Seiichi

PY - 2009/12/10

Y1 - 2009/12/10

N2 - This paper proposes a syntactic method for detection and correction of misrecognized mathematical formulae for a practical mathematical OCR system. Linear monadic context-free tree grammar (LM-CFTG) is employed as a formal framework to define syntactically acceptable mathematical formulae. For the purpose of practical evaluation, a verification system is developed, and the effectiveness of the method is demonstrated by using the ground-truthed mathematical document database InftyCDB-1 and a misrecognition database newly constructed for this study. A satisfactory number of misrecognitions are detected and delivered to the correction process.

AB - This paper proposes a syntactic method for detection and correction of misrecognized mathematical formulae for a practical mathematical OCR system. Linear monadic context-free tree grammar (LM-CFTG) is employed as a formal framework to define syntactically acceptable mathematical formulae. For the purpose of practical evaluation, a verification system is developed, and the effectiveness of the method is demonstrated by using the ground-truthed mathematical document database InftyCDB-1 and a misrecognition database newly constructed for this study. A satisfactory number of misrecognitions are detected and delivered to the correction process.

UR - http://www.scopus.com/inward/record.url?scp=71249135175&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=71249135175&partnerID=8YFLogxK

U2 - 10.1109/ICDAR.2009.150

DO - 10.1109/ICDAR.2009.150

M3 - Conference contribution

AN - SCOPUS:71249135175

SN - 9780769537252

T3 - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR

SP - 1360

EP - 1364

BT - ICDAR2009 - 10th International Conference on Document Analysis and Recognition

ER -