INFTY - An integrated OCR system for mathematical documents

Masakazu Suzuki, Fumikazu Tamari, Ryoji Fukuda, Seiichi Uchida, Toshihiro Kanahori

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

95 Citations (Scopus)

Abstract

An integrated OCR system for mathematical documents, called INFTY, is presented. INFTY consists of four procedures, i.e., layout analysis, character recognition, structure analysis of mathematical expressions, and manual error correction. In those procedures, several novel techniques are utilized for better recognition performance. Experimental results on about 500 pages of mathematical documents showed high character recognition rates on both mathematical expressions and ordinary texts, and sufficient performance on the structure analysis of the mathematical expressions.

Original language: English
Title of host publication: Proceedings of the 2003 ACM Symposium on Document Engineering
Editors: C. Vanoirbeek, C. Roisin, E. Munson
Pages: 95-104
Number of pages: 10
Publication status: Published - Dec 1 2003
Event: Proceedings of the 2003 ACM Symposium on Document Engineering - Grenoble, France
Duration: Nov 20 2003 to Nov 22 2003

All Science Journal Classification (ASJC) codes

  • Engineering(all)

Cite this

Suzuki, M., Tamari, F., Fukuda, R., Uchida, S., & Kanahori, T. (2003). INFTY - An integrated OCR system for mathematical documents. In C. Vanoirbeek, C. Roisin, & E. Munson (Eds.), Proceedings of the 2003 ACM Symposium on Document Engineering (pp. 95-104).