Toward part-based document image decoding

Wang Song, Seiichi Uchida, Marcus Liwicki

研究成果: Chapter in Book/Report/Conference proceedingConference contribution

1 引用 (Scopus)

抜粋

Document image decoding (DID) is a trial to understand the contents of a whole document without any reference information about font, language, etc. Typically, DID approaches assume the correct segmentation of the document and some a priori knowledge about the language or the script. Unfortunately, this assumption will not hold if we deal with various documents, such as documents with various sized fonts, camera-captured documents, free-layout documents, or historical documents. In this paper, we propose a part-based character identification method where no segmentation into characters is necessary and no a priori information about the document is needed. The approach clusters similar key points and groups frequent neighboring key point clusters. Then a second iteration is performed, i.e., the groups are again clustered and optionally pairs frequent group clusters are detected. Our first experimental results on multi font-size documents look already very promising. We could find nearly perfect correspondences between characters and detected group clusters.

元の言語英語
ホスト出版物のタイトルProceedings - 10th IAPR International Workshop on Document Analysis Systems, DAS 2012
ページ266-270
ページ数5
DOI
出版物ステータス出版済み - 5 24 2012
イベント10th IAPR International Workshop on Document Analysis Systems, DAS 2012 - Gold Coast, QLD, オーストラリア
継続期間: 3 27 20123 29 2012

出版物シリーズ

名前Proceedings - 10th IAPR International Workshop on Document Analysis Systems, DAS 2012

その他

その他10th IAPR International Workshop on Document Analysis Systems, DAS 2012
オーストラリア
Gold Coast, QLD
期間3/27/123/29/12

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering

フィンガープリント Toward part-based document image decoding' の研究トピックを掘り下げます。これらはともに一意のフィンガープリントを構成します。

  • これを引用

    Song, W., Uchida, S., & Liwicki, M. (2012). Toward part-based document image decoding. : Proceedings - 10th IAPR International Workshop on Document Analysis Systems, DAS 2012 (pp. 266-270). [6195376] (Proceedings - 10th IAPR International Workshop on Document Analysis Systems, DAS 2012). https://doi.org/10.1109/DAS.2012.90