Mathematical Document Categorization with Structure of Mathematical Expressions

Tokinori Suzuki, Atsushi Fujii

Research output: Chapter in Book/Report/Conference proceeding (Conference contribution)

4 Citations (Scopus)

Abstract

A mathematical document is a document used for mathematical communication, for example, a math paper or a discussion in an online Q&A community. Mathematical document categorization (MDC) is the task of classifying mathematical documents into mathematical categories, e.g. probability theory and set theory. This task is important for supporting user search in today's widespread digital libraries and archiving services. Although mathematical expressions (ME) in a document can carry essential information, being central to communication especially in mathematical fields, methods for using ME in MDC are not yet mature. In this paper, we propose a classification method based on text combined with the structures of ME, which are assumed to reflect conventions and rules specific to a category. We also present document collections built for evaluating MDC systems, together with an investigation of their category settings and statistics. We present classification results showing that our proposed method outperforms existing methods with state-of-the-art ME modeling in terms of F-measure.

Original language: English
Title of host publication: 2017 ACM/IEEE Joint Conference on Digital Libraries, JCDL 2017
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781538638613
DOI
Publication status: Published - Jul 25 2017
Externally published: Yes
Event: 17th ACM/IEEE Joint Conference on Digital Libraries, JCDL 2017 - Toronto, Canada
Duration: Jun 19 2017 to Jun 23 2017

Publication series

Name: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries
ISSN (Print): 1552-5996

Conference

Conference: 17th ACM/IEEE Joint Conference on Digital Libraries, JCDL 2017
Country: Canada
City: Toronto
Period: 6/19/17 to 6/23/17

All Science Journal Classification (ASJC) codes

  • Engineering(all)
