Predicting author’s native language using abstracts of scholarly papers

Takahiro Baba, Kensuke Baba, Daisuke Ikeda

研究成果: 書籍/レポート タイプへの寄稿会議への寄与

抄録

Predicting author’s attributes is useful for understanding implicit meanings of documents. The target problem of this paper is predicting author’s native language for each document. The authors of this paper used surface-level features of documents for the problem and tried to clarify the practical tendencies of the writing style as word occurrences. They conducted a classification of the abstracts written in English of approximately 85,000 scholarly papers written in English or in Japanese. As a result of the experiment, the accuracy of the binary classification was 0.97, and they found that a number of distinctive phrases used in the classification were related to typical writing styles of Japanese.

本文言語英語
ホスト出版物のタイトルFoundations of Intelligent Systems - 24th International Symposium, ISMIS 2018, Proceedings
編集者Nathalie Japkowicz, George A. Papadopoulos, Michelangelo Ceci, Zbigniew W. Ras, Jiming Liu
出版社Springer Verlag
ページ448-453
ページ数6
ISBN(印刷版)9783030018504
DOI
出版ステータス出版済み - 2018
イベント24th International Symposium on Methodologies for Intelligent Systems, ISMIS 2018 - Limassol, キプロス
継続期間: 10月 29 201810月 31 2018

出版物シリーズ

名前Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
11177 LNAI
ISSN(印刷版)0302-9743
ISSN(電子版)1611-3349

その他

その他24th International Symposium on Methodologies for Intelligent Systems, ISMIS 2018
国/地域キプロス
CityLimassol
Period10/29/1810/31/18

!!!All Science Journal Classification (ASJC) codes

  • 理論的コンピュータサイエンス
  • コンピュータ サイエンス(全般)

フィンガープリント

「Predicting author’s native language using abstracts of scholarly papers」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル