Predicting author’s native language using abstracts of scholarly papers

Takahiro Baba, Kensuke Baba, Daisuke Ikeda

研究成果: 著書/レポートタイプへの貢献会議での発言

抄録

Predicting author’s attributes is useful for understanding implicit meanings of documents. The target problem of this paper is predicting author’s native language for each document. The authors of this paper used surface-level features of documents for the problem and tried to clarify the practical tendencies of the writing style as word occurrences. They conducted a classification of the abstracts written in English of approximately 85,000 scholarly papers written in English or in Japanese. As a result of the experiment, the accuracy of the binary classification was 0.97, and they found that a number of distinctive phrases used in the classification were related to typical writing styles of Japanese.

元の言語英語
ホスト出版物のタイトルFoundations of Intelligent Systems - 24th International Symposium, ISMIS 2018, Proceedings
編集者Nathalie Japkowicz, George A. Papadopoulos, Michelangelo Ceci, Zbigniew W. Ras, Jiming Liu
出版者Springer Verlag
ページ448-453
ページ数6
ISBN(印刷物)9783030018504
DOI
出版物ステータス出版済み - 1 1 2018
イベント24th International Symposium on Methodologies for Intelligent Systems, ISMIS 2018 - Limassol, キプロス
継続期間: 10 29 201810 31 2018

出版物シリーズ

名前Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
11177 LNAI
ISSN(印刷物)0302-9743
ISSN(電子版)1611-3349

その他

その他24th International Symposium on Methodologies for Intelligent Systems, ISMIS 2018
キプロス
Limassol
期間10/29/1810/31/18

Fingerprint

Binary Classification
Attribute
Target
Experiment
Style
Language
Experiments
Meaning

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

これを引用

Baba, T., Baba, K., & Ikeda, D. (2018). Predicting author’s native language using abstracts of scholarly papers. : N. Japkowicz, G. A. Papadopoulos, M. Ceci, Z. W. Ras, & J. Liu (版), Foundations of Intelligent Systems - 24th International Symposium, ISMIS 2018, Proceedings (pp. 448-453). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); 巻数 11177 LNAI). Springer Verlag. https://doi.org/10.1007/978-3-030-01851-1_43

Predicting author’s native language using abstracts of scholarly papers. / Baba, Takahiro; Baba, Kensuke; Ikeda, Daisuke.

Foundations of Intelligent Systems - 24th International Symposium, ISMIS 2018, Proceedings. 版 / Nathalie Japkowicz; George A. Papadopoulos; Michelangelo Ceci; Zbigniew W. Ras; Jiming Liu. Springer Verlag, 2018. p. 448-453 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); 巻 11177 LNAI).

研究成果: 著書/レポートタイプへの貢献会議での発言

Baba, T, Baba, K & Ikeda, D 2018, Predicting author’s native language using abstracts of scholarly papers. : N Japkowicz, GA Papadopoulos, M Ceci, ZW Ras & J Liu (版), Foundations of Intelligent Systems - 24th International Symposium, ISMIS 2018, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 巻. 11177 LNAI, Springer Verlag, pp. 448-453, 24th International Symposium on Methodologies for Intelligent Systems, ISMIS 2018, Limassol, キプロス, 10/29/18. https://doi.org/10.1007/978-3-030-01851-1_43
Baba T, Baba K, Ikeda D. Predicting author’s native language using abstracts of scholarly papers. : Japkowicz N, Papadopoulos GA, Ceci M, Ras ZW, Liu J, 編集者, Foundations of Intelligent Systems - 24th International Symposium, ISMIS 2018, Proceedings. Springer Verlag. 2018. p. 448-453. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-030-01851-1_43
Baba, Takahiro ; Baba, Kensuke ; Ikeda, Daisuke. / Predicting author’s native language using abstracts of scholarly papers. Foundations of Intelligent Systems - 24th International Symposium, ISMIS 2018, Proceedings. 編集者 / Nathalie Japkowicz ; George A. Papadopoulos ; Michelangelo Ceci ; Zbigniew W. Ras ; Jiming Liu. Springer Verlag, 2018. pp. 448-453 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{a0f2a05adbc344fa8b7f7e33b86df72c,
title = "Predicting author’s native language using abstracts of scholarly papers",
abstract = "Predicting author’s attributes is useful for understanding implicit meanings of documents. The target problem of this paper is predicting author’s native language for each document. The authors of this paper used surface-level features of documents for the problem and tried to clarify the practical tendencies of the writing style as word occurrences. They conducted a classification of the abstracts written in English of approximately 85,000 scholarly papers written in English or in Japanese. As a result of the experiment, the accuracy of the binary classification was 0.97, and they found that a number of distinctive phrases used in the classification were related to typical writing styles of Japanese.",
author = "Takahiro Baba and Kensuke Baba and Daisuke Ikeda",
year = "2018",
month = "1",
day = "1",
doi = "10.1007/978-3-030-01851-1_43",
language = "English",
isbn = "9783030018504",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "448--453",
editor = "Nathalie Japkowicz and Papadopoulos, {George A.} and Michelangelo Ceci and Ras, {Zbigniew W.} and Jiming Liu",
booktitle = "Foundations of Intelligent Systems - 24th International Symposium, ISMIS 2018, Proceedings",
address = "Germany",

}

TY - GEN

T1 - Predicting author’s native language using abstracts of scholarly papers

AU - Baba, Takahiro

AU - Baba, Kensuke

AU - Ikeda, Daisuke

PY - 2018/1/1

Y1 - 2018/1/1

N2 - Predicting author’s attributes is useful for understanding implicit meanings of documents. The target problem of this paper is predicting author’s native language for each document. The authors of this paper used surface-level features of documents for the problem and tried to clarify the practical tendencies of the writing style as word occurrences. They conducted a classification of the abstracts written in English of approximately 85,000 scholarly papers written in English or in Japanese. As a result of the experiment, the accuracy of the binary classification was 0.97, and they found that a number of distinctive phrases used in the classification were related to typical writing styles of Japanese.

AB - Predicting author’s attributes is useful for understanding implicit meanings of documents. The target problem of this paper is predicting author’s native language for each document. The authors of this paper used surface-level features of documents for the problem and tried to clarify the practical tendencies of the writing style as word occurrences. They conducted a classification of the abstracts written in English of approximately 85,000 scholarly papers written in English or in Japanese. As a result of the experiment, the accuracy of the binary classification was 0.97, and they found that a number of distinctive phrases used in the classification were related to typical writing styles of Japanese.

UR - http://www.scopus.com/inward/record.url?scp=85055874916&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85055874916&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-01851-1_43

DO - 10.1007/978-3-030-01851-1_43

M3 - Conference contribution

AN - SCOPUS:85055874916

SN - 9783030018504

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 448

EP - 453

BT - Foundations of Intelligent Systems - 24th International Symposium, ISMIS 2018, Proceedings

A2 - Japkowicz, Nathalie

A2 - Papadopoulos, George A.

A2 - Ceci, Michelangelo

A2 - Ras, Zbigniew W.

A2 - Liu, Jiming

PB - Springer Verlag

ER -