Morphological analysis for unsegmented languages using recurrent neural network language model

Hajime Morita, Daisuke Kawahara, Sadao Kurohashi

研究成果: Chapter in Book/Report/Conference proceedingConference contribution

40 被引用数 (Scopus)

抄録

We present a new morphological analysis model that considers semantic plausibility of word sequences by using a recurrent neural network language model (RNNLM). In unsegmented languages, since language models are learned from automatically segmented texts and inevitably contain errors, it is not apparent that conventional language models contribute to morphological analysis. To solve this problem, we do not use language models based on raw word sequences but use a semantically generalized language model, RNNLM, in morphological analysis. In our experiments on two Japanese corpora, our proposed model significantly outperformed baseline models. This result indicates the effectiveness of RNNLM in morphological analysis.

本文言語英語
ホスト出版物のタイトルConference Proceedings - EMNLP 2015
ホスト出版物のサブタイトルConference on Empirical Methods in Natural Language Processing
出版社Association for Computational Linguistics (ACL)
ページ2292-2297
ページ数6
ISBN(電子版)9781941643327
DOI
出版ステータス出版済み - 2015
イベントConference on Empirical Methods in Natural Language Processing, EMNLP 2015 - Lisbon, ポルトガル
継続期間: 9 17 20159 21 2015

出版物シリーズ

名前Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing

その他

その他Conference on Empirical Methods in Natural Language Processing, EMNLP 2015
国/地域ポルトガル
CityLisbon
Period9/17/159/21/15

All Science Journal Classification (ASJC) codes

  • 計算理論と計算数学
  • コンピュータ サイエンスの応用
  • 情報システム

フィンガープリント

「Morphological analysis for unsegmented languages using recurrent neural network language model」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル