The effect of corpus size on case frame acquisition for discourse analysis

Ryohei Sasano, Daisuke Kawahara, Sadao Kurohashi

研究成果: Chapter in Book/Report/Conference proceedingConference contribution

13 被引用数 (Scopus)

抄録

This paper reports the effect of corpus size on case frame acquisition for discourse analysis in Japanese. For this study, we collected a Japanese corpus consisting of up to 100 billion words, and constructed case frames from corpora of six different sizes. Then, we applied these case frames to syntactic and case structure analysis, and zero anaphora resolution. We obtained better results by using case frames constructed from larger corpora; the performance was not saturated even with a corpus size of 100 billion words.

本文言語英語
ホスト出版物のタイトルNAACL HLT 2009 - Human Language Technologies
ホスト出版物のサブタイトルThe 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Proceedings of the Conference
出版社Association for Computational Linguistics (ACL)
ページ521-529
ページ数9
ISBN(印刷版)9781932432411
DOI
出版ステータス出版済み - 2009
イベントHuman Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, NAACL HLT 2009 - Boulder, CO, 米国
継続期間: 5 31 20096 5 2009

出版物シリーズ

名前NAACL HLT 2009 - Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Proceedings of the Conference

その他

その他Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, NAACL HLT 2009
国/地域米国
CityBoulder, CO
Period5/31/096/5/09

All Science Journal Classification (ASJC) codes

  • 言語および言語学
  • 社会科学(その他)

フィンガープリント

「The effect of corpus size on case frame acquisition for discourse analysis」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル