多義語分散表現の文脈化

芦原 和樹, 梶原 智之, 荒瀬 由紀, 内田 諭

研究成果: Contribution to journalArticle査読

抄録

Currently, distributed word representations are employed in many natural language processing tasks. However, when generating one representation for each word, the meanings of a polysemous word cannot be differentiated because the meanings are integrated into one representation. Therefore, several attempts have been made to generate different representations per meaning based on parts of speech or the topic of a sentence. However, these methods are too unrefined to deal with polysemy. In this paper, we proposed two methods to generate more subtle multiple word representations. The first method involves generating multiple word representations using the word in a dependency relationship as a clue. The second approach involves employing a bi-directional language model in which a word representation that considers all the words in the context is generated. The results of the extensive evaluation of the Lexical Substitution task and Context-Aware Word Similarity task confirmed the effectiveness of our approaches to generate more subtle multiple word representations. 
寄稿の翻訳タイトルContextualized Multi-Sense Word Embedding
本文言語日本語
ページ(範囲)689-710
ページ数22
ジャーナル自然言語処理
26
4
DOI
出版ステータス出版済み - 2019

フィンガープリント

「多義語分散表現の文脈化」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル