Translation disambiguation for cross-language information retrieval using context-based translation probability

Kazuaki Kishida, Emi Ishita

研究成果: Contribution to journalArticle査読

7 被引用数 (Scopus)

抄録

Disambiguation between multiple translation choices is very important in dictionary-based cross-language information retrieval. In prior work, disambiguation techniques have used term co-occurrence statistics from the collection being searched. Experimentally these techniques have worked well but are based upon heuristic assumptions. In this paper, a theoretically grounded alternative is proposed, one which uses sense disambiguation based upon context terms within the source text. Specifically this paper introduces the concept of translation probabilities incorporating a context term and extends the IBM Model 1 for estimating context-based translation probabilities from a sentence-aligned bilingual corpus. Experimental results in English to Italian bilingual searches show significant performance improvement of the context-based translation probabilities over the case without any disambiguation.

本文言語英語
ページ(範囲)481-495
ページ数15
ジャーナルJournal of Information Science
35
4
DOI
出版ステータス出版済み - 8 2009
外部発表はい

All Science Journal Classification (ASJC) codes

  • 情報システム
  • 図書館情報学

フィンガープリント

「Translation disambiguation for cross-language information retrieval using context-based translation probability」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル