Semi-supervised coupled dictionary learning for cross-modal retrieval in internet images and texts

Xing Xu, Yang Yang, Atsushi Shimada, Rin Ichiro Taniguchi, Li He

研究成果: 書籍/レポート タイプへの寄稿会議への寄与

25 被引用数 (Scopus)

抄録

Nowadays massive amount of images and texts has been emerging on the Internet, arousing the demand of effective cross-modal retrieval. To eliminate the heterogeneity be-tween the modalities of images and texts, the existing sub-space learning methods try to learn a common latent sub-space under which cross-modal matching can be performed. However, these methods usually require fully paired sam-ples (images with corresponding texts) and also ignore the class label information along with the paired samples. In-deed, the class label information can reduce the semantic gap between different modalities and explicitly guide the subspace learning procedure. In addition, the large quan-tities of unpaired samples (images or texts) may provide useful side information to enrich the representations from learned subspace. Thus, in this paper we propose a novel model for cross-modal retrieval problem. It consists of 1) a semi-supervised coupled dictionary learning step to generate homogeneously sparse representations for different modali-ties based on both paired and unpaired samples; 2) a coupled feature mapping step to project the sparse representations of different modalities into a common subspace defined by class label information to perform cross-modal matching. Exper-iments on a large scale web image dataset MIRFlickr-1M with both fully paired and unpaired settings show the effec-tiveness of the proposed model on the cross-modal retrieval task.

本文言語英語
ホスト出版物のタイトルMM 2015 - Proceedings of the 2015 ACM Multimedia Conference
出版社Association for Computing Machinery, Inc
ページ847-850
ページ数4
ISBN(電子版)9781450334594
DOI
出版ステータス出版済み - 10月 13 2015
イベント23rd ACM International Conference on Multimedia, MM 2015 - Brisbane, オーストラリア
継続期間: 10月 26 201510月 30 2015

出版物シリーズ

名前MM 2015 - Proceedings of the 2015 ACM Multimedia Conference

その他

その他23rd ACM International Conference on Multimedia, MM 2015
国/地域オーストラリア
CityBrisbane
Period10/26/1510/30/15

!!!All Science Journal Classification (ASJC) codes

  • メディア記述
  • コンピュータ グラフィックスおよびコンピュータ支援設計
  • コンピュータ ビジョンおよびパターン認識
  • ソフトウェア

フィンガープリント

「Semi-supervised coupled dictionary learning for cross-modal retrieval in internet images and texts」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル