TY - GEN
T1 - Correlated topic model for image annotation
AU - Xu, Xing
AU - Shimada, Atsushi
AU - Taniguchi, Rin Ichiro
PY - 2013
Y1 - 2013
N2 - For the task of image annotation, traditional methods based on probabilistic topic model, such as correspondence Latent Dirichlet Allocation (corrLDA) [1], assumes that image is a mixture of latent topics. However, this kind of models is unable to directly model correlation between topics since topic proportions of an image are generated independently. Our model, called correspondence Correlated Topic Model (corrCTM), extends Correlated Topic Model (CTM) [2] from natural language processing to capture topic correlation from covariance structure of more flexible model distribution. Unlike previous LDA based models, topic proportions are correlated with each other in proposed corrCTM. And the topic correlation propagates from image features to annotation words through a generative process, and finally correspondence between images and words could be generated. We derive an approximate inference and estimation algorithm based on variational method. We examine the performance of our model on two benchmark image datasets, show improved performance over corrLDA for both annotation and modeling word correlation.
AB - For the task of image annotation, traditional methods based on probabilistic topic model, such as correspondence Latent Dirichlet Allocation (corrLDA) [1], assumes that image is a mixture of latent topics. However, this kind of models is unable to directly model correlation between topics since topic proportions of an image are generated independently. Our model, called correspondence Correlated Topic Model (corrCTM), extends Correlated Topic Model (CTM) [2] from natural language processing to capture topic correlation from covariance structure of more flexible model distribution. Unlike previous LDA based models, topic proportions are correlated with each other in proposed corrCTM. And the topic correlation propagates from image features to annotation words through a generative process, and finally correspondence between images and words could be generated. We derive an approximate inference and estimation algorithm based on variational method. We examine the performance of our model on two benchmark image datasets, show improved performance over corrLDA for both annotation and modeling word correlation.
UR - http://www.scopus.com/inward/record.url?scp=84875982111&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84875982111&partnerID=8YFLogxK
U2 - 10.1109/FCV.2013.6485488
DO - 10.1109/FCV.2013.6485488
M3 - Conference contribution
AN - SCOPUS:84875982111
SN - 9781467356206
T3 - FCV 2013 - Proceedings of the 19th Korea-Japan Joint Workshop on Frontiers of Computer Vision
SP - 201
EP - 208
BT - FCV 2013 - Proceedings of the 19th Korea-Japan Joint Workshop on Frontiers of Computer Vision
T2 - 19th Korea-Japan Joint Workshop on Frontiers of Computer Vision, FCV 2013
Y2 - 30 January 2013 through 1 February 2013
ER -