TY - JOUR
T1 - A dataset of pairs of an image and tags for cataloging image-based archives
AU - Suzuki, Tokinori
AU - Nagamizo, Kota
AU - Ikeda, Daisuke
N1 - Funding Information:
We would like to thank, Petra Galuščáková and Prof. Doug Oard for valuable comments and advice when the first author started to develop this dataset in the University of Maryland. This research did not receive any specific grant from funding agencies in the public, commercial or not-for-profit sectors.
Publisher Copyright:
© 2022 The Authors
PY - 2022/12
Y1 - 2022/12
N2 - The dataset described in this paper contains pairs of images collected from the Web and their tags of keywords, which are linked to appropriate entity pages of Wikipedia, and programs to reproduce experiments. It is assumed for evaluating the disambiguation task, in which given an image and its tags to be disambiguated, an appropriate Wikipedia page is selected for each of the given tag. We collected images tagged keywords of animal names for that ambiguity and their tags since animal names may refer to not only names of animal but names of other types of objects, e.g., nicknames of sports teams from the photo sharing site Flickr. The tags are linked to the correspondence Wikipedia page judged by annotators. The dataset includes 420 images and 2,464 tags. It is useful for developing a system to link a keyword of an image to an entry of a knowledgebase as well as an image classification system, which include fine-grained classes, e.g. proper nouns of objects, as their classification targets.
AB - The dataset described in this paper contains pairs of images collected from the Web and their tags of keywords, which are linked to appropriate entity pages of Wikipedia, and programs to reproduce experiments. It is assumed for evaluating the disambiguation task, in which given an image and its tags to be disambiguated, an appropriate Wikipedia page is selected for each of the given tag. We collected images tagged keywords of animal names for that ambiguity and their tags since animal names may refer to not only names of animal but names of other types of objects, e.g., nicknames of sports teams from the photo sharing site Flickr. The tags are linked to the correspondence Wikipedia page judged by annotators. The dataset includes 420 images and 2,464 tags. It is useful for developing a system to link a keyword of an image to an entry of a knowledgebase as well as an image classification system, which include fine-grained classes, e.g. proper nouns of objects, as their classification targets.
UR - http://www.scopus.com/inward/record.url?scp=85141502070&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85141502070&partnerID=8YFLogxK
U2 - 10.1016/j.dib.2022.108722
DO - 10.1016/j.dib.2022.108722
M3 - Article
AN - SCOPUS:85141502070
VL - 45
JO - Data in Brief
JF - Data in Brief
SN - 2352-3409
M1 - 108722
ER -