TY - GEN
T1 - Detecting academic papers on the web
AU - Ishita, Emi
AU - Agata, Teru
AU - Ikeuchi, Atsushi
AU - Yosuke, Miyata
AU - Ueda, Shuichi
PY - 2011/7/25
Y1 - 2011/7/25
N2 - Our research goal is to develop a search engine for open access to academic papers. English and Japanese test sets were built for detection of academic papers from 20,000 PDF files in each language using five annotators. Six classifiers were trained using similar features for each language. We report F1 of 0.74 for English and 0.54 for Japanese and argue that similar features could easily be generated for other languages as well.
AB - Our research goal is to develop a search engine for open access to academic papers. English and Japanese test sets were built for detection of academic papers from 20,000 PDF files in each language using five annotators. Six classifiers were trained using similar features for each language. We report F1 of 0.74 for English and 0.54 for Japanese and argue that similar features could easily be generated for other languages as well.
UR - http://www.scopus.com/inward/record.url?scp=79960504404&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79960504404&partnerID=8YFLogxK
U2 - 10.1145/1998076.1998161
DO - 10.1145/1998076.1998161
M3 - Conference contribution
AN - SCOPUS:79960504404
SN - 9781450307444
T3 - Proceedings of the ACM/IEEE Joint Conference on Digital Libraries
SP - 413
EP - 414
BT - JCDL'11 - Proceedings of the 2011 ACM/IEEE Joint Conference on Digital Libraries
T2 - 11th Annual International ACM/IEEE Joint Conference on Digital Libraries, JCDL'11
Y2 - 13 June 2011 through 17 June 2011
ER -