TY - GEN
T1 - A ranking scheme for XML information retrieval based on benefit and reading effort
AU - Shimizu, Toshiyuki
AU - Yoshikawa, Masatoshi
PY - 2007
Y1 - 2007
N2 - XML information retrieval (XML-IR) systems search XML documents for document fragments relevant to a given query. In top-k search, users control the size of the output by specifying an integer k. In XML-IR, however, output elements vary widely in size. Consequently, the total output size of the top-k elements cannot be controlled simply by specifying an integer k. In addition, search results may contain nested elements. If a system orders result elements simply by their relevance, users may browse the same content more than once because of the nesting. To handle these problems, we propose a new ranking method that enables users to browse the search results of XML-IR systems efficiently by introducing the concepts of benefit and reading effort. We also propose an evaluation metric based on benefit and reading effort, and compare it experimentally with existing XML-IR metrics.
AB - XML information retrieval (XML-IR) systems search XML documents for document fragments relevant to a given query. In top-k search, users control the size of the output by specifying an integer k. In XML-IR, however, output elements vary widely in size. Consequently, the total output size of the top-k elements cannot be controlled simply by specifying an integer k. In addition, search results may contain nested elements. If a system orders result elements simply by their relevance, users may browse the same content more than once because of the nesting. To handle these problems, we propose a new ranking method that enables users to browse the search results of XML-IR systems efficiently by introducing the concepts of benefit and reading effort. We also propose an evaluation metric based on benefit and reading effort, and compare it experimentally with existing XML-IR metrics.
UR - http://www.scopus.com/inward/record.url?scp=38149119670&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38149119670&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-77094-7_32
DO - 10.1007/978-3-540-77094-7_32
M3 - Conference contribution
AN - SCOPUS:38149119670
SN - 9783540770930
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 230
EP - 240
BT - Asian Digital Libraries
PB - Springer Verlag
T2 - 10th International Conference on Asian Digital Libraries, ICADL 2007
Y2 - 10 December 2007 through 13 December 2007
ER -