An extension of LCA based XML keyword search

Umaporn Supasitthimethee, Toshiyuki Shimizu, Masatoshi Yoshikawa, Kriengkrai Porkaew

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

One of the most convenient ways to query XML data is a keyword search because it does not require any knowledge about XML structure and without the need to learn a new user interface. However, keyword search interface is very flexible. It is hard for a system to decide which node is likely to be chosen as a return node and how much information should be included in the result. To address this challenge, we propose an extension of LCA based XML keyword search. First, to determine a return node, we provide a query syntax that the users can tell the system which node they are really interested in. In case that the users do not explicitly specify return information, our system will automatically analyze and choose appropriate return nodes by inferring from user keywords. Second, to return a meaningful result, we investigate the problem of the return information in the LCA and the proximity search approaches. To this end, we introduce the Lowest Element Node (LEN) and define our simple rules without any requirement on the schema information such as DTD or XML Schema. Our experiment results indicate that our system not only infers the right return nodes but also generates compact and meaningful results.

Original languageEnglish
Title of host publicationProceedings - 2008 1st International Workshop on Information-Explosion and Next Generation Search, INGS 2008
Pages104-111
Number of pages8
DOIs
Publication statusPublished - 2008
Externally publishedYes
Event2008 1st International Workshop on Information-Explosion and Next Generation Search, INGS 2008 - Shenyang, China
Duration: Apr 26 2008Apr 27 2008

Publication series

NameProceedings - 2008 International Workshop on Information-Explosion and Next Generation Search, INGS 2008

Conference

Conference2008 1st International Workshop on Information-Explosion and Next Generation Search, INGS 2008
Country/TerritoryChina
CityShenyang
Period4/26/084/27/08

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Information Systems

Fingerprint

Dive into the research topics of 'An extension of LCA based XML keyword search'. Together they form a unique fingerprint.

Cite this