A new structure for accelerating xpath location steps

Yaokai Feng, Akifumi Makinouchi

Research output: Contribution to journalArticle

Abstract

Indexing technology is one of the kernel technologies in database management systems, especially for large datasets. XML (eXtensible Markup Language) has been successfully adopted as a universal data exchange format, particularly in the WWW environment. It is an important and basic task to efficiently implement XPath axes on XML documents. Using R*-tree, T. Grust proposed an interesting method to support all XPath axes. In this method, all of the nodes of an XML document are mapped to a point set in a five-dimensional space. T. Grust clarified the fact that each of the XPath axes can be implemented by a range query in the abovementioned five-dimensional space. Thus, R*-tree (one of the popular multidimensional indices) was used to improve the query performance for XPath axes. However, according to our investigations, most of the range queries for the XPath axes are partially-dimensional range queries. If the existing multidimensional indices are used for such range queries, a great deal of information that is irrelevant to the queries must also be read from disk. Based on this observation, a new multidimensional index structure, called Adaptive R*-tree (AR*-tree), is proposed herein to support the XPath axes more efficiently.

Original languageEnglish
Pages (from-to)113-121
Number of pages9
JournalIAENG International Journal of Computer Science
Volume38
Issue number2
Publication statusPublished - May 25 2011

Fingerprint

XML
Electronic data interchange
World Wide Web

All Science Journal Classification (ASJC) codes

  • Computer Science(all)

Cite this

A new structure for accelerating xpath location steps. / Feng, Yaokai; Makinouchi, Akifumi.

In: IAENG International Journal of Computer Science, Vol. 38, No. 2, 25.05.2011, p. 113-121.

Research output: Contribution to journalArticle

@article{60268d8a8be24c53829f603466d0e7eb,
title = "A new structure for accelerating xpath location steps",
abstract = "Indexing technology is one of the kernel technologies in database management systems, especially for large datasets. XML (eXtensible Markup Language) has been successfully adopted as a universal data exchange format, particularly in the WWW environment. It is an important and basic task to efficiently implement XPath axes on XML documents. Using R*-tree, T. Grust proposed an interesting method to support all XPath axes. In this method, all of the nodes of an XML document are mapped to a point set in a five-dimensional space. T. Grust clarified the fact that each of the XPath axes can be implemented by a range query in the abovementioned five-dimensional space. Thus, R*-tree (one of the popular multidimensional indices) was used to improve the query performance for XPath axes. However, according to our investigations, most of the range queries for the XPath axes are partially-dimensional range queries. If the existing multidimensional indices are used for such range queries, a great deal of information that is irrelevant to the queries must also be read from disk. Based on this observation, a new multidimensional index structure, called Adaptive R*-tree (AR*-tree), is proposed herein to support the XPath axes more efficiently.",
author = "Yaokai Feng and Akifumi Makinouchi",
year = "2011",
month = "5",
day = "25",
language = "English",
volume = "38",
pages = "113--121",
journal = "IAENG International Journal of Computer Science",
issn = "1819-656X",
publisher = "International Association of Engineers",
number = "2",

}

TY - JOUR

T1 - A new structure for accelerating xpath location steps

AU - Feng, Yaokai

AU - Makinouchi, Akifumi

PY - 2011/5/25

Y1 - 2011/5/25

N2 - Indexing technology is one of the kernel technologies in database management systems, especially for large datasets. XML (eXtensible Markup Language) has been successfully adopted as a universal data exchange format, particularly in the WWW environment. It is an important and basic task to efficiently implement XPath axes on XML documents. Using R*-tree, T. Grust proposed an interesting method to support all XPath axes. In this method, all of the nodes of an XML document are mapped to a point set in a five-dimensional space. T. Grust clarified the fact that each of the XPath axes can be implemented by a range query in the abovementioned five-dimensional space. Thus, R*-tree (one of the popular multidimensional indices) was used to improve the query performance for XPath axes. However, according to our investigations, most of the range queries for the XPath axes are partially-dimensional range queries. If the existing multidimensional indices are used for such range queries, a great deal of information that is irrelevant to the queries must also be read from disk. Based on this observation, a new multidimensional index structure, called Adaptive R*-tree (AR*-tree), is proposed herein to support the XPath axes more efficiently.

AB - Indexing technology is one of the kernel technologies in database management systems, especially for large datasets. XML (eXtensible Markup Language) has been successfully adopted as a universal data exchange format, particularly in the WWW environment. It is an important and basic task to efficiently implement XPath axes on XML documents. Using R*-tree, T. Grust proposed an interesting method to support all XPath axes. In this method, all of the nodes of an XML document are mapped to a point set in a five-dimensional space. T. Grust clarified the fact that each of the XPath axes can be implemented by a range query in the abovementioned five-dimensional space. Thus, R*-tree (one of the popular multidimensional indices) was used to improve the query performance for XPath axes. However, according to our investigations, most of the range queries for the XPath axes are partially-dimensional range queries. If the existing multidimensional indices are used for such range queries, a great deal of information that is irrelevant to the queries must also be read from disk. Based on this observation, a new multidimensional index structure, called Adaptive R*-tree (AR*-tree), is proposed herein to support the XPath axes more efficiently.

UR - http://www.scopus.com/inward/record.url?scp=79960050389&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79960050389&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:79960050389

VL - 38

SP - 113

EP - 121

JO - IAENG International Journal of Computer Science

JF - IAENG International Journal of Computer Science

SN - 1819-656X

IS - 2

ER -