Efficient evaluation of partially-dimensional range queries in large OLAP datasets

Yaokai Feng, Kunihiko Kaneko, Akifumi Makinouchi

Research output: Contribution to journalArticle

Abstract

In light of the increasing requirement for processing multidimensional queries on OLAP (relational) data, the database community has focused on the queries (especially range queries) on the large OLAP datasets from the view of multidimensional data. It is well-known that multidimensional indices are helpful to improve the performance of such queries. However, we found that much information irrelevant to queries also has to be read from disk if the existing multidimensional indices are used with OLAP data, which greatly degrade the search performance. This problem comes from particularity on the actual queries exerted on OLAP data. That is, in many OLAP applications, the query conditions probably are only with partial dimensions (not all) of the whole index space. Such range queries are called partially-dimensional (PD) range queries in this study. Based on R*-tree, we propose a new index structure, called AR*-tree, to counter the actual queries on OLAP data. The results of both mathematical analysis and many experiments with different datasets indicate that the AR*-tree can clearly improve the performance of PD range queries, esp. for large OLAP datasets.

Original languageEnglish
Pages (from-to)150-171
Number of pages22
JournalInternational Journal of Data Mining, Modelling and Management
Volume3
Issue number2
DOIs
Publication statusPublished - Jul 2011

All Science Journal Classification (ASJC) codes

  • Management Information Systems
  • Modelling and Simulation
  • Computer Science Applications

Fingerprint Dive into the research topics of 'Efficient evaluation of partially-dimensional range queries in large OLAP datasets'. Together they form a unique fingerprint.

Cite this