Efficient evaluation of partially-dimensional range queries in large OLAP datasets

Yaokai Feng, Kunihiko Kaneko, Akifumi Makinouchi

Research output: Contribution to journalArticle

Abstract

In light of the increasing requirement for processing multidimensional queries on OLAP (relational) data, the database community has focused on the queries (especially range queries) on the large OLAP datasets from the view of multidimensional data. It is well-known that multidimensional indices are helpful to improve the performance of such queries. However, we found that much information irrelevant to queries also has to be read from disk if the existing multidimensional indices are used with OLAP data, which greatly degrade the search performance. This problem comes from particularity on the actual queries exerted on OLAP data. That is, in many OLAP applications, the query conditions probably are only with partial dimensions (not all) of the whole index space. Such range queries are called partially-dimensional (PD) range queries in this study. Based on R *-tree, we propose a new index structure, called AR *-tree, to counter the actual queries on OLAP data. The results of both mathematical analysis and many experiments with different datasets indicate that the AR *-tree can clearly improve the performance of PD range queries, esp. for large OLAP datasets.

Original languageEnglish
Pages (from-to)150-171
Number of pages22
JournalInternational Journal of Data Mining, Modelling and Management
Volume3
Issue number2
DOIs
Publication statusPublished - Jul 1 2011

Fingerprint

Online Analytical Processing
Range Query
Query processing
Query
Evaluation
Experiments
R-tree
Multidimensional Data
Online analytical processing
Mathematical Analysis
Partial
Requirements
Experiment

All Science Journal Classification (ASJC) codes

  • Management Information Systems
  • Modelling and Simulation
  • Computer Science Applications

Cite this

Efficient evaluation of partially-dimensional range queries in large OLAP datasets. / Feng, Yaokai; Kaneko, Kunihiko; Makinouchi, Akifumi.

In: International Journal of Data Mining, Modelling and Management, Vol. 3, No. 2, 01.07.2011, p. 150-171.

Research output: Contribution to journalArticle

@article{c5b5c113eedc41e2a94828957f6c5a82,
title = "Efficient evaluation of partially-dimensional range queries in large OLAP datasets",
abstract = "In light of the increasing requirement for processing multidimensional queries on OLAP (relational) data, the database community has focused on the queries (especially range queries) on the large OLAP datasets from the view of multidimensional data. It is well-known that multidimensional indices are helpful to improve the performance of such queries. However, we found that much information irrelevant to queries also has to be read from disk if the existing multidimensional indices are used with OLAP data, which greatly degrade the search performance. This problem comes from particularity on the actual queries exerted on OLAP data. That is, in many OLAP applications, the query conditions probably are only with partial dimensions (not all) of the whole index space. Such range queries are called partially-dimensional (PD) range queries in this study. Based on R *-tree, we propose a new index structure, called AR *-tree, to counter the actual queries on OLAP data. The results of both mathematical analysis and many experiments with different datasets indicate that the AR *-tree can clearly improve the performance of PD range queries, esp. for large OLAP datasets.",
author = "Yaokai Feng and Kunihiko Kaneko and Akifumi Makinouchi",
year = "2011",
month = "7",
day = "1",
doi = "10.1504/IJDMMM.2011.041493",
language = "English",
volume = "3",
pages = "150--171",
journal = "International Journal of Data Mining, Modelling and Management",
issn = "1759-1163",
publisher = "Inderscience Publishers",
number = "2",

}

TY - JOUR

T1 - Efficient evaluation of partially-dimensional range queries in large OLAP datasets

AU - Feng, Yaokai

AU - Kaneko, Kunihiko

AU - Makinouchi, Akifumi

PY - 2011/7/1

Y1 - 2011/7/1

N2 - In light of the increasing requirement for processing multidimensional queries on OLAP (relational) data, the database community has focused on the queries (especially range queries) on the large OLAP datasets from the view of multidimensional data. It is well-known that multidimensional indices are helpful to improve the performance of such queries. However, we found that much information irrelevant to queries also has to be read from disk if the existing multidimensional indices are used with OLAP data, which greatly degrade the search performance. This problem comes from particularity on the actual queries exerted on OLAP data. That is, in many OLAP applications, the query conditions probably are only with partial dimensions (not all) of the whole index space. Such range queries are called partially-dimensional (PD) range queries in this study. Based on R *-tree, we propose a new index structure, called AR *-tree, to counter the actual queries on OLAP data. The results of both mathematical analysis and many experiments with different datasets indicate that the AR *-tree can clearly improve the performance of PD range queries, esp. for large OLAP datasets.

AB - In light of the increasing requirement for processing multidimensional queries on OLAP (relational) data, the database community has focused on the queries (especially range queries) on the large OLAP datasets from the view of multidimensional data. It is well-known that multidimensional indices are helpful to improve the performance of such queries. However, we found that much information irrelevant to queries also has to be read from disk if the existing multidimensional indices are used with OLAP data, which greatly degrade the search performance. This problem comes from particularity on the actual queries exerted on OLAP data. That is, in many OLAP applications, the query conditions probably are only with partial dimensions (not all) of the whole index space. Such range queries are called partially-dimensional (PD) range queries in this study. Based on R *-tree, we propose a new index structure, called AR *-tree, to counter the actual queries on OLAP data. The results of both mathematical analysis and many experiments with different datasets indicate that the AR *-tree can clearly improve the performance of PD range queries, esp. for large OLAP datasets.

UR - http://www.scopus.com/inward/record.url?scp=84858419358&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84858419358&partnerID=8YFLogxK

U2 - 10.1504/IJDMMM.2011.041493

DO - 10.1504/IJDMMM.2011.041493

M3 - Article

VL - 3

SP - 150

EP - 171

JO - International Journal of Data Mining, Modelling and Management

JF - International Journal of Data Mining, Modelling and Management

SN - 1759-1163

IS - 2

ER -