TY - CHAP

T1 - Pitfalls for categorizations of objective interestingness measures for rule discovery

AU - Suzuki, Einoshin

PY - 2008/7/17

Y1 - 2008/7/17

N2 - In this paper, we point out four pitfalls for categorizations of objective interestingness measures for rule discovery. Rule discovery, which is extensively studied in data mining, suffers from the problem of outputting a huge number of rules. An objective interestingness measure can be used to estimate the potential usefulness of a discovered rule based on the given data set thus hopefully serves as a countermeasure to circumvent this problem. Various measures have been proposed, resulting systematic attempts for categorizing such measures. We believe that such attempts are subject to four kinds of pitfalls: data bias, rule bias, expert bias, and search bias. The main objective of this paper is to issue an alert for the pitfalls which are harmful to one of the most important research topics in data mining. We also list desiderata in categorizing objective interestingness measures.

AB - In this paper, we point out four pitfalls for categorizations of objective interestingness measures for rule discovery. Rule discovery, which is extensively studied in data mining, suffers from the problem of outputting a huge number of rules. An objective interestingness measure can be used to estimate the potential usefulness of a discovered rule based on the given data set thus hopefully serves as a countermeasure to circumvent this problem. Various measures have been proposed, resulting systematic attempts for categorizing such measures. We believe that such attempts are subject to four kinds of pitfalls: data bias, rule bias, expert bias, and search bias. The main objective of this paper is to issue an alert for the pitfalls which are harmful to one of the most important research topics in data mining. We also list desiderata in categorizing objective interestingness measures.

UR - http://www.scopus.com/inward/record.url?scp=47049123227&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=47049123227&partnerID=8YFLogxK

U2 - 10.1007/978-3-540-78983-3_17

DO - 10.1007/978-3-540-78983-3_17

M3 - Chapter

AN - SCOPUS:47049123227

SN - 9783540789826

T3 - Studies in Computational Intelligence

SP - 383

EP - 395

BT - Statistical Implicative Analysis

A2 - Gras, Régis

A2 - Suzuki, Einoshin

A2 - Guillet, Fabrice

A2 - Spagnolo, Filippo

ER -