Data squashing for speeding up boosting-based outlier detection

Shutaro Inatani, Einoshin Suzuki

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

In this paper, we apply data squashing to speed up outlier detection based on boosting. One person's noise is another person's signal. Outlier detection is gaining increasing attention in data mining. In order to improve computational time for AdaBoost-based outlier detection, we beforehand compress a given data set based on a simplified method of BIRCH. Effectiveness of our approach in terms of detection accuracy and computational time is investigated by experiments with two real-world data sets of drug stores in Japan and an artificial data set of unlawful access to a computer network.

Original languageEnglish
Title of host publicationFoundations of Intelligent Systems - 13th International Symposium, ISMIS 2002, Proceedings
Pages600-611
Number of pages12
Publication statusPublished - Dec 1 2002
Event13th International Symposium on Methodologies for Intelligent Systems, ISMIS 2002 - Lyon, France
Duration: Jun 27 2002Jun 29 2002

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2366 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other13th International Symposium on Methodologies for Intelligent Systems, ISMIS 2002
CountryFrance
CityLyon
Period6/27/026/29/02

Fingerprint

Adaptive boosting
Outlier Detection
Boosting
Computer networks
Data mining
Person
Experiments
AdaBoost
Computer Networks
Japan
Data Mining
Drugs
Speedup
Experiment

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Inatani, S., & Suzuki, E. (2002). Data squashing for speeding up boosting-based outlier detection. In Foundations of Intelligent Systems - 13th International Symposium, ISMIS 2002, Proceedings (pp. 600-611). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 2366 LNAI).

Data squashing for speeding up boosting-based outlier detection. / Inatani, Shutaro; Suzuki, Einoshin.

Foundations of Intelligent Systems - 13th International Symposium, ISMIS 2002, Proceedings. 2002. p. 600-611 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 2366 LNAI).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Inatani, S & Suzuki, E 2002, Data squashing for speeding up boosting-based outlier detection. in Foundations of Intelligent Systems - 13th International Symposium, ISMIS 2002, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 2366 LNAI, pp. 600-611, 13th International Symposium on Methodologies for Intelligent Systems, ISMIS 2002, Lyon, France, 6/27/02.
Inatani S, Suzuki E. Data squashing for speeding up boosting-based outlier detection. In Foundations of Intelligent Systems - 13th International Symposium, ISMIS 2002, Proceedings. 2002. p. 600-611. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Inatani, Shutaro ; Suzuki, Einoshin. / Data squashing for speeding up boosting-based outlier detection. Foundations of Intelligent Systems - 13th International Symposium, ISMIS 2002, Proceedings. 2002. pp. 600-611 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{2b60b93b02c544fda431057292d0adc8,
title = "Data squashing for speeding up boosting-based outlier detection",
abstract = "In this paper, we apply data squashing to speed up outlier detection based on boosting. One person's noise is another person's signal. Outlier detection is gaining increasing attention in data mining. In order to improve computational time for AdaBoost-based outlier detection, we beforehand compress a given data set based on a simplified method of BIRCH. Effectiveness of our approach in terms of detection accuracy and computational time is investigated by experiments with two real-world data sets of drug stores in Japan and an artificial data set of unlawful access to a computer network.",
author = "Shutaro Inatani and Einoshin Suzuki",
year = "2002",
month = "12",
day = "1",
language = "English",
isbn = "3540437851",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "600--611",
booktitle = "Foundations of Intelligent Systems - 13th International Symposium, ISMIS 2002, Proceedings",

}

TY - GEN

T1 - Data squashing for speeding up boosting-based outlier detection

AU - Inatani, Shutaro

AU - Suzuki, Einoshin

PY - 2002/12/1

Y1 - 2002/12/1

N2 - In this paper, we apply data squashing to speed up outlier detection based on boosting. One person's noise is another person's signal. Outlier detection is gaining increasing attention in data mining. In order to improve computational time for AdaBoost-based outlier detection, we beforehand compress a given data set based on a simplified method of BIRCH. Effectiveness of our approach in terms of detection accuracy and computational time is investigated by experiments with two real-world data sets of drug stores in Japan and an artificial data set of unlawful access to a computer network.

AB - In this paper, we apply data squashing to speed up outlier detection based on boosting. One person's noise is another person's signal. Outlier detection is gaining increasing attention in data mining. In order to improve computational time for AdaBoost-based outlier detection, we beforehand compress a given data set based on a simplified method of BIRCH. Effectiveness of our approach in terms of detection accuracy and computational time is investigated by experiments with two real-world data sets of drug stores in Japan and an artificial data set of unlawful access to a computer network.

UR - http://www.scopus.com/inward/record.url?scp=84868089972&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84868089972&partnerID=8YFLogxK

M3 - Conference contribution

SN - 3540437851

SN - 9783540437857

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 600

EP - 611

BT - Foundations of Intelligent Systems - 13th International Symposium, ISMIS 2002, Proceedings

ER -