General algorithms for mining closed flexible patterns under various equivalence relations

Tomohiro I, Yuki Enokuma, Hideo Bannai, Masayuki Takeda

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We address the closed pattern discovery problem in sequential databases for the class of flexible patterns. We propose two techniques of coarsening existing equivalence relations on the set of patterns to obtain new equivalence relations. Our new algorithm GenCloFlex is a generalization of MaxFlex proposed by Arimura and Uno (2007) that was designed for a particular equivalence relation. GenCloFlex can cope with existing, as well as new equivalence relations, and we investigate the computational complexities of the algorithm for respective equivalence relations. Then, we present an improved algorithm GenCloFlex+ based on new pruning techniques, which improve the delay time per output for some of the equivalence relations. By computational experiments on synthetic data, we show that most of the redundancies in the mined patterns are removed using the proposed equivalence relations.

Original languageEnglish
Title of host publicationMachine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2012, Proceedings
PublisherSpringer Verlag
Pages435-450
Number of pages16
EditionPART 2
ISBN (Print)9783642334856
DOIs
Publication statusPublished - Jan 1 2012
Event2012 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML-PKDD 2012 - Bristol, United Kingdom
Duration: Sep 24 2012Sep 28 2012

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 2
Volume7524
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other2012 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, ECML-PKDD 2012
CountryUnited Kingdom
CityBristol
Period9/24/129/28/12

    Fingerprint

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

I, T., Enokuma, Y., Bannai, H., & Takeda, M. (2012). General algorithms for mining closed flexible patterns under various equivalence relations. In Machine Learning and Knowledge Discovery in Databases - European Conference, ECML PKDD 2012, Proceedings (PART 2 ed., pp. 435-450). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7524, No. PART 2). Springer Verlag. https://doi.org/10.1007/978-3-642-33486-3_28