Discovering characteristic patterns from collections of classical Japanese poems

Mayumi Yamasaki, Masayuki Takeda, Tomoko Fukuda, Ichirō Nanri

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Citations (Scopus)

Abstract

Waka is a form of traditional Japanese poetry with a 1300-year history. In this paper, we attempt to discover characteristics common to a collection of waka poems. As a formalism for characteristics, we use regular patterns where the constant parts are limited to sequences of auxiliary verbs and postpositional particles. We call such patterns fushi. The problem is to automatically find significant fushi patterns that characterize the poems. Solving this problem requires a reliable significance measure for the patterns. Brāzma et al. (1996) proposed such a measure according to the MDL principle. Using this method, we report successful results in finding patterns from five anthologies. Some of the results are quite stimulating, and we hope that they will lead to new discoveries. Based on our experience, we also propose a pattern-based text data mining system. Further research into waka poetry is now proceeding using this system.
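
The idea of a regular pattern with fixed constant parts can be illustrated with a small sketch. It is not the authors' implementation. It assumes a pattern is an ordered list of constant strings separated by gaps, that each gap may match any string including the empty one, and that poems are plain romanized strings. The function names pattern_to_regex and support are hypothetical, and the three "poems" are made-up placeholders, not real waka. The sketch only counts how many poems a pattern matches; the MDL-based significance measure mentioned in the abstract is not reproduced here.

```python
import re


def pattern_to_regex(constants):
    """Compile the pattern *c1*c2*...*ck* into a regular expression.

    Each '*' is a gap that matches any string (assumed here to include
    the empty string); the constants must occur in the given order.
    """
    body = ".*".join(re.escape(c) for c in constants)
    return re.compile(".*" + body + ".*", re.DOTALL)


def support(constants, poems):
    """Count the poems that the pattern matches."""
    rx = pattern_to_regex(constants)
    return sum(1 for poem in poems if rx.fullmatch(poem))


# Placeholder texts: 'wa'/'mo' stand in for particles, 'keri'/'nari' for
# auxiliary verbs; 'aaa', 'bbb', ... stand in for content words.
poems = [
    "aaa wa bbb keri",
    "ccc wa ddd nari",
    "eee mo fff keri",
]

print(support(["wa", "keri"], poems))  # pattern *wa*keri*
print(support(["wa"], poems))          # pattern *wa*
```

A significance measure would then rank candidate patterns by more than raw counts, since a very short constant part matches many poems without saying much about them.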

Original language: English
Title of host publication: Discovery Science - 1st International Conference, DS 1998, Proceedings
Editors: Setsuo Arikawa, Hiroshi Motoda
Publisher: Springer Verlag
Pages: 129-141
Number of pages: 13
ISBN (Print): 3540653902, 9783540653905
Publication status: Published - Jan 1 1998
Event: 1st International Conference on Discovery Science, DS 1998 - Fukuoka, Japan
Duration: Dec 14 1998 → Dec 16 1998

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 1532
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 1st International Conference on Discovery Science, DS 1998
Country: Japan
City: Fukuoka
Period: 12/14/98 → 12/16/98

Fingerprint

Data mining
Text Mining

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • Computer Science(all)

Cite this

Yamasaki, M., Takeda, M., Fukuda, T., & Nanri, I. (1998). Discovering characteristic patterns from collections of classical Japanese poems. In S. Arikawa, & H. Motoda (Eds.), Discovery Science - 1st International Conference, DS 1998, Proceedings (pp. 129-141). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 1532). Springer Verlag.

@inproceedings{dc7ce414dc474481981c8fb0ba44f9c8,
title = "Discovering characteristic patterns from collections of classical Japanese poems",
abstract = "Waka is a form of traditional Japanese poetry with a 1300-year history. In this paper, we attempt to discover characteristics common to a collection of waka poems. As a formalism for characteristics, we use regular patterns where the constant parts are limited to sequences of auxiliary verbs and postpositional particles. We call such patterns fushi. The problem is to automatically find significant fushi patterns that characterize the poems. Solving this problem requires a reliable significance measure for the patterns. Brāzma et al. (1996) proposed such a measure according to the MDL principle. Using this method, we report successful results in finding patterns from five anthologies. Some of the results are quite stimulating, and we hope that they will lead to new discoveries. Based on our experience, we also propose a pattern-based text data mining system. Further research into waka poetry is now proceeding using this system.",
author = "Mayumi Yamasaki and Masayuki Takeda and Tomoko Fukuda and Ichirō Nanri",
year = "1998",
month = "1",
day = "1",
language = "English",
isbn = "3540653902",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "129--141",
editor = "Setsuo Arikawa and Hiroshi Motoda",
booktitle = "Discovery Science - 1st International Conference, DS 1998, Proceedings",
address = "Germany",
}

TY  - GEN
T1  - Discovering characteristic patterns from collections of classical Japanese poems
AU  - Yamasaki, Mayumi
AU  - Takeda, Masayuki
AU  - Fukuda, Tomoko
AU  - Nanri, Ichirō
PY  - 1998/1/1
Y1  - 1998/1/1
AB  - Waka is a form of traditional Japanese poetry with a 1300-year history. In this paper, we attempt to discover characteristics common to a collection of waka poems. As a formalism for characteristics, we use regular patterns where the constant parts are limited to sequences of auxiliary verbs and postpositional particles. We call such patterns fushi. The problem is to automatically find significant fushi patterns that characterize the poems. Solving this problem requires a reliable significance measure for the patterns. Brāzma et al. (1996) proposed such a measure according to the MDL principle. Using this method, we report successful results in finding patterns from five anthologies. Some of the results are quite stimulating, and we hope that they will lead to new discoveries. Based on our experience, we also propose a pattern-based text data mining system. Further research into waka poetry is now proceeding using this system.
UR  - http://www.scopus.com/inward/record.url?scp=84949208217&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=84949208217&partnerID=8YFLogxK
M3  - Conference contribution
AN  - SCOPUS:84949208217
SN  - 3540653902
SN  - 9783540653905
T3  - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP  - 129
EP  - 141
BT  - Discovery Science - 1st International Conference, DS 1998, Proceedings
A2  - Arikawa, Setsuo
A2  - Motoda, Hiroshi
PB  - Springer Verlag
ER  -