Accuracy improvement of automatic text classification based on feature transformation

Guowei Zu, Wataru Ohyama, Tetsushi Wakabayashi, Fumitaka Kimura

Research output: Chapter in Book/Report/Conference proceedingConference contribution

9 Citations (Scopus)

Abstract

In this paper, we describe a comparative study on techniques of feature transformation and classification to improve the accuracy of automatic text classification. The normalization to the relative word frequency, the principal component analysis (K-L transformation) and the power transformation were applied to the feature vectors, which were classified by the Euclidean distance, the linear discriminant function, the projection distance, the modified projection distance and the SVM.

Original languageEnglish
Title of host publicationProceedings of the 2003 ACM Symposium on Document Engineering
EditorsC. Vanoirbeek, C. Roisin, E. Munson
Pages118-120
Number of pages3
Publication statusPublished - Dec 1 2003
EventProceedings of the 2003 ACM Symposium on Document Engineering - Grenoble, France
Duration: Nov 20 2003Nov 22 2003

Publication series

NameProceedings of the 2003 ACM Symposium on Document Engineering

Other

OtherProceedings of the 2003 ACM Symposium on Document Engineering
CountryFrance
CityGrenoble
Period11/20/0311/22/03

Fingerprint

Principal component analysis

All Science Journal Classification (ASJC) codes

  • Engineering(all)

Cite this

Zu, G., Ohyama, W., Wakabayashi, T., & Kimura, F. (2003). Accuracy improvement of automatic text classification based on feature transformation. In C. Vanoirbeek, C. Roisin, & E. Munson (Eds.), Proceedings of the 2003 ACM Symposium on Document Engineering (pp. 118-120). (Proceedings of the 2003 ACM Symposium on Document Engineering).

Accuracy improvement of automatic text classification based on feature transformation. / Zu, Guowei; Ohyama, Wataru; Wakabayashi, Tetsushi; Kimura, Fumitaka.

Proceedings of the 2003 ACM Symposium on Document Engineering. ed. / C. Vanoirbeek; C. Roisin; E. Munson. 2003. p. 118-120 (Proceedings of the 2003 ACM Symposium on Document Engineering).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Zu, G, Ohyama, W, Wakabayashi, T & Kimura, F 2003, Accuracy improvement of automatic text classification based on feature transformation. in C Vanoirbeek, C Roisin & E Munson (eds), Proceedings of the 2003 ACM Symposium on Document Engineering. Proceedings of the 2003 ACM Symposium on Document Engineering, pp. 118-120, Proceedings of the 2003 ACM Symposium on Document Engineering, Grenoble, France, 11/20/03.
Zu G, Ohyama W, Wakabayashi T, Kimura F. Accuracy improvement of automatic text classification based on feature transformation. In Vanoirbeek C, Roisin C, Munson E, editors, Proceedings of the 2003 ACM Symposium on Document Engineering. 2003. p. 118-120. (Proceedings of the 2003 ACM Symposium on Document Engineering).
Zu, Guowei ; Ohyama, Wataru ; Wakabayashi, Tetsushi ; Kimura, Fumitaka. / Accuracy improvement of automatic text classification based on feature transformation. Proceedings of the 2003 ACM Symposium on Document Engineering. editor / C. Vanoirbeek ; C. Roisin ; E. Munson. 2003. pp. 118-120 (Proceedings of the 2003 ACM Symposium on Document Engineering).
@inproceedings{f8f5c76dcc424daead0a9ecb335e3ffd,
title = "Accuracy improvement of automatic text classification based on feature transformation",
abstract = "In this paper, we describe a comparative study on techniques of feature transformation and classification to improve the accuracy of automatic text classification. The normalization to the relative word frequency, the principal component analysis (K-L transformation) and the power transformation were applied to the feature vectors, which were classified by the Euclidean distance, the linear discriminant function, the projection distance, the modified projection distance and the SVM.",
author = "Guowei Zu and Wataru Ohyama and Tetsushi Wakabayashi and Fumitaka Kimura",
year = "2003",
month = "12",
day = "1",
language = "English",
isbn = "1581137249",
series = "Proceedings of the 2003 ACM Symposium on Document Engineering",
pages = "118--120",
editor = "C. Vanoirbeek and C. Roisin and E. Munson",
booktitle = "Proceedings of the 2003 ACM Symposium on Document Engineering",

}

TY - GEN

T1 - Accuracy improvement of automatic text classification based on feature transformation

AU - Zu, Guowei

AU - Ohyama, Wataru

AU - Wakabayashi, Tetsushi

AU - Kimura, Fumitaka

PY - 2003/12/1

Y1 - 2003/12/1

N2 - In this paper, we describe a comparative study on techniques of feature transformation and classification to improve the accuracy of automatic text classification. The normalization to the relative word frequency, the principal component analysis (K-L transformation) and the power transformation were applied to the feature vectors, which were classified by the Euclidean distance, the linear discriminant function, the projection distance, the modified projection distance and the SVM.

AB - In this paper, we describe a comparative study on techniques of feature transformation and classification to improve the accuracy of automatic text classification. The normalization to the relative word frequency, the principal component analysis (K-L transformation) and the power transformation were applied to the feature vectors, which were classified by the Euclidean distance, the linear discriminant function, the projection distance, the modified projection distance and the SVM.

UR - http://www.scopus.com/inward/record.url?scp=3543055806&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=3543055806&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:3543055806

SN - 1581137249

SN - 9781581137248

T3 - Proceedings of the 2003 ACM Symposium on Document Engineering

SP - 118

EP - 120

BT - Proceedings of the 2003 ACM Symposium on Document Engineering

A2 - Vanoirbeek, C.

A2 - Roisin, C.

A2 - Munson, E.

ER -