Feature extraction using restricted bootstrapping

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

The bootstrapping method is known as an application of the Page-rank technique for documents and words. The technique calculates the score of the words by mutually propagating the score of the words and the documents. However, sometimes the result is far away from the initial query word. The problem is known as "topic drift". This paper proposes to restrict the words to be to the top t words in the process of bootstrapping. The method is simpler than the technique known so far. The method is applied for the real bankruptcy information documents to extract the bankruptcy causes strongly related to the query. It is confirmed that the method prevents the topic drift.

Original languageEnglish
Title of host publicationProceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012
Pages283-288
Number of pages6
DOIs
Publication statusPublished - Jul 25 2012
Externally publishedYes
Event2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012 - Shanghai, China
Duration: May 30 2012Jun 1 2012

Publication series

NameProceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012

Other

Other2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012
CountryChina
CityShanghai
Period5/30/126/1/12

Fingerprint

Feature extraction

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Information Systems

Cite this

Hirokawa, S. (2012). Feature extraction using restricted bootstrapping. In Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012 (pp. 283-288). [6211110] (Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012). https://doi.org/10.1109/ICIS.2012.50

Feature extraction using restricted bootstrapping. / Hirokawa, Sachio.

Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012. 2012. p. 283-288 6211110 (Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Hirokawa, S 2012, Feature extraction using restricted bootstrapping. in Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012., 6211110, Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012, pp. 283-288, 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012, Shanghai, China, 5/30/12. https://doi.org/10.1109/ICIS.2012.50
Hirokawa S. Feature extraction using restricted bootstrapping. In Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012. 2012. p. 283-288. 6211110. (Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012). https://doi.org/10.1109/ICIS.2012.50
Hirokawa, Sachio. / Feature extraction using restricted bootstrapping. Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012. 2012. pp. 283-288 (Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012).
@inproceedings{5e7fc1947ca342e1b4c5463d0b324dd8,
title = "Feature extraction using restricted bootstrapping",
abstract = "The bootstrapping method is known as an application of the Page-rank technique for documents and words. The technique calculates the score of the words by mutually propagating the score of the words and the documents. However, sometimes the result is far away from the initial query word. The problem is known as {"}topic drift{"}. This paper proposes to restrict the words to be to the top t words in the process of bootstrapping. The method is simpler than the technique known so far. The method is applied for the real bankruptcy information documents to extract the bankruptcy causes strongly related to the query. It is confirmed that the method prevents the topic drift.",
author = "Sachio Hirokawa",
year = "2012",
month = "7",
day = "25",
doi = "10.1109/ICIS.2012.50",
language = "English",
isbn = "9780769546940",
series = "Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012",
pages = "283--288",
booktitle = "Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012",

}

TY - GEN

T1 - Feature extraction using restricted bootstrapping

AU - Hirokawa, Sachio

PY - 2012/7/25

Y1 - 2012/7/25

N2 - The bootstrapping method is known as an application of the Page-rank technique for documents and words. The technique calculates the score of the words by mutually propagating the score of the words and the documents. However, sometimes the result is far away from the initial query word. The problem is known as "topic drift". This paper proposes to restrict the words to be to the top t words in the process of bootstrapping. The method is simpler than the technique known so far. The method is applied for the real bankruptcy information documents to extract the bankruptcy causes strongly related to the query. It is confirmed that the method prevents the topic drift.

AB - The bootstrapping method is known as an application of the Page-rank technique for documents and words. The technique calculates the score of the words by mutually propagating the score of the words and the documents. However, sometimes the result is far away from the initial query word. The problem is known as "topic drift". This paper proposes to restrict the words to be to the top t words in the process of bootstrapping. The method is simpler than the technique known so far. The method is applied for the real bankruptcy information documents to extract the bankruptcy causes strongly related to the query. It is confirmed that the method prevents the topic drift.

UR - http://www.scopus.com/inward/record.url?scp=84864051520&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84864051520&partnerID=8YFLogxK

U2 - 10.1109/ICIS.2012.50

DO - 10.1109/ICIS.2012.50

M3 - Conference contribution

AN - SCOPUS:84864051520

SN - 9780769546940

T3 - Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012

SP - 283

EP - 288

BT - Proceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012

ER -