Feature extraction using restricted bootstrapping

Sachio Hirokawa

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    2 Citations (Scopus)

    Abstract

    The bootstrapping method is known as an application of the Page-rank technique for documents and words. The technique calculates the score of the words by mutually propagating the score of the words and the documents. However, sometimes the result is far away from the initial query word. The problem is known as "topic drift". This paper proposes to restrict the words to be to the top t words in the process of bootstrapping. The method is simpler than the technique known so far. The method is applied for the real bankruptcy information documents to extract the bankruptcy causes strongly related to the query. It is confirmed that the method prevents the topic drift.

    Original languageEnglish
    Title of host publicationProceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012
    Pages283-288
    Number of pages6
    DOIs
    Publication statusPublished - 2012
    Event2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012 - Shanghai, China
    Duration: May 30 2012Jun 1 2012

    Publication series

    NameProceedings - 2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012

    Other

    Other2012 IEEE/ACIS 11th International Conference on Computer and Information Science, ICIS 2012
    CountryChina
    CityShanghai
    Period5/30/126/1/12

    All Science Journal Classification (ASJC) codes

    • Computer Networks and Communications
    • Information Systems

    Fingerprint Dive into the research topics of 'Feature extraction using restricted bootstrapping'. Together they form a unique fingerprint.

    Cite this