Non-topical classification of healthcare information on the web

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

The present paper collected the asthma related 4,762 Web pages from 1,759 sites using 6 queries. Each site is manually categorized by the standard topics of description and information dissemination, diary and idle talk and Q&A. By careful analysis, it turned out that the pages can be classified in non-topical categories such as 'reading level', 'objectivity/subjectivity' and 'reliability'. The manually assigned labels of non-topical categories are then used as learning data to apply SVM (support machine vector). The prediction performance (F-measure) were below 50% with the naive application of SVM. However, the prediction performance was improved over 50% by feature selection except for reading level.

Original languageEnglish
Title of host publicationSmart Digital Futures 2014
PublisherIOS Press
Pages237-247
Number of pages11
ISBN (Print)9781614994046
DOIs
Publication statusPublished - Jan 1 2014

Publication series

NameFrontiers in Artificial Intelligence and Applications
Volume262
ISSN (Print)0922-6389

Fingerprint

Information dissemination
Support vector machines
Feature extraction
Labels
Websites

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence

Cite this

Hirokawa, S., & Ishita, E. (2014). Non-topical classification of healthcare information on the web. In Smart Digital Futures 2014 (pp. 237-247). (Frontiers in Artificial Intelligence and Applications; Vol. 262). IOS Press. https://doi.org/10.3233/978-1-61499-405-3-237

Non-topical classification of healthcare information on the web. / Hirokawa, Sachio; Ishita, Emi.

Smart Digital Futures 2014. IOS Press, 2014. p. 237-247 (Frontiers in Artificial Intelligence and Applications; Vol. 262).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Hirokawa, S & Ishita, E 2014, Non-topical classification of healthcare information on the web. in Smart Digital Futures 2014. Frontiers in Artificial Intelligence and Applications, vol. 262, IOS Press, pp. 237-247. https://doi.org/10.3233/978-1-61499-405-3-237
Hirokawa S, Ishita E. Non-topical classification of healthcare information on the web. In Smart Digital Futures 2014. IOS Press. 2014. p. 237-247. (Frontiers in Artificial Intelligence and Applications). https://doi.org/10.3233/978-1-61499-405-3-237
Hirokawa, Sachio ; Ishita, Emi. / Non-topical classification of healthcare information on the web. Smart Digital Futures 2014. IOS Press, 2014. pp. 237-247 (Frontiers in Artificial Intelligence and Applications).
@inproceedings{5392de61e54e4ae4a4d1bcf0db62dc9f,
title = "Non-topical classification of healthcare information on the web",
abstract = "The present paper collected the asthma related 4,762 Web pages from 1,759 sites using 6 queries. Each site is manually categorized by the standard topics of description and information dissemination, diary and idle talk and Q&A. By careful analysis, it turned out that the pages can be classified in non-topical categories such as 'reading level', 'objectivity/subjectivity' and 'reliability'. The manually assigned labels of non-topical categories are then used as learning data to apply SVM (support machine vector). The prediction performance (F-measure) were below 50{\%} with the naive application of SVM. However, the prediction performance was improved over 50{\%} by feature selection except for reading level.",
author = "Sachio Hirokawa and Emi Ishita",
year = "2014",
month = "1",
day = "1",
doi = "10.3233/978-1-61499-405-3-237",
language = "English",
isbn = "9781614994046",
series = "Frontiers in Artificial Intelligence and Applications",
publisher = "IOS Press",
pages = "237--247",
booktitle = "Smart Digital Futures 2014",
address = "Netherlands",

}

TY - GEN

T1 - Non-topical classification of healthcare information on the web

AU - Hirokawa, Sachio

AU - Ishita, Emi

PY - 2014/1/1

Y1 - 2014/1/1

N2 - The present paper collected the asthma related 4,762 Web pages from 1,759 sites using 6 queries. Each site is manually categorized by the standard topics of description and information dissemination, diary and idle talk and Q&A. By careful analysis, it turned out that the pages can be classified in non-topical categories such as 'reading level', 'objectivity/subjectivity' and 'reliability'. The manually assigned labels of non-topical categories are then used as learning data to apply SVM (support machine vector). The prediction performance (F-measure) were below 50% with the naive application of SVM. However, the prediction performance was improved over 50% by feature selection except for reading level.

AB - The present paper collected the asthma related 4,762 Web pages from 1,759 sites using 6 queries. Each site is manually categorized by the standard topics of description and information dissemination, diary and idle talk and Q&A. By careful analysis, it turned out that the pages can be classified in non-topical categories such as 'reading level', 'objectivity/subjectivity' and 'reliability'. The manually assigned labels of non-topical categories are then used as learning data to apply SVM (support machine vector). The prediction performance (F-measure) were below 50% with the naive application of SVM. However, the prediction performance was improved over 50% by feature selection except for reading level.

UR - http://www.scopus.com/inward/record.url?scp=84902313053&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84902313053&partnerID=8YFLogxK

U2 - 10.3233/978-1-61499-405-3-237

DO - 10.3233/978-1-61499-405-3-237

M3 - Conference contribution

AN - SCOPUS:84902313053

SN - 9781614994046

T3 - Frontiers in Artificial Intelligence and Applications

SP - 237

EP - 247

BT - Smart Digital Futures 2014

PB - IOS Press

ER -