Extracting the author of web pages

Yoshikiyo Kato, Daisuke Kawahara, Kentaro Inui, Sadao Kurohashi, Tomohide Shibata

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

In this paper, we define the problem of identifying the author of a Web page as a sub-problem of identifying the information sender configuration of a Web page. We propose a method that extracts the author name candidates from a Web page based on linguistic features, and rank the candidates based on local features such as distance from the main content. The evaluation shows that we can achieve more than 75% precision when evaluated with candidates ranked within top five.

Original languageEnglish
Title of host publicationProceedings of the 2nd ACM Workshop on Information Credibility on the Web, WICOW'08, Co-located with the 17th ACM Conference on Information and Knowledge Management, CIKM'08
Pages35-41
Number of pages7
DOIs
Publication statusPublished - Dec 1 2008
Event2nd ACM Workshop on Information Credibility on the Web, WICOW'08, Co-located with the 17th ACM Conference on Information and Knowledge Management, CIKM'08 - Napa Valley, CA, United States
Duration: Oct 26 2008Oct 30 2008

Publication series

NameInternational Conference on Information and Knowledge Management, Proceedings

Other

Other2nd ACM Workshop on Information Credibility on the Web, WICOW'08, Co-located with the 17th ACM Conference on Information and Knowledge Management, CIKM'08
CountryUnited States
CityNapa Valley, CA
Period10/26/0810/30/08

All Science Journal Classification (ASJC) codes

  • Decision Sciences(all)
  • Business, Management and Accounting(all)

Fingerprint Dive into the research topics of 'Extracting the author of web pages'. Together they form a unique fingerprint.

  • Cite this

    Kato, Y., Kawahara, D., Inui, K., Kurohashi, S., & Shibata, T. (2008). Extracting the author of web pages. In Proceedings of the 2nd ACM Workshop on Information Credibility on the Web, WICOW'08, Co-located with the 17th ACM Conference on Information and Knowledge Management, CIKM'08 (pp. 35-41). (International Conference on Information and Knowledge Management, Proceedings). https://doi.org/10.1145/1458527.1458537