Component search engine based on html path and word weight

Jun Zeng, Sachio Hirokawa

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

With the popularization of search engines, finding information has become easier than before. However, the information found by most search engines today is web page, which may contain more than one topic. It makes user spend extra time to read the irrelevant contents in order to find out the information he wants. We propose a novel search engine model called "Component Search Engine", which calculates the score of each component in a page by HTML path and word weight. The higher score the component gains, the higherranking it will appear at. The usability study determinates that the component search engine can find out the important contents efficiently.

Original languageEnglish
Pages (from-to)563-568
Number of pages6
JournalICIC Express Letters
Volume6
Issue number2
Publication statusPublished - Feb 2012

Fingerprint

Search engines
HTML
Websites

All Science Journal Classification (ASJC) codes

  • Computer Science(all)
  • Control and Systems Engineering

Cite this

Component search engine based on html path and word weight. / Zeng, Jun; Hirokawa, Sachio.

In: ICIC Express Letters, Vol. 6, No. 2, 02.2012, p. 563-568.

Research output: Contribution to journalArticle

@article{ac233b1f9d194d62bbbb0f2db8045c23,
title = "Component search engine based on html path and word weight",
abstract = "With the popularization of search engines, finding information has become easier than before. However, the information found by most search engines today is web page, which may contain more than one topic. It makes user spend extra time to read the irrelevant contents in order to find out the information he wants. We propose a novel search engine model called {"}Component Search Engine{"}, which calculates the score of each component in a page by HTML path and word weight. The higher score the component gains, the higherranking it will appear at. The usability study determinates that the component search engine can find out the important contents efficiently.",
author = "Jun Zeng and Sachio Hirokawa",
year = "2012",
month = "2",
language = "English",
volume = "6",
pages = "563--568",
journal = "ICIC Express Letters",
issn = "1881-803X",
publisher = "ICIC Express Letters Office",
number = "2",

}

TY - JOUR

T1 - Component search engine based on html path and word weight

AU - Zeng, Jun

AU - Hirokawa, Sachio

PY - 2012/2

Y1 - 2012/2

N2 - With the popularization of search engines, finding information has become easier than before. However, the information found by most search engines today is web page, which may contain more than one topic. It makes user spend extra time to read the irrelevant contents in order to find out the information he wants. We propose a novel search engine model called "Component Search Engine", which calculates the score of each component in a page by HTML path and word weight. The higher score the component gains, the higherranking it will appear at. The usability study determinates that the component search engine can find out the important contents efficiently.

AB - With the popularization of search engines, finding information has become easier than before. However, the information found by most search engines today is web page, which may contain more than one topic. It makes user spend extra time to read the irrelevant contents in order to find out the information he wants. We propose a novel search engine model called "Component Search Engine", which calculates the score of each component in a page by HTML path and word weight. The higher score the component gains, the higherranking it will appear at. The usability study determinates that the component search engine can find out the important contents efficiently.

UR - http://www.scopus.com/inward/record.url?scp=84856965437&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84856965437&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:84856965437

VL - 6

SP - 563

EP - 568

JO - ICIC Express Letters

JF - ICIC Express Letters

SN - 1881-803X

IS - 2

ER -