Time-based sampling methods for detecting helpful reviews

Ristu Saptono, Tsunenori Mine

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Product reviews describe customer opinions and experiences to products. Better opinions and experiences in the reviews more attract and help people who want to buy the products. The reviews, including such factors, are called helpful reviews. Many studies have been conducted to detect helpful reviews and proposed many useful factors, such as review-related factors, product-related factors, and reviewer-related factors. Meanwhile, the elapsed time of reviews has been used as a factor in detecting helpful reviews but never considered as sampling methods, despite that it is an essential factor to determine the freshness of the reviews, which influence the people being interested in the product. In this paper, we propose time-based sampling methods, which determine the sample size as small as possible in detecting helpful reviews with high accuracy. To investigate the effect of the time-based sampling methods in detecting helpful reviews, we conducted extensive experiments comparing with total sampling and simple random sampling, using two machine learning methods: XGBoost and CNN which involve text and numerical factors. Experimental results illustrate the validity of the proposed methods. Significantly, in large datasets, our proposed sampling methods outperform the other sampling methods.

Original languageEnglish
Title of host publicationProceedings - 2020 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2020
EditorsJing He, Hemant Purohit, Guangyan Huang, Xiaoying Gao, Ke Deng
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages508-513
Number of pages6
ISBN (Electronic)9781665419246
DOIs
Publication statusPublished - Dec 2020
Event2020 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2020 - Virtual, Online
Duration: Dec 14 2020Dec 17 2020

Publication series

NameProceedings - 2020 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2020

Conference

Conference2020 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2020
CityVirtual, Online
Period12/14/2012/17/20

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Computer Networks and Communications
  • Software

Fingerprint

Dive into the research topics of 'Time-based sampling methods for detecting helpful reviews'. Together they form a unique fingerprint.

Cite this