TY - GEN
T1 - Best Approximate Distribution-based Model for Helpful Vote of Customer Review Prediction
AU - Saptono, Ristu
AU - Mine, Tsunenori
N1 - Funding Information:
This work was supported in part by JSPS KAKENHI Grant Numbers: JP21H00907, JP21K11847, JP20H01728, JP20H04300, JP19KK0257 and Universitas Sebelas Maret PDD Grant Numbers: 260/UN27.22/HK.07.00/2021, 254/UN27.22/PT.01.03/2022.
Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Product reviews are more and more important for potential customers to decide on their purchases in electronic commerce nowadays. The helpful vote is a critical indicator of how much impact the review has on other customers. Therefore, the prediction of helpful votes is an essential task. Linear and Tobit Regression are general methods of the prediction. Those methods share the same objective function and come from the initial assumption that the helpful votes on any dataset follow a normal distribution. However, the assumption is not usually confirmed, and the distribution of the helpful votes often follows other distributions. Consequently, the prediction results might not be fully appropriate. This paper proposes a model that follows the best approximate distribution of helpful votes to predict the number of helpful votes. On top of that, considering the elapsed time since reviews were written, we propose an adaptive window size sampling method to evaluate the model on review datasets sorted chronologically. To validate the proposed model, we conducted extensive experiments on real-world datasets. Experimental results illustrate the validity of the proposed model.
AB - Product reviews are more and more important for potential customers to decide on their purchases in electronic commerce nowadays. The helpful vote is a critical indicator of how much impact the review has on other customers. Therefore, the prediction of helpful votes is an essential task. Linear and Tobit Regression are general methods of the prediction. Those methods share the same objective function and come from the initial assumption that the helpful votes on any dataset follow a normal distribution. However, the assumption is not usually confirmed, and the distribution of the helpful votes often follows other distributions. Consequently, the prediction results might not be fully appropriate. This paper proposes a model that follows the best approximate distribution of helpful votes to predict the number of helpful votes. On top of that, considering the elapsed time since reviews were written, we propose an adaptive window size sampling method to evaluate the model on review datasets sorted chronologically. To validate the proposed model, we conducted extensive experiments on real-world datasets. Experimental results illustrate the validity of the proposed model.
UR - http://www.scopus.com/inward/record.url?scp=85142770487&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85142770487&partnerID=8YFLogxK
U2 - 10.1109/SMC53654.2022.9945190
DO - 10.1109/SMC53654.2022.9945190
M3 - Conference contribution
AN - SCOPUS:85142770487
T3 - Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
SP - 3427
EP - 3434
BT - 2022 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2022 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2022
Y2 - 9 October 2022 through 12 October 2022
ER -