TY - JOUR
T1 - Estimation of danger signs in regional complaint data
AU - Lin, Yao
AU - Mine, Tsunenori
AU - Yamaguchi, Kohei
AU - Hirokawa, Sachio
N1 - Funding Information:
This work was partially supported by JSPS KAKENHI Grant No. JP15H05708, JP16H02926, JP17H01843 and JP18K18656.
Publisher Copyright:
© 2018, Politeknik Negeri Padang. All rights reserved.
PY - 2018
Y1 - 2018
N2 - Government 2.0 activities have become very attractive and popular. Using the platforms to support the activities, anyone can anytime report issues in a city on the Web and share the reports with other people. Since a variety of reports are posted, officials in the city management section have to give priorities to the reports. However, it is not easy task for the officials to judge the importance of the reports because importance judgments vary depending on the officials, and consequently the agreement rate becomes low. To remedy the low agreement rate problem of human judgment, it is necessary to create an intelligent agent which supports finding reports with high priorities. Hirokawa et al. employed the Support Vector Machine (SVM) with a word Feature Selection method (SVM+FS) to detect signs of danger from posted reports because the signs of danger is one of high priority issues to be dealt with. However they did not compare the SVM+FS method with other conventional machine learning methods and it is not clear if the SVM+FS method has better performance than the other methods. This paper explores methods for detecting the signs of danger through comprehensive experiments to develop an intelligent agent which supports officials in the city management sections. We explores conventional machine learning methods: SVM, Random Forest, Naïve Bayse using conventional word vectors, an LDA-based document vector, and word embedding by Word2Vec and compared the best method with SVM+FS. Experimental results illustrate the superiority of SVM+FS and invoke the importance of using multiple data sets when evaluating the methods of detecting signs of danger.
AB - Government 2.0 activities have become very attractive and popular. Using the platforms to support the activities, anyone can anytime report issues in a city on the Web and share the reports with other people. Since a variety of reports are posted, officials in the city management section have to give priorities to the reports. However, it is not easy task for the officials to judge the importance of the reports because importance judgments vary depending on the officials, and consequently the agreement rate becomes low. To remedy the low agreement rate problem of human judgment, it is necessary to create an intelligent agent which supports finding reports with high priorities. Hirokawa et al. employed the Support Vector Machine (SVM) with a word Feature Selection method (SVM+FS) to detect signs of danger from posted reports because the signs of danger is one of high priority issues to be dealt with. However they did not compare the SVM+FS method with other conventional machine learning methods and it is not clear if the SVM+FS method has better performance than the other methods. This paper explores methods for detecting the signs of danger through comprehensive experiments to develop an intelligent agent which supports officials in the city management sections. We explores conventional machine learning methods: SVM, Random Forest, Naïve Bayse using conventional word vectors, an LDA-based document vector, and word embedding by Word2Vec and compared the best method with SVM+FS. Experimental results illustrate the superiority of SVM+FS and invoke the importance of using multiple data sets when evaluating the methods of detecting signs of danger.
UR - http://www.scopus.com/inward/record.url?scp=85097128924&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85097128924&partnerID=8YFLogxK
U2 - 10.30630/joiv.2.4-2.177
DO - 10.30630/joiv.2.4-2.177
M3 - Article
AN - SCOPUS:85097128924
SN - 2549-9904
VL - 2
SP - 342
EP - 348
JO - International Journal on Informatics Visualization
JF - International Journal on Informatics Visualization
IS - 4-2
ER -