TY - GEN
T1 - Prediction of Bus Delay over Intervals on Various Kinds of Routes Using Bus Probe Data
AU - Yamaguchi, Tsubasa
AU - As, Mansur
AU - Mine, Tsunenori
N1 - Funding Information:
The probe data used in this study were provided by NISHITETSU Bus Company, Fukuoka, Japan. This work was partially supported by JSPS KAKENHI Grant Number JP15H05708.
PY - 2019/1/9
Y1 - 2019/1/9
N2 - Prediction of bus travel time and/or delay time is a useful tool for passengers who want to plan their journey, e.g., when they should leave from the origin bus stop, what they will do after arriving at the destination bus stop, and so on. Many studies have tackled this task using probe data and/or the real time data provided by automatic vehicle location (AVL) systems. Most of them only targeted a small number of routes, short time periods, e.g. less than one week, and used few machine learning models to evaluate their methods. However, different routes generally show different characteristics. In fact, there are big differences between urban routes and rural routes. Furthermore, the performance of machine learning models also varies according to the data dealt with by the models. In this paper, we propose prediction models for bus delay over all intervals between pairs of adjacent bus stops. To build the models, we use one month of bus probe data, which includes more than 80 routes, and apply several machine learning models: linear regression (LR), artificial neural network (ANN), support vector regression (SVR), random forest (RF), and gradient boosting decision tree (GBDT). Experimental results demonstrate the superiority of the GBDT-based prediction model and the effects of considering travel time over prior intervals.
AB - Prediction of bus travel time and/or delay time is a useful tool for passengers who want to plan their journey, e.g., when they should leave from the origin bus stop, what they will do after arriving at the destination bus stop, and so on. Many studies have tackled this task using probe data and/or the real time data provided by automatic vehicle location (AVL) systems. Most of them only targeted a small number of routes, short time periods, e.g. less than one week, and used few machine learning models to evaluate their methods. However, different routes generally show different characteristics. In fact, there are big differences between urban routes and rural routes. Furthermore, the performance of machine learning models also varies according to the data dealt with by the models. In this paper, we propose prediction models for bus delay over all intervals between pairs of adjacent bus stops. To build the models, we use one month of bus probe data, which includes more than 80 routes, and apply several machine learning models: linear regression (LR), artificial neural network (ANN), support vector regression (SVR), random forest (RF), and gradient boosting decision tree (GBDT). Experimental results demonstrate the superiority of the GBDT-based prediction model and the effects of considering travel time over prior intervals.
UR - http://www.scopus.com/inward/record.url?scp=85061786597&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85061786597&partnerID=8YFLogxK
U2 - 10.1109/BDCAT.2018.00020
DO - 10.1109/BDCAT.2018.00020
M3 - Conference contribution
AN - SCOPUS:85061786597
T3 - Proceedings - 5th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, BDCAT 2018
SP - 97
EP - 106
BT - Proceedings - 5th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, BDCAT 2018
A2 - Spillner, Josef
A2 - Sill, Alan
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 5th IEEE/ACM International Conference on Big Data Computing, Applications and Technologies, BDCAT 2018
Y2 - 17 December 2018 through 20 December 2018
ER -