Generating Artificial Data to Evaluate Machine Learning Predictive Algorithms for Bus Travel Time

Authors

  • Leandro S. Ribeiro University of Brasília (UNB)
  • Thiago P. Faleiros University of Brasília (UNB)
  • Maristela Holanda University of Brasília (UNB)

DOI:

https://doi.org/10.5753/jidm.2019.1630

Keywords:

machine learning, traffic simulation, bus travel time prediction

Abstract

This paper proposes a simulator called Simulator for Bus Information Analysis (S-BIA). The proposed simulator is capable of quickly generating a large amount of data that may be used to train bus travel time predictive algorithms in an urban transport network. To validate the proposal, a case study was carried out on a bus line in the city of Brasília/DF, Brazil. In the case study, S-BIA generated data for several scenarios that differ in distinct levels of variability and these data were used to evaluate the performance of machine learning predictors in each of the scenarios. Moreover, we provided experimental evaluation of several machine learning algorithms and we compared their performance to our proposed method based in Linear Combination of predictors. The mean absolute error was adopted in these experiments to evaluate the quality of the predictor’s results, and our proposed linear combiner approach was able to improve the performance of the prediction in both real and artificial data.

Downloads

Download data is not yet available.

References

Adacher, L. and Tiriolo, M. A macroscopic model with the advantages of microscopic model: A review of cell transmission model’s extensions for urban traffic networks. Simulation Modelling Practice and Theory, 2018.

Aha, D. W., Kibler, D., and Albert, M. K. Instance-based learning algorithms. Machine learning 6 (1): 37–66, 1991.

Barceló, J. et al. Fundamentals of traffic simulation. Vol. 145. Springer, 2010.

Bin, Y., Zhongzhen, Y., and Baozhen, Y. Bus arrival time prediction using support vector machines. Journal of Intelligent Transportation Systems 10 (4): 151–158, 2006.

Cortes, C. and Vapnik, V. Support-vector networks. Machine learning 20 (3): 273–297, 1995.

de Paulo Faleiros, T., Rossi, R. G., and de Andrade Lopes, A. Optimizing the class information divergence for transductive classification of texts using propagation in bipartite graphs. Pattern Recognition Letters vol. 87, pp. 127–138, 2017.

Demšar, J. Statistical comparisons of classifiers over multiple data sets. Journal of Machine learning research 7 (Jan): 1–30, 2006.

dos Santos Ribeiro, L., de Paulo Faleiros, T., and Terto de Holanda, M. Generating artificial data for bus travel time predictions. Proceedings XIX GEOINFO, 2018.

Friedman, J. Greedy function approximation: A gradient boosting machine [link], 1999.

García, S., Fernández, A., Luengo, J., and Herrera, F. Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental analysis of power. Information Sciences 180 (10): 2044–2064, 2010.

Hamamoto, Y., Uchimura, S., and Tomita, S. A bootstrap technique for nearest neighbor classifier design. vol. 19, pp. 73 – 79, 02, 1997.

Hassanat, A. B., Abbadi, M. A., Altarawneh, G. A., and Alhasanat, A. A. Solving the problem of the k parameter in the knn classifier using an ensemble learning approach. CoRR vol. abs/1409.0919, 2014.

Helbing, D., Hennecke, A., Shvetsov, V., and Treiber, M. Micro-and macro-simulation of freeway traffic. Mathematical and computer modelling 35 (5-6): 517–547, 2002.

Ho, T. K. Random decision forests. In Document analysis and recognition, 1995., proceedings of the third international conference on. Vol. 1. IEEE, pp. 278–282, 1995.

Kormáksson, M., Barbosa, L., Vieira, M. R., and Zadrozny, B. Bus travel time predictions using additive models. In Data Mining (ICDM), 2014 IEEE International Conference on. IEEE, pp. 875–880, 2014.

Kuncheva, L. I. and Whitaker, C. J. Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy. Machine learning 51 (2): 181–207, 2003.

Lee, W.-H., Tseng, S.-S., and Tsai, S.-H. A knowledge based real-time travel time prediction system for urban network. Expert Systems with Applications 36 (3): 4239–4247, 2009.

Lima, A. M. and Campos, J. How reliable is the traffic information gathered from web map services? In GEOINFO. pp. 234–245, 2017.

Mendes-Moreira, J., Jorge, A. M., de Sousa, J. F., and Soares, C. Improving the accuracy of long-term travel time prediction using heterogeneous ensembles. Neurocomputing vol. 150, pp. 428–439, 2015.

Monteiro, C. M., Martins, F. V. C., and Junior, C. A. D. Optimization of new pick-up and drop-off points for public transportation, 2017.

Opitz, D. and Maclin, R. Popular ensemble methods: An empirical study. Journal of artificial intelligence research vol. 11, pp. 169–198, 1999.

Polikar, R. Ensemble based systems in decision making. IEEE Circuits and systems magazine 6 (3): 21–45, 2006.

Reddy, K. K., Kumar, B. A., and Vanajakshi, L. Bus travel time prediction under high variability conditions. Current Science (00113891) 111 (4), 2016.

Rokach, L. Ensemble-based classifiers. Artificial Intelligence Review 33 (1-2): 1–39, 2010.

Shalev-Shwartz, S. and Ben-David, S. Understanding machine learning: From theory to algorithms. Cambridge university press, 2014.

Sollich, P. and Krogh, A. Learning with ensembles: How overfitting can be useful. In Advances in neural information processing systems. pp. 190–196, 1996.

Sommer, C., German, R., and Dressler, F. Bidirectionally coupled network and road traffic simulation for improved ivc analysis. IEEE Transactions on Mobile Computing 10 (1): 3–15, 2011.

Trawiński, B., Smętek, M., Telec, Z., and Lasota, T. Nonparametric statistical analysis for multiple comparison of machine learning regression algorithms. International Journal of Applied Mathematics and Computer Science 22 (4): 867–881, 2012.

Vanajakshi, L. and Rilett, L. R. A comparison of the performance of artificial neural networks and support vector machines for the prediction of traffic speed. In Intelligent Vehicles Symposium, 2004 IEEE. IEEE, pp. 194–199, 2004.

Vanajakshi, L. and Rilett, L. R. Support vector machine technique for the short term prediction of travel time. In Intelligent Vehicles Symposium, 2007 IEEE. IEEE, pp. 600–605, 2007.

Wen, X. A work zone simulation model for travel time prediction in a connected vehicle environment. arXiv preprint arXiv:1801.07579 , 2018.

Wu, C.-H., Su, D.-C., Chang, J., Wei, C.-C., Ho, J.-M., Lin, K.-J., Lee, D., et al. An advanced traveler information system with emerging network technologies. In Proc. 6th Asia-Pacific Conf. Intelligent Transportation Systems Forum. pp. 230–231, 2003.

Yu, B., Yang, Z. Z., and Wang, J. Bus travel-time prediction based on bus speed. In Proceedings of the Institution of Civil Engineers-Transport. Vol. 163. Thomas Telford Ltd, pp. 3–7, 2010.

Downloads

Published

2019-06-20

How to Cite

Ribeiro, L. S., Faleiros, T. P., & Holanda, M. (2019). Generating Artificial Data to Evaluate Machine Learning Predictive Algorithms for Bus Travel Time. Journal of Information and Data Management, 10(1), 49–64. https://doi.org/10.5753/jidm.2019.1630

Issue

Section

GEOINFO 2018