×

Hybrid machine learning and geographic information systems approach – a case for grade crossing crash data analysis. (English) Zbl 1462.62750

Summary: Highway-rail grade crossing (HRGC) accidents continue to be a major source of transportation casualties in the United States. This can be attributed to increased road and rail operations and/or lack of adequate safety programs based on comprehensive HRGC accidents analysis amidst other reasons. The focus of this study is to predict HRGC accidents in a given rail network based on a machine learning analysis of a similar network with cognate attributes. This study is an improvement on past studies that either attempt to predict accidents in a given HRGC or spatially analyze HRGC accidents for a particular rail line. In this study, a case for a hybrid machine learning and geographic information systems (GIS) approach is presented in a large rail network. The study involves collection and wrangling of relevant data from various sources; exploratory analysis, and supervised machine learning (classification and regression) of HRGC data from 2008 to 2017 in California. The models developed from this analysis were used to make binary predictions [98.9% accuracy & 0.9838 Receiver Operating Characteristic (ROC) score] and quantitative estimations of HRGC casualties in a similar network over the next 10 years. While results are spatially presented in GIS, this novel hybrid application of machine learning and GIS in HRGC accidents’ analysis will help stakeholders to pro-actively engage with casualties through addressing major accident causes as identified in this study. This paper is concluded with a Systems-Action-Management (SAM) approach based on text analysis of HRGC accident risk reports from Federal Railroad Administration.

MSC:

62P30 Applications of statistics in engineering and industry; control charts
62R07 Statistical aspects of big data and data science
68T05 Learning and adaptive systems in artificial intelligence
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Ananth, C. V. and Kleinbaum, D. G. (1997). Regression models for ordinal responses: A review of methods and applications and Kleinbaum D G. regression models for ordinal responses: A review of methods and applications. Technical Report 6. Available at: https://faculty.washington.edu/heagerty/Courses/b571/homework/Ananth-Kleinbaum-1997.pdf.
[2] ArcMap, E. (2018). ArcMap — ArcGIS Desktop. Available at: http://desktop.arcgis.com/en/arcmap/.
[3] Attoh-Okine (2017). Big Data and Differential Privacy. Wiley Series in Operations Research and Management Science. · Zbl 1073.62070
[4] Austin, R. D. and Carson, J. L. (2002). An alternative accident prediction model for highway-rail interfaces. Technical Report. Available at: www.elsevier.com/locate/aap.
[5] Aven, T. (2015). Risk assessment and risk management: Review of recent advances on their foundation, Eur. J. Operat. Res.253: 1-13. Available at: http://dx.doi.org/ 10.1016/j.ejor.2015.12.023. · Zbl 1346.91102
[6] Caltrans (2019). Caltrans GIS Data Library. Available at: http://www.dot.ca.gov/hq/tsip/gis/datalibrary/.
[7] Erdogan, S., Yilmaz, I., Baybura, T. and Gullu, M. (2008). Geographical information systems aided traffic accident analysis system case study: City of Afyonkarahisar, Accid. Anal. Prev.40: 174-181. https://doi.org/10.1016/j.aap.2007.05.004
[8] Faghri, A. and Demetsky, M. J. (1980). A comparison of formulae for predicting rail-highway crossing Hazards. Transport. Res. Rec. J. Transp. Res. Board, 1114. Available at: http://onlinepubs.trb.org/Onlinepubs/trr/1987/1114/1114-016.pdf.
[9] FRA (2018a). 3.11 — Accident detail report. Technical Report. Available at: https://safetydata.fra.dot.gov/OfficeofSafety/publicsite/Query/incrpt.aspx.
[10] FRA (2018b). 5.14 — Hwy rail accident incident summary by railroad. Technical Report. Federal Railroad Administration. Available at: https://safetydata.fra.dot.gov/OfficeofSafety/publicsite/Query/HwyRailAccidentSummaryByRR.aspx.
[11] FRA (2018c). Annual web accident prediction system. Technical Report. Federal Railroad Administration Highway-Rail Crossing Safety & Trespass Prevention.
[12] FRA, S. (2019). 3.10 — Accident causes, train accidents by cause from FRA F 6180.54. Available at: https://safetydata.fra.dot.gov/OfficeofSafety/publicsite/Query/inccaus.aspx.
[13] Ghomi, H., Bagheri, M., Fu, L. and Miranda-Moreno, L. F. (2016). Traffic Injury Prevention Analyzing injury severity factors at highway railway grade crossing accidents involving vulnerable road users: A comparative study. Traffic Inj. Prev., 17: 833-441. Available at: https://www.tandfonline.com/action/journalInformation?journalCode=gcpi20. https://doi.org/10.1080/15389588.2016.1151011
[14] Gilles, L. (2014). Understanding random forests from theory to practice. Ph.D. Thesis, University of Liege. Available at: https://arxiv.org/pdf/1407.7502.pdf, arXiv:1407.7502v3.
[15] Hao, W., Kamga, C. and Wan, D. (2016). The effect of time of day on driver’s injury severity at highway-rail grade crossings in the United States. J. Traffic Transp. Eng.(Engl. Ed.), 3: 37-50. Available at: http://dx.doi.org/10.1016/j.jtte.2015.10.006. https://doi.org/10.1016/j.jtte.2015.10
[16] Hastie, T., Tibshirani, R. and Friedman, J. (2009). The elements of statistical learning. Vol. 1. Springer. Available at: http://www.springerlink.com/index/10.1007/b94608, arXiv:1010.3003. · Zbl 1273.62005
[17] James, G., Witten, D., Hastie, T. and Tibshirani, R. (2013). An Introduction to Statistical Learning, 1st edn. Springer Texts in Statistics. Available at: http://www-bcf.usc.edu/{ }gareth/ISL/ISLR First Printing.pdf. https://doi.org/10.1007/978-1-4614-7138-7, arXiv:1011.1669v3. · Zbl 1281.62147
[18] Khan, I. U. and Lee, E. (2018). Developing a highway rail grade crossing accident probability prediction model: A north dakota case study. MDPI Open AccessJ. Safety, 1-12. https://doi.org/10.3390/safety4020022
[19] Lasisi, A. and Attoh-Okine, N. (2018). Principal components analysis and track quality index: A machine learning approach. Transport. Res. Part C, Emerg. Technol., 91: 230-248. Available at: https://www.sciencedirect.com/science/article/pii/S0968090X18304303. https://doi.org/10.1016/J.TRC.2018.04.001
[20] Lauren, B. (2017). Machine Learning in ArcGIS. Available at: https://www.esri.com/arcgis-blog/products/arcgis-pro/analytics/machine-learning-in-arcgis/.
[21] Lord, D. and Mannering, F. (2010). The statistical analysis of crash-frequency data: A review and assessment of methodological alternatives. Transport. Res. Part A. Available at: https://ac.els-cdn. com/S0965856410000376/1-s2.0-S0965856410000376-main.pdf?_tid=6bb8c0b9-b68c-4a2b-bd8d-e21f9d2cedb8&acdnat=1549919900_2d69 ff7617f08ffaa19bb2c32aac5993, doi:10.1016/j.tra.2010.02.001.
[22] Lu, P. and Tolliver, D. (2016). Accident prediction model for public highway-rail grade crossings. Accid. Anal. Prev.90: 73-81. Available at: http://dx.doi.org/10.1016/j.aap.2016.02.012. https://doi.org/10.1016/j.aap.2016.02.012
[23] Lu, P. and Zheng, Z. (2017). Accident Prediction for Highway-Rail Grade Crossings: A Model Comparison of Decision Tree and Neural Network.
[24] Marsland, S. (2015). Machine Learning: An Algorithmic Perspective.2nd edn. Chapman & Hall/CRC, CRC Press, Boca Raton.
[25] Mennecke, B. E. and Crossland, M. D. (1996). Geographic Information Systems: Applications and Research Opportunities for Information Systems Researchers. Proc. 29th Annual Hawaii Int. Conf. System Sciences, Waitea, HI, USA, .
[26] Miaou, S.-P., Song, J. J. and Song, J. J. (2005). Bayesian ranking of sites for engineering safety improvements: Decision parameter, treatability concept, statistical criterion, and spatial dependence. Accid. Anal. Prev., 37: 699-720. Available at: https://ac.els-cdn.com/S0001457505000497/1-s2.0-S0001457505000497-main.pdf?_tid=0f6b2960-e04a-434b-af0b-545b81ee2322&acdnat=1549661509_3e64e28a8fcf71266cb8718529b77bd6. https://doi.org/10.1016/j.aap.2005.03.012
[27] Ogden, B. D. (2007). Railroad Highway Grade Crossing Handbook.Transportation US, Department of Federal Highway Administration.
[28] Oh, J., Washington, S. P. and Nam, D. (2006). Accident prediction model for railway-highway interfaces. Accid. Anal. Prev., 38: 346-356. https://doi.org/10.1016/j.aap.2005.10.004
[29] Panchanathan, S. and Faghri, A. (1995). Artificial Intelligence and Geographical Information. Transport. Res. Rec. J. Transp. Res. Board, 1114. Available at: http://onlinepubs.trb.org/Onlinepubs/trr/1995/1497/1497-012.pdf.
[30] Pate-Cornell, M. E. (1990). Organizational Aspects of Engineering Systems Safety: The case of Offshore Platforms. J. Risk Anal.
[31] Peabody, L. and Dimmick, T. (1941). Accident Hazards at Grade Crossings. Public Roads, 22: 12-130.
[32] Perrot, É. D. F. P. G. V. A. (2011). Scikit-learn: Machine Learning in Python. Available at: http://www.jmlr.org/papers/volume12/pedregosa11a/pedregosa11a.pdf. · Zbl 1280.68189
[33] Raub, R. A. (2007). Examination of Highway-Rail Grade Crossing Collisions Nationally from 1998 to 2007. Transport. Res. Rec. J. Transport. Res. Board, 63-71. https://doi.org/10.3141/2122-08
[34] Rohit Singh (2018). How we did it: Integrating ArcGIS and deep learning at UC 2018. Available at: https://www.esri.com/arcgis-blog/products/api-python/ analytics/how-we-did-it-integrating-arcgis-and-machine-learning-at-uc-2018/.
[35] Sellers, K. F., Swift, A. W. and Weems, K. S. (2017). A flexible distribution class for count data. J. Stat. Distrib. Appl., 4: 22. Available at: https://jsdajournal.springeropen.com/track/pdf/10.1186/s40488-017-0077-0. https://doi.org/10.1186/s40488-017-0077-0 · Zbl 1386.60063
[36] Skinner, R. E., Barry, T. F. and Berry, B. J. (1997). Transportation Reseach Board Executive Committee 1999 Officers, National Cooperative Highway Research Program. Technical Report. Available at: http://onlinepubs.trb.org/onlinepubs/nchrp/nchrp_syn_271.pdf.
[37] Urbanowicz, R. J. and Moore, J. H. (2015). ExSTraCS 2.0: Description and evaluation of a scalable learning classifier system. Evol. Intell., 8: 89-116. Available at: http://link.springer.com/10.1007/s12065-015-0128-8. https://doi.org/10.1007/s12065-015-0128-8
[38] USDOT (2018a). Direct Download of National Transportation Atlas Database (NTAD) Geospatial Files — Bureau of Transportation Statistics. Available at: https://www.bts.gov/geography/geospatial-portal/NTAD-direct-download.
[39] USDOT (2018b). Safety and Health — US Department of Transportation. Available at: https://www.transportation.gov/policy/transportation-policy/safety.
[40] Wright, R. (2016). Integration of Grade Crossing Data into FRA’s GIS Program 2016 ESRI Rail Summit. Technical Report. Available at: https://www.esri.com/events/rail-summit/{ }/media/B9A885BB7CBA4DD0A7C607E5EA384D71.ashx.
[41] Yan, X., Richards, S. and Su, X. (2010). Using hierarchical tree-based regression model to predict train-vehicle crashes at passive highway-rail grade crossings. Accid. Anal. Prev., 42: 64-74. Available at: https://ac.els-cdn.com/S0001457509001687/1-s2.0-S0001457509001687-main.pdf?_tid=6e062379-14c7-492d-883a-bb9e8fc0161f&acdnat=1550079562_6da1f2263e88f713453e4fbf769f3bfb. https://doi.org/10.1016/j.aap.2009.07.003
[42] Zhang, Z., Liu, X. and Holt, K. (2018). Positive Train Control (PTC) for railway safety in the United States: Policy developments and critical issues. Util. Policy, 51: 33-40. Available at: https://doi.org/10.1016/j.jup.2018.03.002. https://doi.org/10.1016/j.jup.2018.03.002
[43] Zhao, S. and Khattak, A. (2015). Motor vehicle drivers’ injuries in train-motor vehicle crashes. Accid. Anal. Prev., 74: 162-168. Available at: http://dx.doi.org/10.1016/j.aap.2014.10.022. https://doi.org/10.1016/j.aap.2014.10.022
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.