Predicting Operating Train Delays into New York City using Random Forest Regression and XGBoost Regression Models

Authors

  • Thomas Wiese

Keywords:

Random Forest Regression Model, XGBoost Regression Model, Machine Learning, Operations Management, Management, Business Analytics, Analytics, Industrial Internet, Industrial Internet of Things, Trains, Train Delays, Decision Tree

Abstract

The Long Island Railroad operates one of the largest commuter rail networks in the U.S. [1]. This study uses data that include the location and arrival time of trains, derived from onboard GPS position and other internal sources. The paper analyzes train GPS positions to gain insight into potential gaps in on-time performance and train operations. To do so, it develops a Random Forest Regression model [2] and an XGBoost Regression model [3]. Both models prove useful for predicting train delays and can help railroads prepare and adjust their operations.

Published

2023-03-06

How to Cite

Wiese, T. (2023). Predicting Operating Train Delays into New York City using Random Forest Regression and XGBoost Regression Models. International Journal of Engineering, Business and Management, 7(1). https://journal-repository.com/index.php/ijebm/article/view/6069