Delays in flights and other airline operations have significant consequences in quality of service, operational costs, and customer satisfaction. Therefore, it is important to predict the occurrence of delays and take necessary actions accordingly. In this study, we addressed the flight delay prediction problem from a supervised machine learning perspective. Using a real-world airline operations dataset provided by a leading airline company, we identified optimum dataset features for optimum prediction accuracy. In addition, we trained and tested 11 machine learning models on the datasets that we created from the original dataset via feature selection and transformation. CART and KNN showed consistently good performance in almost all cases achieving 0.816 and 0.807 F-Scores respectively. Similarly, GBM, XGB, and LGBM showed very good performance in most of the cases, achieving F-Scores around 0.810.
Research and Development Center of TAV Airports Holding
Funding for this work was partially supported by the Research and Development Center of TAV Airports Holding accredited on Turkey - Ministry of Science.
Primary Language | English |
---|---|
Subjects | Artificial Intelligence |
Journal Section | Research Articles |
Authors | |
Publication Date | December 1, 2020 |
Submission Date | March 27, 2020 |
Acceptance Date | September 11, 2020 |
Published in Issue | Year 2020 |
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.