Spatiotemporal analysis and machine learning-based prediction of air quality in Indian urban cities
Abstract
Air pollution, and in particular fine particulate matter (PM2.5, particles with a diameter of less than 2.5 micrometers), poses a critical threat to public health in urban India, with Delhi presenting the most acute challenge. This study predicts PM2.5 concentrations with machine learning models trained on data from 2010 to 2023 and assesses model fit using the R², RMSE, MAE, and MAPE metrics. Five models are tested: Random Forest, Gradient Boosting, AdaBoost, Histogram-Based Gradient Boosting, and XGBoost. Random Forest fits the training set almost perfectly (R² = 0.99) but shows the highest degree of overfitting, with an R² of 0.35 on the test set. Gradient Boosting gives a more balanced result, with R² values of 0.54 and 0.48 on the training and test sets, respectively, and lower errors (RMSE: 56.46, MAE: 39.60, MAPE: 0.50), which makes it the more reliable predictor. AdaBoost performs worst, with an R² of 0.28 on the test set and the highest errors (RMSE: 66.86, MAE: 52.34, MAPE: 0.94). Histogram-Based Gradient Boosting and XGBoost achieve moderate accuracy, but Gradient Boosting remains slightly better than both in terms of RMSE and MAE. Gradient Boosting is therefore the best of the tested models in terms of both generalization and accuracy for predicting PM2.5 concentrations. These results can help policymakers adopt machine learning-based air quality forecasting for better environmental management and the protection of public health.
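The four metrics named in the abstract, and the train/test gap used to judge overfitting, can be sketched in a few lines of plain Python. This is only an illustration and not the authors' code: the function names and the `observed`/`predicted` values are hypothetical, and MAPE is assumed to be expressed as a fraction because the reported values (0.50, 0.94) are on that scale. The R² pairs are the ones reported in the abstract.

```python
import math


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def rmse(y_true, y_pred):
    """Root mean squared error, in the units of the target."""
    squared = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    return math.sqrt(squared / len(y_true))


def mae(y_true, y_pred):
    """Mean absolute error, in the units of the target."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def mape(y_true, y_pred):
    """Mean absolute percentage error as a fraction (assumed scale).

    Undefined when an observed value is zero.
    """
    return sum(abs((t - p) / t) for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical observed and predicted PM2.5 values, for illustration only.
observed = [100.0, 200.0, 50.0, 150.0]
predicted = [110.0, 180.0, 60.0, 150.0]

print("R2  :", round(r2_score(observed, predicted), 3))
print("RMSE:", round(rmse(observed, predicted), 3))
print("MAE :", round(mae(observed, predicted), 3))
print("MAPE:", round(mape(observed, predicted), 3))

# (train R2, test R2) pairs as reported in the abstract.
reported_r2 = {
    "Random Forest": (0.99, 0.35),
    "Gradient Boosting": (0.54, 0.48),
}

# A large train-minus-test gap signals overfitting.
gaps = {name: round(train - test, 2) for name, (train, test) in reported_r2.items()}
for name, gap in gaps.items():
    print(f"{name}: train-test R2 gap = {gap}")
```

On the reported figures the gap is 0.64 for Random Forest and 0.06 for Gradient Boosting, which is the sense in which the latter generalizes better despite its lower training score.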
Details
Primary Language
English
Subjects
Air Pollution Processes and Air Quality Measurement
Journal Section
Research Article
Early Pub Date
November 18, 2025
Publication Date
December 31, 2025
Submission Date
November 18, 2024
Acceptance Date
December 12, 2024
Published in Issue
Year 2025 Volume: 8 Number: 4
APA
Singh, S. K., Jain, R., Palaniappan, D., Parmar, K., T, P., & Gothania, J. (2025). Spatiotemporal analysis and machine learning-based prediction of air quality in Indian urban cities. Environmental Research and Technology, 8(4), 809-822. https://doi.org/10.35208/ert.1587308
Chicago
Singh, Sitesh Kumar, Rituraj Jain, Damodharan Palaniappan, Kumar Parmar, Premavathi T, and Jaishri Gothania. 2025. “Spatiotemporal Analysis and Machine Learning-Based Prediction of Air Quality in Indian Urban Cities”. Environmental Research and Technology 8 (4): 809-22. https://doi.org/10.35208/ert.1587308.