Research Article

Spatiotemporal analysis and machine learning-based prediction of air quality in Indian urban cities

Volume: 8 Number: 4 December 31, 2025
EN

Spatiotemporal analysis and machine learning-based prediction of air quality in Indian urban cities

Abstract

Air pollution, more specifically Particulate Matter (PM2.5 - particulate matter with diameter less than 2.5 micrometers), threatens the public health most critically in urban Indian cities, and Delhi, among them, presents the most acute challenge. This study predicts the concentrations of PM2.5 using machine learning models using data ranging from 2010 to 2023 and assessing model fit via R², RMSE, MAE, and MAPE metrics. Models tested: Random Forest, Gradient Boosting, AdaBoost, Histogram-Based Gradient Boosting, XGBoost. The Random Forest model is extremely effective for the training set (R² = 0.99) but shows the highest degree of overfitting, with R² of 0.35 for the test set. Gradient Boosting has a more balanced result, with R² 0.54 and 0.48, respectively on the training and test set, as well as fewer errors (RMSE: 56.46, MAE: 39.60, MAPE: 0.50). Hence, it is a good predictor. AdaBoost performs the worst with an R² of 0.28 on the test set and the highest errors in terms of RMSE: 66.86, MAE: 52.34, MAPE: 0.94. Histogram Gradient Boosting and XGBoost: both of these models yield an average accuracy value, but the Gradient Boosting model is still a tad better than the former ones in terms of RMSE and MAE. Thus, Gradient Boosting happens to be the most accurate model in light of generalization as well as accuracy for the prediction of the concentration of PM2.5. These results will be highly beneficial to policymakers to adopt machine learning-based air quality forecasting for better environmental management and the protection of public health.

Keywords

References

  1. H. Liu, Q. Han, H. Sun, J. Sheng, and Z. Yang, “Spatiotemporal adaptive attention graph convolution network for city-level air quality prediction,” Scientific Reports, vol. 13(1), pp. 13335, 2023, doi: 10.1038/s41598-023-39286-0.
  2. J. Duan, Y. Gong, J. Luo, and Z. Zhao, “Air-quality prediction based on the ARIMA-CNN-LSTM combination model optimized by dung beetle optimizer,” Scientific Reports, vol. 13(1), pp. 12127, 2023, doi: 10.1038/s41598-023-36620-4.
  3. D. M and R. V, “Novel Regression and Least Square Support Vector Machine Learning Technique for Air Pollution Forecasting,” International Journal of Engineering Trends and Technology, vol. 71(4), pp. 147–158, 2023, doi: 10.14445/22315381/IJETT-V71I4P214.
  4. X. Zhang, X. Jiang, and Y. Li, “Prediction of air quality index based on the SSA-BiLSTM-LightGBM model,” Scientific Reports, vol. 13(1), pp. 5550, 2023, doi: 10.1038/s41598-023-32775-2.
  5. M. Bonas and S. Castruccio, “Calibration of SpatioTemporal forecasts from citizen science urban air pollution data with sparse recurrent neural networks,” The Annals of Applied Statistics, vol. 17(3), 2023, doi: 10.1214/22-AOAS1683.
  6. R. López-Blanco, M. Chaveinte García, R. S. Alonso, J. Prieto, and J. M. Corchado, “Pollutant Time Series Analysis for Improving Air-Quality in Smart Cities,” International Journal of Interactive Multimedia and Artificial Intelligence, vol. 8(3), pp. 98, 2023, doi: 10.9781/ijimai.2023.08.005.
  7. R. Guo, Y. Qi, B. Zhao, Z. Pei, F. Wen, S. Wu, and Q. Zhang, “High-Resolution Urban Air Quality Mapping for Multiple Pollutants Based on Dense Monitoring Data and Machine Learning,” International Journal of Environmental Research and Public Health, vol. 19(13), pp. 8005, 2022, doi: 10.3390/ijerph19138005.
  8. M. Méndez, M. G. Merayo, and M. Núñez, “Machine learning algorithms to forecast air quality: a survey,” Artificial Intelligence Review, vol. 56(9), pp. 10031–10066, 2023, doi: 10.1007/s10462-023-10424-4.

Details

Primary Language

English

Subjects

Air Pollution Processes and Air Quality Measurement

Journal Section

Research Article

Early Pub Date

November 18, 2025

Publication Date

December 31, 2025

Submission Date

November 18, 2024

Acceptance Date

December 12, 2024

Published in Issue

Year 2025 Volume: 8 Number: 4

APA
Singh, S. K., Jain, R., Palaniappan, D., Parmar, K., T, P., & Gothania, J. (2025). Spatiotemporal analysis and machine learning-based prediction of air quality in Indian urban cities. Environmental Research and Technology, 8(4), 809-822. https://doi.org/10.35208/ert.1587308
AMA
1.Singh SK, Jain R, Palaniappan D, Parmar K, T P, Gothania J. Spatiotemporal analysis and machine learning-based prediction of air quality in Indian urban cities. ERT. 2025;8(4):809-822. doi:10.35208/ert.1587308
Chicago
Singh, Sitesh Kumar, Rituraj Jain, Damodharan Palaniappan, Kumar Parmar, Premavathi T, and Jaishri Gothania. 2025. “Spatiotemporal Analysis and Machine Learning-Based Prediction of Air Quality in Indian Urban Cities”. Environmental Research and Technology 8 (4): 809-22. https://doi.org/10.35208/ert.1587308.
EndNote
Singh SK, Jain R, Palaniappan D, Parmar K, T P, Gothania J (December 1, 2025) Spatiotemporal analysis and machine learning-based prediction of air quality in Indian urban cities. Environmental Research and Technology 8 4 809–822.
IEEE
[1]S. K. Singh, R. Jain, D. Palaniappan, K. Parmar, P. T, and J. Gothania, “Spatiotemporal analysis and machine learning-based prediction of air quality in Indian urban cities”, ERT, vol. 8, no. 4, pp. 809–822, Dec. 2025, doi: 10.35208/ert.1587308.
ISNAD
Singh, Sitesh Kumar - Jain, Rituraj - Palaniappan, Damodharan - Parmar, Kumar - T, Premavathi - Gothania, Jaishri. “Spatiotemporal Analysis and Machine Learning-Based Prediction of Air Quality in Indian Urban Cities”. Environmental Research and Technology 8/4 (December 1, 2025): 809-822. https://doi.org/10.35208/ert.1587308.
JAMA
1.Singh SK, Jain R, Palaniappan D, Parmar K, T P, Gothania J. Spatiotemporal analysis and machine learning-based prediction of air quality in Indian urban cities. ERT. 2025;8:809–822.
MLA
Singh, Sitesh Kumar, et al. “Spatiotemporal Analysis and Machine Learning-Based Prediction of Air Quality in Indian Urban Cities”. Environmental Research and Technology, vol. 8, no. 4, Dec. 2025, pp. 809-22, doi:10.35208/ert.1587308.
Vancouver
1.Sitesh Kumar Singh, Rituraj Jain, Damodharan Palaniappan, Kumar Parmar, Premavathi T, Jaishri Gothania. Spatiotemporal analysis and machine learning-based prediction of air quality in Indian urban cities. ERT. 2025 Dec. 1;8(4):809-22. doi:10.35208/ert.1587308