Research Article

High-Dimensional Robust Estimation Methods in Contaminated Datasets and a Review of the Environmental Kuznets Curve Hypothesis in OECD Countries

Volume: 18 Number: 1 June 1, 2026
EN

High-Dimensional Robust Estimation Methods in Contaminated Datasets and a Review of the Environmental Kuznets Curve Hypothesis in OECD Countries

Abstract

In high-dimensional datasets, it is a subject of investigation whether robust regression methods have high prediction and variable selection performance when vertical outliers and leverage points change from light-tailed errors to heavy-tailed errors. In this paper, we study the performance of six types of high-dimensional robust regression models (LAD-Lasso, Q-Lasso applied at the 0.25 and 0.75 quantile levels, Huber-Lasso, MTE-Lasso, RLARS, and Sparse-LTS) by creating both uncontaminated and contaminated control groups with different distributions of error terms. The methods are validated by comparing the estimation metrics at different scenarios through object-oriented simulations. Additionally, in this study, real data sets are examined and the performance of robust regression methods is evaluated to select the factors determining the CO2 emissions per capita of OECD countries under the EKC (Environmental Kuznets Curve) hypothesis. The results prove that, although simulations show that different methods perform well on different datasets with different contaminations, Spars LTS is superior to other methods. In addition, sparse LTS also performs better in terms of variable selection and prediction success in real OECD dataset with outliers and leverage points.

Keywords

References

  1. Adomaityte, U., L. Defilippis, B. Loureiro and G. Sicuro (2024). High-Dimensional Robust Regression Under Heavy-Tailed Data: Asymptotics and Universality. Journal of Statistical Mechanics: Theory and Experiment, 2024(11), 114002.
  2. Alfons, A. (2022). “Package‘perryExamples’: Examples for Integrating Prediction Error Estimation into Regression Models. https://cran.r-project.org/web/packages/perryExamples/refman/perryExamples.html
  3. Alfons, A. (2022). Package ‘simFrame’: Simulation Framework. https://CRAN.R-project.org/package=simFrame
  4. Alfons, A. and M. A. Alfons (2021). Package ‘robustHD’: An R Package for Robust Regression with High-Dimensional Data. Journal of Open-Source Software, 6, 1-61
  5. Alfons, A., C. Croux and S. Gelper (2013). Sparse Least Trimmed Squares Regression for Analyzing High-Dimensional Large Data Sets. The Annals of Applied Statistics, 7(1), 226-248.
  6. Amato, U., A. Antoniadis, I. De Feis, and I. Gijbels (2021). Penalised Robust Estimators for Sparse and High-Dimensional Linear Models. Statistical Methods & Applications, 30(1), 1-48.
  7. Belloni, A., and V. Chernozhukov (2011). L1 -Penalized Quantile Regression in High-Dimensional Sparse Models. The Annals of Statistics, 39, 82-130.
  8. Dalalyan, A. and P. Thompson (2019). Outlier-Robust Estimation of a Sparse Linear Model Using L1 Penalized Huber's M -Estimator. Advances in Neural Information Processing Systems, 32, 13208-13219.

Details

Primary Language

English

Subjects

Econometric and Statistical Methods

Journal Section

Research Article

Publication Date

June 1, 2026

Submission Date

October 25, 2025

Acceptance Date

June 1, 2026

Published in Issue

Year 2026 Volume: 18 Number: 1

APA
Topal, K. H., & Çağlayan Akay, E. (2026). High-Dimensional Robust Estimation Methods in Contaminated Datasets and a Review of the Environmental Kuznets Curve Hypothesis in OECD Countries. International Econometric Review, 18(1), 43-63. https://izlik.org/JA38XR37GX
AMA
1.Topal KH, Çağlayan Akay E. High-Dimensional Robust Estimation Methods in Contaminated Datasets and a Review of the Environmental Kuznets Curve Hypothesis in OECD Countries. IER. 2026;18(1):43-63. https://izlik.org/JA38XR37GX
Chicago
Topal, Kadriye Hilal, and Ebru Çağlayan Akay. 2026. “High-Dimensional Robust Estimation Methods in Contaminated Datasets and a Review of the Environmental Kuznets Curve Hypothesis in OECD Countries”. International Econometric Review 18 (1): 43-63. https://izlik.org/JA38XR37GX.
EndNote
Topal KH, Çağlayan Akay E (June 1, 2026) High-Dimensional Robust Estimation Methods in Contaminated Datasets and a Review of the Environmental Kuznets Curve Hypothesis in OECD Countries. International Econometric Review 18 1 43–63.
IEEE
[1]K. H. Topal and E. Çağlayan Akay, “High-Dimensional Robust Estimation Methods in Contaminated Datasets and a Review of the Environmental Kuznets Curve Hypothesis in OECD Countries”, IER, vol. 18, no. 1, pp. 43–63, June 2026, [Online]. Available: https://izlik.org/JA38XR37GX
ISNAD
Topal, Kadriye Hilal - Çağlayan Akay, Ebru. “High-Dimensional Robust Estimation Methods in Contaminated Datasets and a Review of the Environmental Kuznets Curve Hypothesis in OECD Countries”. International Econometric Review 18/1 (June 1, 2026): 43-63. https://izlik.org/JA38XR37GX.
JAMA
1.Topal KH, Çağlayan Akay E. High-Dimensional Robust Estimation Methods in Contaminated Datasets and a Review of the Environmental Kuznets Curve Hypothesis in OECD Countries. IER. 2026;18:43–63.
MLA
Topal, Kadriye Hilal, and Ebru Çağlayan Akay. “High-Dimensional Robust Estimation Methods in Contaminated Datasets and a Review of the Environmental Kuznets Curve Hypothesis in OECD Countries”. International Econometric Review, vol. 18, no. 1, June 2026, pp. 43-63, https://izlik.org/JA38XR37GX.
Vancouver
1.Kadriye Hilal Topal, Ebru Çağlayan Akay. High-Dimensional Robust Estimation Methods in Contaminated Datasets and a Review of the Environmental Kuznets Curve Hypothesis in OECD Countries. IER [Internet]. 2026 Jun. 1;18(1):43-6. Available from: https://izlik.org/JA38XR37GX