Research Article

A robust probabilistic framework for principal component regression: optimizing parameter identification and outlier detection via approximate Bayesian computation

Volume: 54 Number: 4 August 29, 2025
EN

A robust probabilistic framework for principal component regression: optimizing parameter identification and outlier detection via approximate Bayesian computation

Abstract

Anomalies and ill-conditioned predictors present considerable obstacles to reliable parameter estimation in regression models. This paper presents an innovative approach that combines principal component regression with approximate Bayesian computation to address these issues. Principal component regression mitigates the effects of ill-conditioned variables by transforming highly correlated predictors into orthogonal components. Meanwhile, approximate Bayesian computation enhances robustness by approximating the posterior distribution of error variance ($\sigma^2$). This flexible framework models uncertainty and noise effectively. The integration of these methods improves both parameter estimation and anomaly detection. By assigning probabilistic scores to potential outliers, the method provides a more accurate and nuanced identification of anomalies. Extensive validation through simulated and real-world datasets demonstrates the favorable performance of the proposed technique over existing robust methods. These findings highlight the potential of approximate Bayesian computation as a powerful tool to improve the robustness and precision of regression analyzes in noisy and complex data environments.

Keywords

References

  1. [1] A. Alin, Multicollinearity, Wiley Interdisciplinary Reviews: Computational Statistics 2 (3), 370-374, 2010.
  2. [2] T.W. Anderson, An Introduction to Multivariate Statistical Analysis, Wiley, 2003.
  3. [3] M. Alshakhs, P.J. Goedecke, J.E. Bailey and C. Madlock-Brown, Racial differences in healthcare expenditures for prevalent multimorbidity combinations in the usa: A cross sectional study, BMC Medicine 21(1), 399, 2023.
  4. [4] K.C. Arum and F.I. Ugwuowo, Combining principal component and robust ridge estimators in linear regression model with multicollinearity and outlier, Concurr. Comput. Pract. Exp. 34(10), 2022.
  5. [5] K.C. Arum, F.I. Ugwuowo, H.E. Oranye, T.O. Alakija, T.E. Ugah and O. C. Asogwa, Combating outliers and multicollinearity in linear regression model using robust Kibria-Lukman mixed with principal component estimator, simulation and computation, Sci. Afr. 19, 2023.
  6. [6] F.A. Awwad, I. Dawoud and M.R. Abonazel, Development of robust ÖzkaleKaçiranlar and YangChang estimators for regression models in the presence of multicollinearity and outliers. Concurr. Comput. Pract. Exp. 34(6), 2022.
  7. [7] V. Barnett, and T. Lewis, Outliers in statistical data, John Wiley and Sons, New York, 1994.
  8. [8] M.J. Bayarri and J. Morales, Bayesian measures of surprise for outlier detection. J. Stat. Plan. Inference 111(1-2), 3-22, 2003.

Details

Primary Language

English

Subjects

Computational Statistics, Probability Theory

Journal Section

Research Article

Early Pub Date

August 4, 2025

Publication Date

August 29, 2025

Submission Date

January 7, 2025

Acceptance Date

July 30, 2025

Published in Issue

Year 2025 Volume: 54 Number: 4

APA
Tahir, A., & Ilyas, M. (2025). A robust probabilistic framework for principal component regression: optimizing parameter identification and outlier detection via approximate Bayesian computation. Hacettepe Journal of Mathematics and Statistics, 54(4), 1688-1707. https://doi.org/10.15672/hujms.1614841
AMA
1.Tahir A, Ilyas M. A robust probabilistic framework for principal component regression: optimizing parameter identification and outlier detection via approximate Bayesian computation. Hacettepe Journal of Mathematics and Statistics. 2025;54(4):1688-1707. doi:10.15672/hujms.1614841
Chicago
Tahir, Aiman, and Maryam Ilyas. 2025. “A Robust Probabilistic Framework for Principal Component Regression: Optimizing Parameter Identification and Outlier Detection via Approximate Bayesian Computation”. Hacettepe Journal of Mathematics and Statistics 54 (4): 1688-1707. https://doi.org/10.15672/hujms.1614841.
EndNote
Tahir A, Ilyas M (August 1, 2025) A robust probabilistic framework for principal component regression: optimizing parameter identification and outlier detection via approximate Bayesian computation. Hacettepe Journal of Mathematics and Statistics 54 4 1688–1707.
IEEE
[1]A. Tahir and M. Ilyas, “A robust probabilistic framework for principal component regression: optimizing parameter identification and outlier detection via approximate Bayesian computation”, Hacettepe Journal of Mathematics and Statistics, vol. 54, no. 4, pp. 1688–1707, Aug. 2025, doi: 10.15672/hujms.1614841.
ISNAD
Tahir, Aiman - Ilyas, Maryam. “A Robust Probabilistic Framework for Principal Component Regression: Optimizing Parameter Identification and Outlier Detection via Approximate Bayesian Computation”. Hacettepe Journal of Mathematics and Statistics 54/4 (August 1, 2025): 1688-1707. https://doi.org/10.15672/hujms.1614841.
JAMA
1.Tahir A, Ilyas M. A robust probabilistic framework for principal component regression: optimizing parameter identification and outlier detection via approximate Bayesian computation. Hacettepe Journal of Mathematics and Statistics. 2025;54:1688–1707.
MLA
Tahir, Aiman, and Maryam Ilyas. “A Robust Probabilistic Framework for Principal Component Regression: Optimizing Parameter Identification and Outlier Detection via Approximate Bayesian Computation”. Hacettepe Journal of Mathematics and Statistics, vol. 54, no. 4, Aug. 2025, pp. 1688-07, doi:10.15672/hujms.1614841.
Vancouver
1.Aiman Tahir, Maryam Ilyas. A robust probabilistic framework for principal component regression: optimizing parameter identification and outlier detection via approximate Bayesian computation. Hacettepe Journal of Mathematics and Statistics. 2025 Aug. 1;54(4):1688-707. doi:10.15672/hujms.1614841