EN
A robust probabilistic framework for principal component regression: optimizing parameter identification and outlier detection via approximate Bayesian computation
Abstract
Anomalies and ill-conditioned predictors present considerable obstacles to reliable parameter estimation in regression models. This paper presents an innovative approach that combines principal component regression with approximate Bayesian computation to address these issues. Principal component regression mitigates the effects of ill-conditioned variables by transforming highly correlated predictors into orthogonal components. Meanwhile, approximate Bayesian computation enhances robustness by approximating the posterior distribution of error variance ($\sigma^2$). This flexible framework models uncertainty and noise effectively. The integration of these methods improves both parameter estimation and anomaly detection. By assigning probabilistic scores to potential outliers, the method provides a more accurate and nuanced identification of anomalies. Extensive validation through simulated and real-world datasets demonstrates the favorable performance of the proposed technique over existing robust methods. These findings highlight the potential of approximate Bayesian computation as a powerful tool to improve the robustness and precision of regression analyzes in noisy and complex data environments.
Keywords
References
- [1] A. Alin, Multicollinearity, Wiley Interdisciplinary Reviews: Computational Statistics 2 (3), 370-374, 2010.
- [2] T.W. Anderson, An Introduction to Multivariate Statistical Analysis, Wiley, 2003.
- [3] M. Alshakhs, P.J. Goedecke, J.E. Bailey and C. Madlock-Brown, Racial differences in healthcare expenditures for prevalent multimorbidity combinations in the usa: A cross sectional study, BMC Medicine 21(1), 399, 2023.
- [4] K.C. Arum and F.I. Ugwuowo, Combining principal component and robust ridge estimators in linear regression model with multicollinearity and outlier, Concurr. Comput. Pract. Exp. 34(10), 2022.
- [5] K.C. Arum, F.I. Ugwuowo, H.E. Oranye, T.O. Alakija, T.E. Ugah and O. C. Asogwa, Combating outliers and multicollinearity in linear regression model using robust Kibria-Lukman mixed with principal component estimator, simulation and computation, Sci. Afr. 19, 2023.
- [6] F.A. Awwad, I. Dawoud and M.R. Abonazel, Development of robust ÖzkaleKaçiranlar and YangChang estimators for regression models in the presence of multicollinearity and outliers. Concurr. Comput. Pract. Exp. 34(6), 2022.
- [7] V. Barnett, and T. Lewis, Outliers in statistical data, John Wiley and Sons, New York, 1994.
- [8] M.J. Bayarri and J. Morales, Bayesian measures of surprise for outlier detection. J. Stat. Plan. Inference 111(1-2), 3-22, 2003.
Details
Primary Language
English
Subjects
Computational Statistics, Probability Theory
Journal Section
Research Article
Early Pub Date
August 4, 2025
Publication Date
August 29, 2025
Submission Date
January 7, 2025
Acceptance Date
July 30, 2025
Published in Issue
Year 2025 Volume: 54 Number: 4
APA
Tahir, A., & Ilyas, M. (2025). A robust probabilistic framework for principal component regression: optimizing parameter identification and outlier detection via approximate Bayesian computation. Hacettepe Journal of Mathematics and Statistics, 54(4), 1688-1707. https://doi.org/10.15672/hujms.1614841
AMA
1.Tahir A, Ilyas M. A robust probabilistic framework for principal component regression: optimizing parameter identification and outlier detection via approximate Bayesian computation. Hacettepe Journal of Mathematics and Statistics. 2025;54(4):1688-1707. doi:10.15672/hujms.1614841
Chicago
Tahir, Aiman, and Maryam Ilyas. 2025. “A Robust Probabilistic Framework for Principal Component Regression: Optimizing Parameter Identification and Outlier Detection via Approximate Bayesian Computation”. Hacettepe Journal of Mathematics and Statistics 54 (4): 1688-1707. https://doi.org/10.15672/hujms.1614841.
EndNote
Tahir A, Ilyas M (August 1, 2025) A robust probabilistic framework for principal component regression: optimizing parameter identification and outlier detection via approximate Bayesian computation. Hacettepe Journal of Mathematics and Statistics 54 4 1688–1707.
IEEE
[1]A. Tahir and M. Ilyas, “A robust probabilistic framework for principal component regression: optimizing parameter identification and outlier detection via approximate Bayesian computation”, Hacettepe Journal of Mathematics and Statistics, vol. 54, no. 4, pp. 1688–1707, Aug. 2025, doi: 10.15672/hujms.1614841.
ISNAD
Tahir, Aiman - Ilyas, Maryam. “A Robust Probabilistic Framework for Principal Component Regression: Optimizing Parameter Identification and Outlier Detection via Approximate Bayesian Computation”. Hacettepe Journal of Mathematics and Statistics 54/4 (August 1, 2025): 1688-1707. https://doi.org/10.15672/hujms.1614841.
JAMA
1.Tahir A, Ilyas M. A robust probabilistic framework for principal component regression: optimizing parameter identification and outlier detection via approximate Bayesian computation. Hacettepe Journal of Mathematics and Statistics. 2025;54:1688–1707.
MLA
Tahir, Aiman, and Maryam Ilyas. “A Robust Probabilistic Framework for Principal Component Regression: Optimizing Parameter Identification and Outlier Detection via Approximate Bayesian Computation”. Hacettepe Journal of Mathematics and Statistics, vol. 54, no. 4, Aug. 2025, pp. 1688-07, doi:10.15672/hujms.1614841.
Vancouver
1.Aiman Tahir, Maryam Ilyas. A robust probabilistic framework for principal component regression: optimizing parameter identification and outlier detection via approximate Bayesian computation. Hacettepe Journal of Mathematics and Statistics. 2025 Aug. 1;54(4):1688-707. doi:10.15672/hujms.1614841