Research Article

Robust and sparse logistic regression for high-dimensional data: A DPD-ENP hybrid model

Volume: 54 Number: 6 December 30, 2025
EN

Robust and sparse logistic regression for high-dimensional data: A DPD-ENP hybrid model

Abstract

A popular statistical technique for modeling binary response variables is logistic regression. Nevertheless, the performance of standard logistic regression may be affected by the sensitivity of its maximum likelihood estimation to correlated predictor variables and outliers. In addition, traditional estimation techniques can occasionally become too complicated when working with high-dimensional datasets. In order to overcome these constraints, we provide a sparse and robust logistic regression model that makes use of the elastic net penalty as a regularizer that induces sparsity and density power divergence for robustness. Our method makes use of k-fold cross-validation to guaranty model stability and generalizability along with the majorization-minimization algorithm for effective parameter estimation. The efficacy of the suggested strategy for managing outliers in high dimensions is demonstrated through the execution of simulated datasets and a real-time example using the breast cancer data set.

Keywords

References

  1. [1] B. Akturk, U. Beyaztas, H.L. Shang and A. Mandal, Robust functional logistic regression. Adv. Data Anal. Classif. 18 (1), 125, 2024.
  2. [2] Z. Y. Algamal and M. H. Lee, High dimensional logistic regression model using adjusted elastic net penalty, Pak. J. Stat. Oper. Res. 11 (4), 667676, 2015.
  3. [3] A. Basu, I. R. Harris, N. L. Hjort and M. C. Jones, Robust and efficient estimation by minimising a density power divergence, Biometrika 85 (3), 549559, 1998.
  4. [4] A. M. Bianco and V. J. Yohai, Robust estimation in the logistic regression model, Robust Stat., Data Anal. Comput. Intensive Methods, Springer 109, 1734, 1996.
  5. [5] E. Cantoni and E. Ronchetti, Robust inference for generalized linear models, J. Am. Stat. Assoc. 96 (455), 10221030, 2001.
  6. [6] R. J. Carroll and S. Pederson, On robustness in the logistic regression model, J. R. Stat. Soc. Ser. B (Methodol.) 55 (3), 693706, 1993.
  7. [7] R. D. Cook and S. Weisberg, Residuals and Influence in Regression, Chapman and Hall, New York, 1982. Available at: https://conservancy.umn.edu/handle/11299/37076
  8. [8] D. Cornilly, L. Tubex, S. Van Aelst and T. Verdonck, Robust and sparse logistic regression, Adv. Data Anal. Classif. 18 (3), 663679, 2024.

Details

Primary Language

English

Subjects

Computational Statistics

Journal Section

Research Article

Early Pub Date

November 19, 2025

Publication Date

December 30, 2025

Submission Date

June 26, 2025

Acceptance Date

November 8, 2025

Published in Issue

Year 2025 Volume: 54 Number: 6

APA
Arya, K., & T, P. (2025). Robust and sparse logistic regression for high-dimensional data: A DPD-ENP hybrid model. Hacettepe Journal of Mathematics and Statistics, 54(6), 2463-2482. https://doi.org/10.15672/hujms.1727381
AMA
1.Arya K, T P. Robust and sparse logistic regression for high-dimensional data: A DPD-ENP hybrid model. Hacettepe Journal of Mathematics and Statistics. 2025;54(6):2463-2482. doi:10.15672/hujms.1727381
Chicago
Arya, K, and Palanisamy T. 2025. “Robust and Sparse Logistic Regression for High-Dimensional Data: A DPD-ENP Hybrid Model”. Hacettepe Journal of Mathematics and Statistics 54 (6): 2463-82. https://doi.org/10.15672/hujms.1727381.
EndNote
Arya K, T P (December 1, 2025) Robust and sparse logistic regression for high-dimensional data: A DPD-ENP hybrid model. Hacettepe Journal of Mathematics and Statistics 54 6 2463–2482.
IEEE
[1]K. Arya and P. T, “Robust and sparse logistic regression for high-dimensional data: A DPD-ENP hybrid model”, Hacettepe Journal of Mathematics and Statistics, vol. 54, no. 6, pp. 2463–2482, Dec. 2025, doi: 10.15672/hujms.1727381.
ISNAD
Arya, K - T, Palanisamy. “Robust and Sparse Logistic Regression for High-Dimensional Data: A DPD-ENP Hybrid Model”. Hacettepe Journal of Mathematics and Statistics 54/6 (December 1, 2025): 2463-2482. https://doi.org/10.15672/hujms.1727381.
JAMA
1.Arya K, T P. Robust and sparse logistic regression for high-dimensional data: A DPD-ENP hybrid model. Hacettepe Journal of Mathematics and Statistics. 2025;54:2463–2482.
MLA
Arya, K, and Palanisamy T. “Robust and Sparse Logistic Regression for High-Dimensional Data: A DPD-ENP Hybrid Model”. Hacettepe Journal of Mathematics and Statistics, vol. 54, no. 6, Dec. 2025, pp. 2463-82, doi:10.15672/hujms.1727381.
Vancouver
1.K Arya, Palanisamy T. Robust and sparse logistic regression for high-dimensional data: A DPD-ENP hybrid model. Hacettepe Journal of Mathematics and Statistics. 2025 Dec. 1;54(6):2463-82. doi:10.15672/hujms.1727381