Research Article

Residual Modelling as a New Approach for Variable Selection

Number: 10 December 31, 2024
EN TR

Residual Modelling as a New Approach for Variable Selection

Abstract

Variable selection in statistical model building still has challenges to overcome as the depth and breadth of the research data is expanding. To help reduce this challenge, we introduce a new approach in variable selection, called residual modeling, which can be applicable regardless of the number of predictors. We compare the statistical power and type-1 error retainment of the forward, backward, and stepwise variable selection approaches with the proposed modeling strategy controlling for known predictors. In Residual Modeling, each predictor enters the model as a single predictor, whose resulting residuals become the dependent variable for the next predictor, and so on. We compare these models under different scenarios with varying sample sizes and various combinations of significant and insignificant predictors. When there exist known predictors from the literature, in identifying new significant predictors controlling for these known predictors, Residual Modelling shows higher statistical power especially as the number of predictors increases compared to the other variable selection methods used. It also has reduced bias in parameter estimation and reduced standard errors. The Type-1 error was retained at its nominal level for Residual Modelling while forward, backward, and stepwise variable selection approaches had slightly reduced Type-1 Error rates. When dealing with multiple predictors in the presence of known significant predictors, Residual Modelling offers a practical solution without causing loss of statistical power or increased Type-1 Error Rate.

Keywords

Supporting Institution

TUBİTAK-BİDEB-2232 International Fellowship for Outstanding Researchers (Award No: 118C306)

Ethical Statement

Our research protocol was approved by Istanbul Medipol University Ethics Committee (Application number: 10840098-604.01.01-E.53819)

Thanks

This study was partially funded by TUBITAK Directorate of Science Fellowships and Grant Programmes (BİDEB)-2232 International Fellowship for Outstanding Researchers. We also thank the Turkish Republic Ministry of Commerce and Turkish Statistical Institute for data sharing. The opinions raised in this article solely belong to its authors, and does not represent the position of TUBITAK, Turkish Republic Ministry of Commerce and Turkish Statistical Institute in any shape or form.

References

  1. [1] W. Sauerbrei, A. Perperoglou, M. Schmid, M. Abrahamowicz, H. Becher, H. Binder, D. Dunkler, F. E. Harrell, P. Royston, and G. Heinze, "State of the art in selection of variables and functional forms in multivariable analysis—outstanding issues," Diagnostic and Prognostic Research, vol. 4, pp. 1–8, Dec. 2020. doi: 10.1186/s41512-020-00074-3.
  2. [2] M. Z. Chowdhury and T. C. Turin, "Variable selection strategies and its importance in clinical prediction modelling," Family Medicine and Community Health, vol. 8, no. 1, 2020. doi: 10.1136/fmch-2019-000262.
  3. [3] G. Claeskens, "Statistical model choice," Annual Review of Statistics and Its Application, vol. 3, pp. 233–256, Jun. 2016. doi: 10.1146/annurev-statistics-041715-033659.
  4. [4] G. Heinze, C. Wallisch, and D. Dunkler, "Variable selection–a review and recommendations for the practicing statistician," Biometrical Journal, vol. 60, no. 3, pp. 431–449, May 2018. doi: 10.1002/bimj.201700067.
  5. [5] Y. Wang, Q. Chen, N. Zhang, and Y. Wang, "Conditional residual modeling for probabilistic load forecasting," IEEE Transactions on Power Systems, vol. 33, no. 6, pp. 7327–7330, Aug. 2018. doi: 10.1109/TPWRS.2018.2819668.
  6. [6] P. S. de Mattos Neto, G. D. Cavalcanti, D. S. O. Santos Júnior, and E. G. Silva, "Hybrid systems using residual modeling for sea surface temperature forecasting," Scientific Reports, vol. 12, no. 1, p. 487, Jan. 2022. doi: 10.1038/s41598-021-04342-8.
  7. [7] M. Goodwin, "Residual modeling in music analysis-synthesis," in 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, May 1996, vol. 2, pp. 1005–1008. IEEE. doi: 10.1109/ICASSP.1996.543310.
  8. [8] X. Weng, Y. Li, L. Chi, and Y. Mu, "High-capacity convolutional video steganography with temporal residual modeling," in Proceedings of the 2019 on International Conference on Multimedia Retrieval, Jun. 2019, pp. 87–95. doi: 10.1145/3323873.3325047.

Details

Primary Language

English

Subjects

Biostatistics

Journal Section

Research Article

Early Pub Date

December 24, 2024

Publication Date

December 31, 2024

Submission Date

July 30, 2024

Acceptance Date

November 24, 2024

Published in Issue

Year 2024 Number: 10

APA
Koçak, A. N., Daşdelen, M. F., & Koçak, M. (2024). Residual Modelling as a New Approach for Variable Selection. Journal of Statistics and Applied Sciences, 10, 86-95. https://doi.org/10.52693/jsas.1525029
AMA
1.Koçak AN, Daşdelen MF, Koçak M. Residual Modelling as a New Approach for Variable Selection. JSAS. 2024;(10):86-95. doi:10.52693/jsas.1525029
Chicago
Koçak, Aslı Nurefşan, Muhammet Furkan Daşdelen, and Mehmet Koçak. 2024. “Residual Modelling As a New Approach for Variable Selection”. Journal of Statistics and Applied Sciences, nos. 10: 86-95. https://doi.org/10.52693/jsas.1525029.
EndNote
Koçak AN, Daşdelen MF, Koçak M (December 1, 2024) Residual Modelling as a New Approach for Variable Selection. Journal of Statistics and Applied Sciences 10 86–95.
IEEE
[1]A. N. Koçak, M. F. Daşdelen, and M. Koçak, “Residual Modelling as a New Approach for Variable Selection”, JSAS, no. 10, pp. 86–95, Dec. 2024, doi: 10.52693/jsas.1525029.
ISNAD
Koçak, Aslı Nurefşan - Daşdelen, Muhammet Furkan - Koçak, Mehmet. “Residual Modelling As a New Approach for Variable Selection”. Journal of Statistics and Applied Sciences. 10 (December 1, 2024): 86-95. https://doi.org/10.52693/jsas.1525029.
JAMA
1.Koçak AN, Daşdelen MF, Koçak M. Residual Modelling as a New Approach for Variable Selection. JSAS. 2024;:86–95.
MLA
Koçak, Aslı Nurefşan, et al. “Residual Modelling As a New Approach for Variable Selection”. Journal of Statistics and Applied Sciences, no. 10, Dec. 2024, pp. 86-95, doi:10.52693/jsas.1525029.
Vancouver
1.Aslı Nurefşan Koçak, Muhammet Furkan Daşdelen, Mehmet Koçak. Residual Modelling as a New Approach for Variable Selection. JSAS. 2024 Dec. 1;(10):86-95. doi:10.52693/jsas.1525029