Research Article

Visual research on the trustability of classical variable selection methods in Cox regression

Volume: 49 Number: 2 April 2, 2020
EN

Visual research on the trustability of classical variable selection methods in Cox regression

Abstract

Multivariate models such as the Cox regression model, if developed carefully, are powerful tools for making prognostic prediction which are frequently used in studies of clinical outcomes. Many applications require a large number of variables to be modelled by using a relatively small patient sample. Determination of the important variables in a model is critical to understand the behaviour of phenomena as the independent variables contribute the most to the outcome. From a practical perspective, a small subset of independent variables are usually selected from a large data set without the loss of any predictive efficiency. Automatic variable selection algorithms in scientific studies are commonly used for obtaining interpretable and practically applicable models. However, the careless use of these methods may lead to statistical problems. The performance of the generated models may be poor due to the violation of assumption, omission of the important variables, problems of overfitting, and the problem of multicollinearity and outliers. In order to enhance the accuracy of a model, it is essential to explore the data and its main characteristics before making any statistical inference. This study suggests an approach for acquiring a trustworthy model selection procedure for survival data by performing classical variables selection methods, accompanied by a graphical visualization method, namely robust coplot. Thus, it enables us to investigate the discrimination of observations, clusters of the variables and clusters of the observations that are highly characterized by a particular variable in a one graph. We present an application of combined method, as an integral part of statistical modelling, on survival data on multiple myeloma to show how coplot results are used in automatic variable selection algorithm in Cox regression model-building.

Keywords

References

  1. [1] H. Akaike, A new look at the statistical model identification, IEEE Transactions on Automatic Control AC 19, 716-723, 1974.
  2. [2] N. Ata and M.T. Sozer, Cox regression models with nonproportional hazards applied to lung cancer survival data, Hacet. J. Math. Stat. 36 (2), 157-167, 2007.
  3. [3] Y.K. Atilgan, Robust coplot analysis, Comm. Statist. Simulation Comput. 45 (5), 1763-1775, 2016.
  4. [4] Y.K. Atilgan and E.L. Atilgan, RobCoP: A Matlab Package for Robust CoPlot Analysis, Open Journal of Statistics 7, 23-35, 2017.
  5. [5] T. Bednarski, On sensitivity of Coxs estimator, Statistics and Decisions 7, 215-228, 1989.
  6. [6] D.M. Bravata, K.G. Shojania, I. Oklin and A. Raveh A tool for visualizing multivariate data in medicine, Stat. Med. 27 (12), 2234-2247, 2007.
  7. [7] D. Collett,Modeling Survival Data in Medical Research, 2nd Ed. New York: Chapman @ Hall/ CRS A CRC Press Company, 2003.
  8. [8] D.R.Cox, Regression Models and Life Tables, J. R. Stat. Soc. Ser. B. Stat. Methodol. 34 (2), 187-220, 1972.

Details

Primary Language

English

Subjects

Statistics

Journal Section

Research Article

Publication Date

April 2, 2020

Submission Date

October 7, 2019

Acceptance Date

March 11, 2020

Published in Issue

Year 2020 Volume: 49 Number: 2

APA
Ata Tutkun, N., & Kayhan Atılgan, Y. (2020). Visual research on the trustability of classical variable selection methods in Cox regression. Hacettepe Journal of Mathematics and Statistics, 49(2), 869-886. https://doi.org/10.15672/hujms.630402
AMA
1.Ata Tutkun N, Kayhan Atılgan Y. Visual research on the trustability of classical variable selection methods in Cox regression. Hacettepe Journal of Mathematics and Statistics. 2020;49(2):869-886. doi:10.15672/hujms.630402
Chicago
Ata Tutkun, Nihal, and Yasemin Kayhan Atılgan. 2020. “Visual Research on the Trustability of Classical Variable Selection Methods in Cox Regression”. Hacettepe Journal of Mathematics and Statistics 49 (2): 869-86. https://doi.org/10.15672/hujms.630402.
EndNote
Ata Tutkun N, Kayhan Atılgan Y (April 1, 2020) Visual research on the trustability of classical variable selection methods in Cox regression. Hacettepe Journal of Mathematics and Statistics 49 2 869–886.
IEEE
[1]N. Ata Tutkun and Y. Kayhan Atılgan, “Visual research on the trustability of classical variable selection methods in Cox regression”, Hacettepe Journal of Mathematics and Statistics, vol. 49, no. 2, pp. 869–886, Apr. 2020, doi: 10.15672/hujms.630402.
ISNAD
Ata Tutkun, Nihal - Kayhan Atılgan, Yasemin. “Visual Research on the Trustability of Classical Variable Selection Methods in Cox Regression”. Hacettepe Journal of Mathematics and Statistics 49/2 (April 1, 2020): 869-886. https://doi.org/10.15672/hujms.630402.
JAMA
1.Ata Tutkun N, Kayhan Atılgan Y. Visual research on the trustability of classical variable selection methods in Cox regression. Hacettepe Journal of Mathematics and Statistics. 2020;49:869–886.
MLA
Ata Tutkun, Nihal, and Yasemin Kayhan Atılgan. “Visual Research on the Trustability of Classical Variable Selection Methods in Cox Regression”. Hacettepe Journal of Mathematics and Statistics, vol. 49, no. 2, Apr. 2020, pp. 869-86, doi:10.15672/hujms.630402.
Vancouver
1.Nihal Ata Tutkun, Yasemin Kayhan Atılgan. Visual research on the trustability of classical variable selection methods in Cox regression. Hacettepe Journal of Mathematics and Statistics. 2020 Apr. 1;49(2):869-86. doi:10.15672/hujms.630402

Cited By