Research Article

Comparison of Different Estimation Approaches in Rare Events Data

Volume: 21 Number: 3 June 30, 2021
  • Ece Bacaksız
  • Selçuk Koç
EN

Comparison of Different Estimation Approaches in Rare Events Data

Abstract

In social science researches, there may be cases where a category of the dependent variable is seen hundred times less (more) than the other category. Events like wars, mass migrations or coups in social sciences; an event of interest in binary variable(s) may have very low prevalence, resulting in low or even zero cell counts in one or two cells in the 2X2 tables of two factors. In this case, independent variable predict the dependent variable perfectly or almost perfectly, and this leads to an issue called complete or quasi-complete separation problem in statistical modelling. This study aims to compare three methods suggested in the literature for the quasi-complete separation in a real small dataset; penalized maximum likelihood (Firth-type), exact logistic regression and bayesian logistic regression. Methods were compared via odds ratios, odds’ standard error estimates, confidence intervals and statistical significance. Parameter estimates were obtained under three different models with binary and continuous variables. Results show that all methods can provide convergence in the presence of quasi-complete separation. In conclusion, bayesian logistic regression estimates tend to be superior than the other methods in terms of estimation of standard errors.

Keywords

References

  1. Allison P.D. (2008). Convergence failures in logistic regression. In: Proceedings of the SAS Global Forum 2008 Conference. SAS Institute Inc., Cary, NC. http://www2.sas.com/proceedings/ forum2008/360-2008.pdf
  2. Cengiz, M.A., Terzi, E. Şenel, T. ve Murat, N. (2013). Lojistik regresyonda parametre tahmininde Bayesci bir yaklaşım. Afyon Kocatepe Üniversitesi Fen Bilimleri Dergisi, 12(2012), 15-22.
  3. Derr R.E. (2009). Performing exact logistic regression with the SAS System-Revised 2009. Proceedings of the Twenty-fifth Annual SAS Users Group International Conference; Cary, NC; 2009: Citeseer.
  4. Devika, S. Jeyaseelan, L. ve Sebastian, G. (2016). Analysis of sparse data in logistic regression in medical research: a newer approach. Journal of Postgraduate Medicine, 62(1), 26-31.
  5. Eyduran, E. (2008). Usage of penalized maximum likelihood estimation method in medical research: an alternative to maximum likelihood estimation method, JRMS 13(6), 325- 330.
  6. Firth D. (1993). Bias reduction of maximum likelihood estimates. Biometrika, 80(1), 27-38.
  7. Gavanji, R. (2019). Penalized Regression Methods for Modelling Rare Events Data with Application to Occupational Injury Study (Doctoral dissertation, University of Saskatchewan).
  8. Gelman, A. ve Hill, J. (2007). Data analysis using regression and multilevel/hierarchical models. Cambridge University Press, USA.

Details

Primary Language

English

Subjects

Economics

Journal Section

Research Article

Authors

Ece Bacaksız This is me
0000-0003-0534-6011
Türkiye

Publication Date

June 30, 2021

Submission Date

January 21, 2021

Acceptance Date

June 23, 2021

Published in Issue

Year 2021 Volume: 21 Number: 3

APA
Bacaksız, E., & Koç, S. (2021). Comparison of Different Estimation Approaches in Rare Events Data. Ege Academic Review, 21(3), 263-272. https://doi.org/10.21121/eab.960840
AMA
1.Bacaksız E, Koç S. Comparison of Different Estimation Approaches in Rare Events Data. ear. 2021;21(3):263-272. doi:10.21121/eab.960840
Chicago
Bacaksız, Ece, and Selçuk Koç. 2021. “Comparison of Different Estimation Approaches in Rare Events Data”. Ege Academic Review 21 (3): 263-72. https://doi.org/10.21121/eab.960840.
EndNote
Bacaksız E, Koç S (June 1, 2021) Comparison of Different Estimation Approaches in Rare Events Data. Ege Academic Review 21 3 263–272.
IEEE
[1]E. Bacaksız and S. Koç, “Comparison of Different Estimation Approaches in Rare Events Data”, ear, vol. 21, no. 3, pp. 263–272, June 2021, doi: 10.21121/eab.960840.
ISNAD
Bacaksız, Ece - Koç, Selçuk. “Comparison of Different Estimation Approaches in Rare Events Data”. Ege Academic Review 21/3 (June 1, 2021): 263-272. https://doi.org/10.21121/eab.960840.
JAMA
1.Bacaksız E, Koç S. Comparison of Different Estimation Approaches in Rare Events Data. ear. 2021;21:263–272.
MLA
Bacaksız, Ece, and Selçuk Koç. “Comparison of Different Estimation Approaches in Rare Events Data”. Ege Academic Review, vol. 21, no. 3, June 2021, pp. 263-72, doi:10.21121/eab.960840.
Vancouver
1.Ece Bacaksız, Selçuk Koç. Comparison of Different Estimation Approaches in Rare Events Data. ear. 2021 Jun. 1;21(3):263-72. doi:10.21121/eab.960840