Araştırma Makalesi
BibTex RIS Kaynak Göster
Yıl 2019, Cilt: 23 Sayı: 6, 1225 - 1236, 01.12.2019

Öz

Kaynakça

  • [1] J. L. Schafer, Analysis of incomplete multivariate data: Chapman and Hall/CRC, 1997.
  • [2] I. R. Dohoo, C. R. Nielsen, and U. Emanuelson, "Multiple imputation in veterinary epidemiological studies: a case study and simulation," Preventive veterinary medicine, vol. 129, pp. 35-47, 2016.
  • [3] G. Ser and S. Keskin, "EXAMINING OF MULTIPLE IMPUTATION METHOD IN TWO MISSING OBSERVATION MECHANISMS," JAPS, Journal of Animal and Plant Sciences, vol. 26, pp. 594-598, 2016.
  • [4] P. Cihan, E. Gökçe, and O. Kalıpsız, "A review of machine learning applications in veterinary field," Kafkas Univ Vet Fak Derg, vol. 23, pp. 673-680, 2017.
  • [5] O. Troyanskaya, M. Cantor, G. Sherlock, P. Brown, T. Hastie, R. Tibshirani, et al., "Missing value estimation methods for DNA microarrays," Bioinformatics, vol. 17, pp. 520-525, 2001.
  • [6] S. Van Buuren, H. C. Boshuizen, and D. L. Knook, "Multiple imputation of missing blood pressure covariates in survival analysis," Statistics in medicine, vol. 18, pp. 681-694, 1999.
  • [7] D. J. Stekhoven and P. Bühlmann, "MissForest—non-parametric missing value imputation for mixed-type data," Bioinformatics, vol. 28, pp. 112-118, 2011.
  • [8] E. HM, "An epidemiological study on neonatal lamb health," Kafkas Üniversitesi Veteriner Fakültesi Dergisi, vol. 15, 2009.
  • [9] K. AH and E. HM, "Risk Factors Associated with Passive Immunity, Health, Birth Weight And Growth Performance in Lambs: III. The Relationship among Passive Immunity, Birth Weight Gender, Birth Type, Parity, Dam," Kafkas Üniversitesi Veteriner Fakültesi Dergisi, vol. 19, 2013.
  • [10] R. J. Little and D. B. Rubin, Statistical analysis with missing data vol. 333: John Wiley & Sons, 2014.
  • [11] E. Alpaydin, Introduction to machine learning: MIT press, 2009.
  • [12] A. J. Viera and J. M. Garrett, "Understanding interobserver agreement: the kappa statistic," Fam Med, vol. 37, pp. 360-363, 2005.

Effect of Imputation Methods in the Classifier Performance

Yıl 2019, Cilt: 23 Sayı: 6, 1225 - 1236, 01.12.2019

Öz

Missing
values in a data set present an important problem for almost any traditional
and modern statistical method, since most of these methods were developed under
the assumption that the data set was complete. However, in the real world no
complete datasets are available and the issue of missing data is frequently
encountered in veterinary field studies as in other fields. While imputation of
missing data is important in veterinary field studies where data mining is
newly starting to be implemented, another important issue is how it should be
imputed. This is because in many studies observations with any variables having
missing values are being removed or they are completed by traditional methods.
In recent years, while alternative approaches are widely available to prevent
removal of observations with missing values, they are being used rarely. The
aim of this study is to examine mean, median, nearest neighbors, mice and
missForest methods to impute the simulated missing data which is the randomly
removed with varying frequencies (5 to 25% by 5%) from original veterinary
dataset. Then highly accurate methods selected to impute original dataset for
observation of influence in classifier performance and to determine the optimal
imputation method for the original dataset. 

Kaynakça

  • [1] J. L. Schafer, Analysis of incomplete multivariate data: Chapman and Hall/CRC, 1997.
  • [2] I. R. Dohoo, C. R. Nielsen, and U. Emanuelson, "Multiple imputation in veterinary epidemiological studies: a case study and simulation," Preventive veterinary medicine, vol. 129, pp. 35-47, 2016.
  • [3] G. Ser and S. Keskin, "EXAMINING OF MULTIPLE IMPUTATION METHOD IN TWO MISSING OBSERVATION MECHANISMS," JAPS, Journal of Animal and Plant Sciences, vol. 26, pp. 594-598, 2016.
  • [4] P. Cihan, E. Gökçe, and O. Kalıpsız, "A review of machine learning applications in veterinary field," Kafkas Univ Vet Fak Derg, vol. 23, pp. 673-680, 2017.
  • [5] O. Troyanskaya, M. Cantor, G. Sherlock, P. Brown, T. Hastie, R. Tibshirani, et al., "Missing value estimation methods for DNA microarrays," Bioinformatics, vol. 17, pp. 520-525, 2001.
  • [6] S. Van Buuren, H. C. Boshuizen, and D. L. Knook, "Multiple imputation of missing blood pressure covariates in survival analysis," Statistics in medicine, vol. 18, pp. 681-694, 1999.
  • [7] D. J. Stekhoven and P. Bühlmann, "MissForest—non-parametric missing value imputation for mixed-type data," Bioinformatics, vol. 28, pp. 112-118, 2011.
  • [8] E. HM, "An epidemiological study on neonatal lamb health," Kafkas Üniversitesi Veteriner Fakültesi Dergisi, vol. 15, 2009.
  • [9] K. AH and E. HM, "Risk Factors Associated with Passive Immunity, Health, Birth Weight And Growth Performance in Lambs: III. The Relationship among Passive Immunity, Birth Weight Gender, Birth Type, Parity, Dam," Kafkas Üniversitesi Veteriner Fakültesi Dergisi, vol. 19, 2013.
  • [10] R. J. Little and D. B. Rubin, Statistical analysis with missing data vol. 333: John Wiley & Sons, 2014.
  • [11] E. Alpaydin, Introduction to machine learning: MIT press, 2009.
  • [12] A. J. Viera and J. M. Garrett, "Understanding interobserver agreement: the kappa statistic," Fam Med, vol. 37, pp. 360-363, 2005.
Toplam 12 adet kaynakça vardır.

Ayrıntılar

Birincil Dil İngilizce
Konular Bilgisayar Yazılımı
Bölüm Araştırma Makalesi
Yazarlar

Pinar Cihan 0000-0001-7958-7251

Oya Kalıpsız 0000-0001-9553-669X

Erhan Gökçe Bu kişi benim 0000-0003-2674-1010

Yayımlanma Tarihi 1 Aralık 2019
Gönderilme Tarihi 22 Ocak 2019
Kabul Tarihi 23 Temmuz 2019
Yayımlandığı Sayı Yıl 2019 Cilt: 23 Sayı: 6

Kaynak Göster

APA Cihan, P., Kalıpsız, O., & Gökçe, E. (2019). Effect of Imputation Methods in the Classifier Performance. Sakarya University Journal of Science, 23(6), 1225-1236.
AMA Cihan P, Kalıpsız O, Gökçe E. Effect of Imputation Methods in the Classifier Performance. SAUJS. Aralık 2019;23(6):1225-1236.
Chicago Cihan, Pinar, Oya Kalıpsız, ve Erhan Gökçe. “Effect of Imputation Methods in the Classifier Performance”. Sakarya University Journal of Science 23, sy. 6 (Aralık 2019): 1225-36.
EndNote Cihan P, Kalıpsız O, Gökçe E (01 Aralık 2019) Effect of Imputation Methods in the Classifier Performance. Sakarya University Journal of Science 23 6 1225–1236.
IEEE P. Cihan, O. Kalıpsız, ve E. Gökçe, “Effect of Imputation Methods in the Classifier Performance”, SAUJS, c. 23, sy. 6, ss. 1225–1236, 2019.
ISNAD Cihan, Pinar vd. “Effect of Imputation Methods in the Classifier Performance”. Sakarya University Journal of Science 23/6 (Aralık 2019), 1225-1236.
JAMA Cihan P, Kalıpsız O, Gökçe E. Effect of Imputation Methods in the Classifier Performance. SAUJS. 2019;23:1225–1236.
MLA Cihan, Pinar vd. “Effect of Imputation Methods in the Classifier Performance”. Sakarya University Journal of Science, c. 23, sy. 6, 2019, ss. 1225-36.
Vancouver Cihan P, Kalıpsız O, Gökçe E. Effect of Imputation Methods in the Classifier Performance. SAUJS. 2019;23(6):1225-36.