BibTex RIS Kaynak Göster

A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods

Yıl 2014, Volume 2, 2014, 1 - 11, 26.05.2016

Öz

In this article, we propose clustering approach based on Principal Component Analysis (PCA) to diagnosis of heart disease patients. At the first stage, the original dataset is reduced using PCA reduction method. Then, at the second stage, reduced dataset is applied to clustering methods which is based on fuzzy C-means and K-means algorithms. These algorithms are implemented and tested on a Cleveland heart disease dataset. We compared the clustering results with and without PCA. The results are suggesting that the combination of clustering algorithms and PCA was the most effective at heart disease diagnosis.

Kaynakça

  • World Health Organization, http://www.who.int/topics/cardiovascular diseases/en/
  • Patil, S. B., & Kumaraswamy, Y. S., Intelligent and effective heart attack prediction system using data mining and artificial neural network,European Journal of Scientific Research,31(4),642-656, 2009.
  • Maimon,O. &Rokach, L., Data Mining and Knowledge Discovery Handbook, Springer, 2010.
  • Jolliffe, I., Principal Component Analysis, Springer, 2nd edition, 2002.
  • Ding, C., He, X., K-means Clustering via Principal Component Analysis, International Conference on machine learning , Banff, Canada, 2004.
  • Ziasabounchi, N. & Askerzade, I., ANFIS based classification model for heart disease diagnosis, international Journal of Engineering & Computer Sciences,14(2),7-12, 2014.
  • Patil, B.M., Joshi, R.C., Toshnival. D., Hybrid prediction model for type-2 diabetic patients, Expert systems with applications,37(12), 8102-8108, 2010.
  • Singh, N., Mohapatra, A.G. & Kanungo, G., Breast cancer mass detection in mammograms using K-means and fuzzy C-means clustering, International Journal of Computer Applications,22(2),15-21, 2011.
  • Chitra, R. & Seenivasagam, V., Heart attack prediction system using fuzzy C-mean classifier, IOSR Journal of Computer Engineering,14,23-31, 2013.
  • Kahramanli, H. & Allahverdi, N., Design of a hybrid system for the diabetes and heart disease, Expert systems with applications,35,82-89, 2008.
  • Askerzade, I.N. & Mahmud, M., Design and implementation of group traffic control system using fuzzy logic, International Journal of Research and Reviews in Applied Sciences,6,196-202, 2011.
  • Askerzade, I.N. & Mahmud, M., Control the extension time of traffic light in single junction by using fuzzy logic, International Journal of Electrical & Computer Sciences,10(2),48-55, 2011.
  • Sundar, B., Devi, T., & Saravanan, N., Development of a clustering algorithm for prediction heart, International Journal of Computer Application,48(7),8-13, 2013.
  • Yin, J., Sun, H., Yang,J., & Gou, Q., Comparison of K-Means and fuzzy C-means algorithm performance for automated determination of the arterial input function, PLoS ONE, 9(2),e85884, 2014.
  • Howley, T., Madden, G.M., Connel, M.L. & Ryder, G.A., The effect of principal component analysis on machine learning accuracy with high dimensional spectral data, Knowledge Based Systems,19(5),363–370, 2006.
  • Indhumathi, R. & Sathiyabama, S., Reducing and clustering high dimensional data through principal component analysis, International Journal of computer Application,11(8),1-4, 2010.
  • Avci, E., Turkoglu, I., An intelligent diagnosis system based on principle component analysis and ANFIS for the heart valve diseases, Journal of Expert Systems with Application,36,2873-2878, 2009.
  • Napoleon, D. & Pavalakodi, S., A new method for dimensionality reduction using K- means clustering algorithm for high dimensional data set , International Journal of Computer Applications, 13(7),41-46, 2011.
  • Fan, J., Han, M., & Wang, J., Single point iterative weighted fuzzy C-means clustering algorithm for remote sensing image segmentation, Pattern Recognition,42(11),2527–2540, 2009.
  • Gath, I. & Geva, A.B., Unsupervised optimal fuzzy clustering, IEEE Transaction on Pattern Analysis Machine Intelligence, 11,773-780, 1989.
  • Ross, T., Fuzzy logic with engineering applications, New York: McGraw Hill Co., 1995.
  • Altun, S., Okur, V., Goktepe, A.B. & Ansal, A., Comparison of Dynamic Properties of Clays Obtained by Different Test Methods, 4th International Conference on Earthquake Geotechnical Engineering ,2007.
  • Subbuthai, P., Periasamy, A., & Muruganand, S., Identifying the character by applying PCA method using Matlab, International Journal of Computer Applications ,60(1),8-11, 2012.
  • Zhu, W., Zeng, N. & Wang, N., Sensitivity, Specificity, Accuracy, Associated Confidence Interval and ROC Analysis with Practical SAS Implementations, 2010.
  • Hammouda, K. & Karray, F., A comparative study of data clustering techniques, SYDE 625: Tools of intelligent system design, course project, 2000.
Yıl 2014, Volume 2, 2014, 1 - 11, 26.05.2016

Öz

Kaynakça

  • World Health Organization, http://www.who.int/topics/cardiovascular diseases/en/
  • Patil, S. B., & Kumaraswamy, Y. S., Intelligent and effective heart attack prediction system using data mining and artificial neural network,European Journal of Scientific Research,31(4),642-656, 2009.
  • Maimon,O. &Rokach, L., Data Mining and Knowledge Discovery Handbook, Springer, 2010.
  • Jolliffe, I., Principal Component Analysis, Springer, 2nd edition, 2002.
  • Ding, C., He, X., K-means Clustering via Principal Component Analysis, International Conference on machine learning , Banff, Canada, 2004.
  • Ziasabounchi, N. & Askerzade, I., ANFIS based classification model for heart disease diagnosis, international Journal of Engineering & Computer Sciences,14(2),7-12, 2014.
  • Patil, B.M., Joshi, R.C., Toshnival. D., Hybrid prediction model for type-2 diabetic patients, Expert systems with applications,37(12), 8102-8108, 2010.
  • Singh, N., Mohapatra, A.G. & Kanungo, G., Breast cancer mass detection in mammograms using K-means and fuzzy C-means clustering, International Journal of Computer Applications,22(2),15-21, 2011.
  • Chitra, R. & Seenivasagam, V., Heart attack prediction system using fuzzy C-mean classifier, IOSR Journal of Computer Engineering,14,23-31, 2013.
  • Kahramanli, H. & Allahverdi, N., Design of a hybrid system for the diabetes and heart disease, Expert systems with applications,35,82-89, 2008.
  • Askerzade, I.N. & Mahmud, M., Design and implementation of group traffic control system using fuzzy logic, International Journal of Research and Reviews in Applied Sciences,6,196-202, 2011.
  • Askerzade, I.N. & Mahmud, M., Control the extension time of traffic light in single junction by using fuzzy logic, International Journal of Electrical & Computer Sciences,10(2),48-55, 2011.
  • Sundar, B., Devi, T., & Saravanan, N., Development of a clustering algorithm for prediction heart, International Journal of Computer Application,48(7),8-13, 2013.
  • Yin, J., Sun, H., Yang,J., & Gou, Q., Comparison of K-Means and fuzzy C-means algorithm performance for automated determination of the arterial input function, PLoS ONE, 9(2),e85884, 2014.
  • Howley, T., Madden, G.M., Connel, M.L. & Ryder, G.A., The effect of principal component analysis on machine learning accuracy with high dimensional spectral data, Knowledge Based Systems,19(5),363–370, 2006.
  • Indhumathi, R. & Sathiyabama, S., Reducing and clustering high dimensional data through principal component analysis, International Journal of computer Application,11(8),1-4, 2010.
  • Avci, E., Turkoglu, I., An intelligent diagnosis system based on principle component analysis and ANFIS for the heart valve diseases, Journal of Expert Systems with Application,36,2873-2878, 2009.
  • Napoleon, D. & Pavalakodi, S., A new method for dimensionality reduction using K- means clustering algorithm for high dimensional data set , International Journal of Computer Applications, 13(7),41-46, 2011.
  • Fan, J., Han, M., & Wang, J., Single point iterative weighted fuzzy C-means clustering algorithm for remote sensing image segmentation, Pattern Recognition,42(11),2527–2540, 2009.
  • Gath, I. & Geva, A.B., Unsupervised optimal fuzzy clustering, IEEE Transaction on Pattern Analysis Machine Intelligence, 11,773-780, 1989.
  • Ross, T., Fuzzy logic with engineering applications, New York: McGraw Hill Co., 1995.
  • Altun, S., Okur, V., Goktepe, A.B. & Ansal, A., Comparison of Dynamic Properties of Clays Obtained by Different Test Methods, 4th International Conference on Earthquake Geotechnical Engineering ,2007.
  • Subbuthai, P., Periasamy, A., & Muruganand, S., Identifying the character by applying PCA method using Matlab, International Journal of Computer Applications ,60(1),8-11, 2012.
  • Zhu, W., Zeng, N. & Wang, N., Sensitivity, Specificity, Accuracy, Associated Confidence Interval and ROC Analysis with Practical SAS Implementations, 2010.
  • Hammouda, K. & Karray, F., A comparative study of data clustering techniques, SYDE 625: Tools of intelligent system design, course project, 2000.
Toplam 25 adet kaynakça vardır.

Ayrıntılar

Diğer ID JA22TC24SV
Bölüm Makaleler
Yazarlar

Negar Ziasabounchi Bu kişi benim

İman N. Askerzade Bu kişi benim

Yayımlanma Tarihi 26 Mayıs 2016
Yayımlandığı Sayı Yıl 2014 Volume 2, 2014

Kaynak Göster

APA Ziasabounchi, N., & Askerzade, İ. N. (2016). A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods. Turkish Journal of Mathematics and Computer Science, 2(1), 1-11.
AMA Ziasabounchi N, Askerzade İN. A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods. TJMCS. Mayıs 2016;2(1):1-11.
Chicago Ziasabounchi, Negar, ve İman N. Askerzade. “A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods”. Turkish Journal of Mathematics and Computer Science 2, sy. 1 (Mayıs 2016): 1-11.
EndNote Ziasabounchi N, Askerzade İN (01 Mayıs 2016) A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods. Turkish Journal of Mathematics and Computer Science 2 1 1–11.
IEEE N. Ziasabounchi ve İ. N. Askerzade, “A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods”, TJMCS, c. 2, sy. 1, ss. 1–11, 2016.
ISNAD Ziasabounchi, Negar - Askerzade, İman N. “A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods”. Turkish Journal of Mathematics and Computer Science 2/1 (Mayıs 2016), 1-11.
JAMA Ziasabounchi N, Askerzade İN. A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods. TJMCS. 2016;2:1–11.
MLA Ziasabounchi, Negar ve İman N. Askerzade. “A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods”. Turkish Journal of Mathematics and Computer Science, c. 2, sy. 1, 2016, ss. 1-11.
Vancouver Ziasabounchi N, Askerzade İN. A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods. TJMCS. 2016;2(1):1-11.