BibTex RIS Cite

A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods

Year 2014, Volume 2, 2014, 1 - 11, 26.05.2016

Abstract

In this article, we propose clustering approach based on Principal Component Analysis (PCA) to diagnosis of heart disease patients. At the first stage, the original dataset is reduced using PCA reduction method. Then, at the second stage, reduced dataset is applied to clustering methods which is based on fuzzy C-means and K-means algorithms. These algorithms are implemented and tested on a Cleveland heart disease dataset. We compared the clustering results with and without PCA. The results are suggesting that the combination of clustering algorithms and PCA was the most effective at heart disease diagnosis.

References

  • World Health Organization, http://www.who.int/topics/cardiovascular diseases/en/
  • Patil, S. B., & Kumaraswamy, Y. S., Intelligent and effective heart attack prediction system using data mining and artificial neural network,European Journal of Scientific Research,31(4),642-656, 2009.
  • Maimon,O. &Rokach, L., Data Mining and Knowledge Discovery Handbook, Springer, 2010.
  • Jolliffe, I., Principal Component Analysis, Springer, 2nd edition, 2002.
  • Ding, C., He, X., K-means Clustering via Principal Component Analysis, International Conference on machine learning , Banff, Canada, 2004.
  • Ziasabounchi, N. & Askerzade, I., ANFIS based classification model for heart disease diagnosis, international Journal of Engineering & Computer Sciences,14(2),7-12, 2014.
  • Patil, B.M., Joshi, R.C., Toshnival. D., Hybrid prediction model for type-2 diabetic patients, Expert systems with applications,37(12), 8102-8108, 2010.
  • Singh, N., Mohapatra, A.G. & Kanungo, G., Breast cancer mass detection in mammograms using K-means and fuzzy C-means clustering, International Journal of Computer Applications,22(2),15-21, 2011.
  • Chitra, R. & Seenivasagam, V., Heart attack prediction system using fuzzy C-mean classifier, IOSR Journal of Computer Engineering,14,23-31, 2013.
  • Kahramanli, H. & Allahverdi, N., Design of a hybrid system for the diabetes and heart disease, Expert systems with applications,35,82-89, 2008.
  • Askerzade, I.N. & Mahmud, M., Design and implementation of group traffic control system using fuzzy logic, International Journal of Research and Reviews in Applied Sciences,6,196-202, 2011.
  • Askerzade, I.N. & Mahmud, M., Control the extension time of traffic light in single junction by using fuzzy logic, International Journal of Electrical & Computer Sciences,10(2),48-55, 2011.
  • Sundar, B., Devi, T., & Saravanan, N., Development of a clustering algorithm for prediction heart, International Journal of Computer Application,48(7),8-13, 2013.
  • Yin, J., Sun, H., Yang,J., & Gou, Q., Comparison of K-Means and fuzzy C-means algorithm performance for automated determination of the arterial input function, PLoS ONE, 9(2),e85884, 2014.
  • Howley, T., Madden, G.M., Connel, M.L. & Ryder, G.A., The effect of principal component analysis on machine learning accuracy with high dimensional spectral data, Knowledge Based Systems,19(5),363–370, 2006.
  • Indhumathi, R. & Sathiyabama, S., Reducing and clustering high dimensional data through principal component analysis, International Journal of computer Application,11(8),1-4, 2010.
  • Avci, E., Turkoglu, I., An intelligent diagnosis system based on principle component analysis and ANFIS for the heart valve diseases, Journal of Expert Systems with Application,36,2873-2878, 2009.
  • Napoleon, D. & Pavalakodi, S., A new method for dimensionality reduction using K- means clustering algorithm for high dimensional data set , International Journal of Computer Applications, 13(7),41-46, 2011.
  • Fan, J., Han, M., & Wang, J., Single point iterative weighted fuzzy C-means clustering algorithm for remote sensing image segmentation, Pattern Recognition,42(11),2527–2540, 2009.
  • Gath, I. & Geva, A.B., Unsupervised optimal fuzzy clustering, IEEE Transaction on Pattern Analysis Machine Intelligence, 11,773-780, 1989.
  • Ross, T., Fuzzy logic with engineering applications, New York: McGraw Hill Co., 1995.
  • Altun, S., Okur, V., Goktepe, A.B. & Ansal, A., Comparison of Dynamic Properties of Clays Obtained by Different Test Methods, 4th International Conference on Earthquake Geotechnical Engineering ,2007.
  • Subbuthai, P., Periasamy, A., & Muruganand, S., Identifying the character by applying PCA method using Matlab, International Journal of Computer Applications ,60(1),8-11, 2012.
  • Zhu, W., Zeng, N. & Wang, N., Sensitivity, Specificity, Accuracy, Associated Confidence Interval and ROC Analysis with Practical SAS Implementations, 2010.
  • Hammouda, K. & Karray, F., A comparative study of data clustering techniques, SYDE 625: Tools of intelligent system design, course project, 2000.
Year 2014, Volume 2, 2014, 1 - 11, 26.05.2016

Abstract

References

  • World Health Organization, http://www.who.int/topics/cardiovascular diseases/en/
  • Patil, S. B., & Kumaraswamy, Y. S., Intelligent and effective heart attack prediction system using data mining and artificial neural network,European Journal of Scientific Research,31(4),642-656, 2009.
  • Maimon,O. &Rokach, L., Data Mining and Knowledge Discovery Handbook, Springer, 2010.
  • Jolliffe, I., Principal Component Analysis, Springer, 2nd edition, 2002.
  • Ding, C., He, X., K-means Clustering via Principal Component Analysis, International Conference on machine learning , Banff, Canada, 2004.
  • Ziasabounchi, N. & Askerzade, I., ANFIS based classification model for heart disease diagnosis, international Journal of Engineering & Computer Sciences,14(2),7-12, 2014.
  • Patil, B.M., Joshi, R.C., Toshnival. D., Hybrid prediction model for type-2 diabetic patients, Expert systems with applications,37(12), 8102-8108, 2010.
  • Singh, N., Mohapatra, A.G. & Kanungo, G., Breast cancer mass detection in mammograms using K-means and fuzzy C-means clustering, International Journal of Computer Applications,22(2),15-21, 2011.
  • Chitra, R. & Seenivasagam, V., Heart attack prediction system using fuzzy C-mean classifier, IOSR Journal of Computer Engineering,14,23-31, 2013.
  • Kahramanli, H. & Allahverdi, N., Design of a hybrid system for the diabetes and heart disease, Expert systems with applications,35,82-89, 2008.
  • Askerzade, I.N. & Mahmud, M., Design and implementation of group traffic control system using fuzzy logic, International Journal of Research and Reviews in Applied Sciences,6,196-202, 2011.
  • Askerzade, I.N. & Mahmud, M., Control the extension time of traffic light in single junction by using fuzzy logic, International Journal of Electrical & Computer Sciences,10(2),48-55, 2011.
  • Sundar, B., Devi, T., & Saravanan, N., Development of a clustering algorithm for prediction heart, International Journal of Computer Application,48(7),8-13, 2013.
  • Yin, J., Sun, H., Yang,J., & Gou, Q., Comparison of K-Means and fuzzy C-means algorithm performance for automated determination of the arterial input function, PLoS ONE, 9(2),e85884, 2014.
  • Howley, T., Madden, G.M., Connel, M.L. & Ryder, G.A., The effect of principal component analysis on machine learning accuracy with high dimensional spectral data, Knowledge Based Systems,19(5),363–370, 2006.
  • Indhumathi, R. & Sathiyabama, S., Reducing and clustering high dimensional data through principal component analysis, International Journal of computer Application,11(8),1-4, 2010.
  • Avci, E., Turkoglu, I., An intelligent diagnosis system based on principle component analysis and ANFIS for the heart valve diseases, Journal of Expert Systems with Application,36,2873-2878, 2009.
  • Napoleon, D. & Pavalakodi, S., A new method for dimensionality reduction using K- means clustering algorithm for high dimensional data set , International Journal of Computer Applications, 13(7),41-46, 2011.
  • Fan, J., Han, M., & Wang, J., Single point iterative weighted fuzzy C-means clustering algorithm for remote sensing image segmentation, Pattern Recognition,42(11),2527–2540, 2009.
  • Gath, I. & Geva, A.B., Unsupervised optimal fuzzy clustering, IEEE Transaction on Pattern Analysis Machine Intelligence, 11,773-780, 1989.
  • Ross, T., Fuzzy logic with engineering applications, New York: McGraw Hill Co., 1995.
  • Altun, S., Okur, V., Goktepe, A.B. & Ansal, A., Comparison of Dynamic Properties of Clays Obtained by Different Test Methods, 4th International Conference on Earthquake Geotechnical Engineering ,2007.
  • Subbuthai, P., Periasamy, A., & Muruganand, S., Identifying the character by applying PCA method using Matlab, International Journal of Computer Applications ,60(1),8-11, 2012.
  • Zhu, W., Zeng, N. & Wang, N., Sensitivity, Specificity, Accuracy, Associated Confidence Interval and ROC Analysis with Practical SAS Implementations, 2010.
  • Hammouda, K. & Karray, F., A comparative study of data clustering techniques, SYDE 625: Tools of intelligent system design, course project, 2000.
There are 25 citations in total.

Details

Other ID JA22TC24SV
Journal Section Articles
Authors

Negar Ziasabounchi This is me

İman N. Askerzade This is me

Publication Date May 26, 2016
Published in Issue Year 2014 Volume 2, 2014

Cite

APA Ziasabounchi, N., & Askerzade, İ. N. (2016). A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods. Turkish Journal of Mathematics and Computer Science, 2(1), 1-11.
AMA Ziasabounchi N, Askerzade İN. A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods. TJMCS. May 2016;2(1):1-11.
Chicago Ziasabounchi, Negar, and İman N. Askerzade. “A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods”. Turkish Journal of Mathematics and Computer Science 2, no. 1 (May 2016): 1-11.
EndNote Ziasabounchi N, Askerzade İN (May 1, 2016) A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods. Turkish Journal of Mathematics and Computer Science 2 1 1–11.
IEEE N. Ziasabounchi and İ. N. Askerzade, “A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods”, TJMCS, vol. 2, no. 1, pp. 1–11, 2016.
ISNAD Ziasabounchi, Negar - Askerzade, İman N. “A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods”. Turkish Journal of Mathematics and Computer Science 2/1 (May 2016), 1-11.
JAMA Ziasabounchi N, Askerzade İN. A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods. TJMCS. 2016;2:1–11.
MLA Ziasabounchi, Negar and İman N. Askerzade. “A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods”. Turkish Journal of Mathematics and Computer Science, vol. 2, no. 1, 2016, pp. 1-11.
Vancouver Ziasabounchi N, Askerzade İN. A Comparative Study of Heart Disease Prediction Based on Principal Component Analysis and Clustering Methods. TJMCS. 2016;2(1):1-11.