Robust Principal Component Analysis based on Fuzzy Coded Data
Abstract
In the presence of outliers in the dataset, the principal component analysis method, like many of the classical statistical methods, is severely affected. For this reason, if there are outliers in dataset, researchers tend to use alternative methods. Use of fuzzy and robust approaches is the leading choice among these methods. In this study, a new approach to robust fuzzy principal component analysis is proposed. This approach combines the power of both robust and fuzzy methods at the same time and collects these two approaches under the framework of principal component analysis. The performance of proposed approach called robust principal component analysis based on fuzzy coded data is examined through a set of artificial dataset that are generated by considering three different scenarios and a real dataset to observe how it is affected by the increase in sample size and changes in the rate of outliers. In light of the study's findings, it is seen that the proposed approach gives better results than the ones in the classical and robust principal component analysis in the presence of outliers in dataset.
Keywords
References
- Alkan, B. B. (2016). Robust Principal Component Analysis Based On Modified Minimum Covariance Determinant In The Presence Of Outliers (in Turkish). Alphanumeric Journal, 4(2).
- Alkan, B. B., Atakan, C., Alkan, N., (2015). A comparison of different procedures for principal component analysis in the presence of outliers, Journal of Applied Statistics, 42(8), 1716-1722.
- Asan, Z., & Greenacre, M. (2011). Biplots of fuzzy coded data. Fuzzy sets and Systems, 183(1), 57-71.
- Asan, Z., & Senturk, S. (2011). An Application of Fuzzy Coding in Multiple Correspondence Analysis for Transforming Data from Continuous to Categorical. Journal of Multiple-Valued Logic & Soft Computing, 17.
- Atkinson, A.C., (1994). Fast Very Robust Methods for the Detection of Multiple Outliers, J. Amer. Statist. Assoc. 89, 1329–1339.
- Bezdek, J. C., Ehrlich, R., & Full, W. (1984). FCM: The fuzzy c-means clustering algorithm. Computers & Geosciences, 10(2-3), 191-203.
- Calcagnì, A., Lombardi, L., & Pascali, E. (2016). A dimension reduction technique for two-mode non-convex fuzzy data. Soft Computing, 20(2), 749-762.
- Campbell, N. A., (1980). Robust procedures in multivariate analysis I: Robust covariance estimation, Applied statistics, 231-237.
Details
Primary Language
English
Subjects
Engineering
Journal Section
Research Article
Publication Date
September 30, 2017
Submission Date
May 31, 2017
Acceptance Date
August 23, 2017
Published in Issue
Year 2017 Volume: 18 Number: 3