Research Article

Comparison of Different Training Data Reduction Approaches for Fast Support Vector Machines Based on Principal Component Analysis and Distance Based Measurements

Volume: 4 Number: 1 March 15, 2018
EN

Comparison of Different Training Data Reduction Approaches for Fast Support Vector Machines Based on Principal Component Analysis and Distance Based Measurements

Abstract

Support vector machine is a supervised learning algorithm, which is recommended for classification and nonlinear function approaches. Support vector machines require remarkable amount of memory with time consuming process for large data sets in the training process. The main reason for this is the solving a constrained quadratic programming problem within the algorithm. In this paper, we proposed three approaches for identifying the non-critical points in training set and remove these non-critical points from the original training set for speeding up the training process of support vector machine. For this purpose, we used principal component analysis, Mahalanobis distance and Euclidean distance based measurements for the elimination of non-critical training instances in the training set. We compared the proposed methods in terms of computational time and classification accuracy between each other and conventional support vector machine. Our experimental results show that principal component analysis and Mahalanobis distance based proposed methods have positive effects on computational time without degrading the classification results than the Euclidean distance based proposed method and conventional support vector machine.

Keywords

References

  1. Vapnik V. “The nature of statistical learning theory” (Springer-Verlag, New York, 1995).
  2. Cervantes J., X. Li, W. Yu, “Support vector classification for large data sets by reducing training data with change of classes”, International conference on systems, man. and cybernetics, IEEE, 2008, 2609-2614.
  3. Javed I., M.N. Ayyaz, W. Mehmood “Efficient training data reduction for SVM based handwritten digits’ recognition”, International conference on electrical engineering, 2007, 1-4.
  4. Koggalage R., S. Halgamuge “Reducing the number of training samples for fast support vector machine classification”, Neural information processing - letters and reviews, vol. 2, no. 3, 2004, 57-65.
  5. Fortuna J., D. Capson, “Improved support vector classification using PCA and ICA feature space modification”, Pattern recognition, 37, 2004, 1117-1129.
  6. Cao L. J., K.S. Chua, W.K. Chong, H.P. Lee, Q.M. Gu “A comparison of PCA, KPCA and ICA for dimensionality reduction in support vector machine”, Neurocomputing, 55, 2003, 321-336.
  7. Subasi A., M.I. Gursoy “EEG signal classification using PCA, ICA, LDA and support vector machines”, Expert systems with applications, vol. 37, no. 12, 2010, 8659-8666.
  8. Gertych A., A. Zhang, J. Sayre, S.P. Kurkowska, H.K. Huang “Bone age assessment of children using a digital hand atlas”, Computerized Medical Imaging and Graphics, 31, 2007, 322-331.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Authors

Harun Uğuz
Türkiye

Publication Date

March 15, 2018

Submission Date

January 3, 2018

Acceptance Date

March 6, 2018

Published in Issue

Year 2018 Volume: 4 Number: 1

APA
Güraksın, G. E., & Uğuz, H. (2018). Comparison of Different Training Data Reduction Approaches for Fast Support Vector Machines Based on Principal Component Analysis and Distance Based Measurements. International Journal of Computational and Experimental Science and Engineering, 4(1), 1-5. https://doi.org/10.22399/ijcesen.374222
AMA
1.Güraksın GE, Uğuz H. Comparison of Different Training Data Reduction Approaches for Fast Support Vector Machines Based on Principal Component Analysis and Distance Based Measurements. IJCESEN. 2018;4(1):1-5. doi:10.22399/ijcesen.374222
Chicago
Güraksın, Gür Emre, and Harun Uğuz. 2018. “Comparison of Different Training Data Reduction Approaches for Fast Support Vector Machines Based on Principal Component Analysis and Distance Based Measurements”. International Journal of Computational and Experimental Science and Engineering 4 (1): 1-5. https://doi.org/10.22399/ijcesen.374222.
EndNote
Güraksın GE, Uğuz H (March 1, 2018) Comparison of Different Training Data Reduction Approaches for Fast Support Vector Machines Based on Principal Component Analysis and Distance Based Measurements. International Journal of Computational and Experimental Science and Engineering 4 1 1–5.
IEEE
[1]G. E. Güraksın and H. Uğuz, “Comparison of Different Training Data Reduction Approaches for Fast Support Vector Machines Based on Principal Component Analysis and Distance Based Measurements”, IJCESEN, vol. 4, no. 1, pp. 1–5, Mar. 2018, doi: 10.22399/ijcesen.374222.
ISNAD
Güraksın, Gür Emre - Uğuz, Harun. “Comparison of Different Training Data Reduction Approaches for Fast Support Vector Machines Based on Principal Component Analysis and Distance Based Measurements”. International Journal of Computational and Experimental Science and Engineering 4/1 (March 1, 2018): 1-5. https://doi.org/10.22399/ijcesen.374222.
JAMA
1.Güraksın GE, Uğuz H. Comparison of Different Training Data Reduction Approaches for Fast Support Vector Machines Based on Principal Component Analysis and Distance Based Measurements. IJCESEN. 2018;4:1–5.
MLA
Güraksın, Gür Emre, and Harun Uğuz. “Comparison of Different Training Data Reduction Approaches for Fast Support Vector Machines Based on Principal Component Analysis and Distance Based Measurements”. International Journal of Computational and Experimental Science and Engineering, vol. 4, no. 1, Mar. 2018, pp. 1-5, doi:10.22399/ijcesen.374222.
Vancouver
1.Gür Emre Güraksın, Harun Uğuz. Comparison of Different Training Data Reduction Approaches for Fast Support Vector Machines Based on Principal Component Analysis and Distance Based Measurements. IJCESEN. 2018 Mar. 1;4(1):1-5. doi:10.22399/ijcesen.374222

Cited By