Continuous time threshold selection for binary classification on polarized data
Abstract
Binary
classification is used to distinguish some of the data elements from others in
a meaningful way according to certain characteristics. Supervised classification techniques often
use the ground-truth data, which assists to determine the distinctive characteristics
of the elements to be extracted from the data. These techniques also generate
new features for all of the data using the current features in accordance with
the ground-truth data. One of the purposes of generating new features is to
polarize the data elements (to be extracted and others) toward the separate
pools on a coordinate axis for binary classification. In this way, the binary
classification process is easy using only a threshold value on the axis. In
this work, the Linear Discriminant Analysis (LDA) is used to polarize the data
and a threshold selection algorithm is proposed, which use the harmonic mean
F-score values of the binary classification outputs resulting from some
specific threshold values. The key condition in the proposed method is that the
most suitable threshold must give the best classification score (F-score value)
and other threshold values must give lower classification scores as they become
distant from the best threshold value (move away toward the ends of the axis).
The proposed method is experimented for binary classifications of some
meaningful elements on a remote sensing image taken from a 2D semantic
labelling dataset that has the ground-truth images. The proposed method
convergences the best threshold value continuously in logarithmic time.
Keywords
Kaynakça
- Lu D, Weng Q. “A survey of image classification methods and techniques for improving classification performance”. International Journal of Remote Sensing, 28(5), 823-870, 2007.
- Wang W, Yang N, Zhang Y, Wang F, Cao T, Eklund P. “A review of road extraction from remote sensing images”. Journal of Traffic and Transportation Engineering, 3(3), 271-282, 2016.
- Saglam A, Baykan NA. “A satellite image classification approach by using one dimensional discriminant analysis”. ISPRS-International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, XLII-3/W4, 429-435, 2018.
- Fisher RA. “The use of multiple measures in taxonomic problems”. Annals of Eugenics, 7(2), 179-188, 1936.
- Duda RO, Hart PE, Stork DG. Pattern Classification. New York, USA, Wiley, 2000.
- Martis RJ, Acharya UR, Min LC. “ECG beat classification using PCA, LDA, ICA and discrete wavelet transform”. Biomedical Signal Processing and Control, 8(5), 437-448, 2013.
- Lipton ZC, Elkan C, Naryanaswamy B. “Optimal thresholding of classifiers to maximize F1 measure”. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8725(2), 225-239, 2014.
- Sanchez IE, Belgium A, Brun M. “Optimal threshold estimation for binary classifiers using game theory”. International Society for Computational Biology Community Journal, 5(5), 1-11, 2016.
Ayrıntılar
Birincil Dil
İngilizce
Konular
Mühendislik
Bölüm
Araştırma Makalesi
Yayımlanma Tarihi
21 Ekim 2019
Gönderilme Tarihi
7 Ağustos 2018
Kabul Tarihi
-
Yayımlandığı Sayı
Yıl 2019 Cilt: 25 Sayı: 5