Research Article

Detection of Voice Pathologies with Information Theory Based Features

Volume: 39 Number: 1 January 10, 2026
EN

Detection of Voice Pathologies with Information Theory Based Features

Abstract

Voice is produced by excited dynamic vocal tract system consisting of vocal cords, lips, tongue, with the air from lungs. Pathological cases in these components change the characteristics of the voice. Voice diseases impair quality of life, and some pathologies can be life-threatening, so early diagnosis is important. In literature, voice disorders can be generally investigated with traditional or acoustics signal analysis methods and detected by using machine learning. In this study, analysis and classification studies were realized for detecting vocal cord deformities such as cyst, polyp and sulcus vocalis, and healthy ones. Unlike literature, instead of acoustic features, features based on information theory such as recurrence plot quantities, entropies and dimensions coming from nonlinear approaches were used. Three feature selection and three classification methods were tried with nonlinear features calculated from whole signal and decomposed signal with discrete wavelet transform. Two classification procedures (binary and multiple class) with 10-fold cross validation were applied and tested with distinct groups. According to the results, in binary classification (healthy/ diseased), the best test accuracy of 99.2% with Coarse Tree classifier was obtained with only four features selected with MRMR algorithm. In multiclass (healthy/cyst/polyp/sulcus), the test accuracy of 91.6% was found as the best with only eight selected features by using ANOVA and Kernel Naive Bayes classifier. The results show that the use of a few features coming from nonlinear domain provided very effective and successful in classifying voice disorders in both binary and multiclass when compared to the studies in the literature.

Keywords

Thanks

The authors would like to thank Prof. Dr. M. Akif Kılıç for giving permission the using data.

References

  1. [1] Bhattacharyya, N., “The prevalence of voice problems among adults in the United States”, The Laryngoscope, 124: 2359-2362, (2014). DOI: https://doi.org/10.1002/lary.24740
  2. [2] Hossain, M. S., Muhammad, G., Alamri, A., “Smart healthcare monitoring: A voice pathology detection paradigm for smart cities”, Multimedia Systems, 25: 565-575, (2017). DOI: https://doi.org/10.1007/s00530-017-0561-x
  3. [3] Kılıç M., “Ses problemi olan hastanın objektif ve subjektif yöntemlerle değerlendirilmesi”, Kulak Burun Boğaz Baş Boyun Cerrahisi’nde Güncel Yaklaşım, 6, (2010).
  4. [4] Hegde, S., Shetty, S., Rai, S., Dodderi, T., “A survey on machine learning approaches for automatic detection of voice disorders”, Journal of Voice, 33(6): 947.e11-947.e33, (2019). DOI: https://doi.org/10.1016/j.jvoice.2018.07.014
  5. [5] Mohamed, A. N., Salama, A. A., Darwish, S. H., “Classification of voice pathology using SVM classifier”, International Telecommunications Conference (ITC-Egypt), Alexandria, Egypt, 419-422, (2023). DOI: https://doi.org/10.1109/ITC-Mısır58155.2023.10206116
  6. [6] Gayathri, S., Priya, E., “Identification of voice pathology from temporal and cepstral features for vowel ‘a’ low intonation”, International Conference on Electronic Systems and Intelligent Computing (ICESIC), Chennai, India, 345-350, (2022). DOI: https://doi.org/10.1109/ICESIC53714.2022.9783484
  7. [7] Murthy, G. L. N., Iswarya, V., Ramya Sri, K., Harshitha, K., Prasanth, Ch., “A novel algorithm for detecting spasmodic dysphonia voice pathology using random forest framework”, International Conference on Edge Computing and Applications (ICECAA), Tamilnadu, India, 800-804, (2022). DOI: https://doi.org/10.1109/ICECAA55415.2022.9936567
  8. [8] Miliaresi, I., Poutos, K., Pikrakis, A., “Combining acoustic features and medical data in deep learning networks for voice pathology classification”, 28th European Signal Processing Conference (EUSIPCO), Amsterdam, Netherlands, 1190-1194, (2021). DOI: https://doi.org/10.23919/Eusipco47968.2020.9287333

Details

Primary Language

English

Subjects

Audio Processing, Machine Learning (Other)

Journal Section

Research Article

Early Pub Date

January 10, 2026

Publication Date

January 10, 2026

Submission Date

December 23, 2024

Acceptance Date

November 3, 2025

Published in Issue

Year 2026 Volume: 39 Number: 1

APA
Aküren, E., & Yılmaz, D. (2026). Detection of Voice Pathologies with Information Theory Based Features. Gazi University Journal of Science, 39(1), 113-128. https://doi.org/10.35378/gujs.1606193
AMA
1.Aküren E, Yılmaz D. Detection of Voice Pathologies with Information Theory Based Features. Gazi University Journal of Science. 2026;39(1):113-128. doi:10.35378/gujs.1606193
Chicago
Aküren, Ecenur, and Derya Yılmaz. 2026. “Detection of Voice Pathologies With Information Theory Based Features”. Gazi University Journal of Science 39 (1): 113-28. https://doi.org/10.35378/gujs.1606193.
EndNote
Aküren E, Yılmaz D (March 1, 2026) Detection of Voice Pathologies with Information Theory Based Features. Gazi University Journal of Science 39 1 113–128.
IEEE
[1]E. Aküren and D. Yılmaz, “Detection of Voice Pathologies with Information Theory Based Features”, Gazi University Journal of Science, vol. 39, no. 1, pp. 113–128, Mar. 2026, doi: 10.35378/gujs.1606193.
ISNAD
Aküren, Ecenur - Yılmaz, Derya. “Detection of Voice Pathologies With Information Theory Based Features”. Gazi University Journal of Science 39/1 (March 1, 2026): 113-128. https://doi.org/10.35378/gujs.1606193.
JAMA
1.Aküren E, Yılmaz D. Detection of Voice Pathologies with Information Theory Based Features. Gazi University Journal of Science. 2026;39:113–128.
MLA
Aküren, Ecenur, and Derya Yılmaz. “Detection of Voice Pathologies With Information Theory Based Features”. Gazi University Journal of Science, vol. 39, no. 1, Mar. 2026, pp. 113-28, doi:10.35378/gujs.1606193.
Vancouver
1.Ecenur Aküren, Derya Yılmaz. Detection of Voice Pathologies with Information Theory Based Features. Gazi University Journal of Science. 2026 Mar. 1;39(1):113-28. doi:10.35378/gujs.1606193