The Effect of Data Augmentation on Performance in Classifying Speakers as Female, Male, and Child
Abstract
Keywords
References
- Arakawa, R., Takamichi, S., & Saruwatari, H. (2019). Implementation of DNN-based real-time voice conversion and its improvements by audio data augmentation and mask-shaped device. In Proc. ISCA Workshop Speech Synthesis (pp. 93–98). Vienna, Austria.
- Bhatt, D., Patel, C., Talsania, H., Patel, J., Vaghela, R., Pandya, S., ... & Ghayvat, H. (2021). CNN variants for computer vision: History, architecture, application, challenges and future scope. Electronics, 10(20), 2470.
- Bishop, C. M. (1995). Training with noise is equivalent to Tikhonov regularization. Neural Computation, 7(1), 108–116.
- Chai, J., Zeng, H., Li, A., & Ngai, E. W. (2021). Deep learning in computer vision: A critical review of emerging techniques and application scenarios. Machine Learning with Applications, 6, 100134.
- Dehak, N., Kenny, P. J., Dehak, R., Dumouchel, P., & Ouellet, P. (2011). Front-end factor analysis for speaker verification. IEEE Trans. Audio Speech Lang. Process., 19(4), 788–798.
- Ertam, F. (2019). An effective gender recognition approach using voice data via deeper LSTM networks. Appl. Acoust., 156, 351–358.
- Gerosa, M., Giuliani, D., & Brugnara, F. (2005). Speaker adaptive acoustic modeling with mixture of adult and children's speech. In Interspeech (pp. 2193–2196). Lisbon, Portugal.
- Gupta, A., Harrison, P. J., Wieslander, H., Pielawski, N., Kartasalo, K., Partel, G., ... & Wählby, C. (2019). Deep learning in image cytometry: A review. Cytometry Part A, 95(4), 366–380.
Details
Primary Language
Turkish
Subjects
Computer Software
Journal Section
Research Article
Authors
Ergün Yücesoy
*
0000-0003-1707-384X
Türkiye
Early Pub Date
August 27, 2024
Publication Date
September 1, 2024
Submission Date
June 26, 2024
Acceptance Date
July 21, 2024
Published in Issue
Year 2024 Volume: 14 Number: 3