GÜRÜLTÜLÜ SES TANIMADA REGRESYON KULLANIMI

Semih Ergin; Rifat Aykut Arapoğlu

EN TR

USE OF REGRESSION IN NOISY SPEECH RECOGNITION

Abstract

In this study, we investigated the contribution of the multiple regression to robust noisy speech recognition in improving the recognition rates. When the noisy speech recognition process is carried out; first of all, an Affine Transformation is performed in order to map the feature vectors of noisy speech into those of clean speech. After transforming, the recognition step is achieved using the Common Vector Approach (CVA). We used several multiple linear as well as non-linear regression models to improve the recognition rates by adding non-linear terms into the model during the affine transformation stage. In the experimental study, the recognition rates of the noisy speech signals with 0 dB, 5 dB, 10dB, and 20 dB Signal-to-Noise Ratio (SNR) values have been obtained. Noisy speech which has 20, 10, 5, and 0 dB SNR is obtained using MATLAB by adding white Gaussian noise on the clean speech taken from the Texas Instruments (TI) Digit Database. Improvements are observed when non-linear terms are introduced into the model.

Keywords

Regression analysis,Affine transformation,Robust speech recognition

References

Basbug, F., Swaminathan, K. & Nandkumar, S. (2003). Noise reduction and echo cancellation front-end for speech codecs. IEEE Transactions on Speech and Audio Processing, 11(1), 1-13.
Chien, J.-T. (2003). Linear regression based Bayesian predictive classification for speech recognition. IEEE Transactions on Speech and Audio Processing, 11(1), 70-79.
David, A. F. (2005). Statistical Models: Theory and Practice. Cambridge: Cambridge University Press.
Gulmezoglu, M. B., Dzhafarov, V., Keskin, M. & Barkana, A. (1999). A novel approach to isolated word recognition. IEEE Transactions on Speech and Audio Processing, 7(6), 620-628.
Gulmezoglu, M. B., Dzhafarov, V. & Barkana, A. (2001). The Common Vector Approach and its relation to Principal Component Analysis. IEEE Transactions on Speech and Audio Processing, 9(6), 655-662.
Karnjanadecha, M. & Zahorian, S. A. (2001). Signal modeling for high-performance robust isolated word recognition. IEEE Transactions on Speech and Audio Processing, 9(6), 647-654.
Lee, C., Hyun, D., Choi, E., Go, J. & Lee, C. (2003). Optimizing feature extraction for speech recognition. IEEE Transactions on Speech and Audio Processing, 11(1), 80-87.
Mammone, J. R., Zhang, X. & Ramachandran, R. P. (1996, September). Robust speaker recognition – A feature based approach. IEEE Signal Processing Magazine, 58-71.

Details

Primary Language

Turkish

Subjects

-

Journal Section

Research Article

Authors

Semih Ergin

Rifat Aykut Arapoğlu

Publication Date

October 1, 2014

Submission Date

November 13, 2013

Acceptance Date

-

Published in Issue

Year 2014

IZ

https://izlik.org/JA59DK76LY

Cite

RIS / Bibtex

APA

Ergin, S., & Arapoğlu, R. A. (2014). GÜRÜLTÜLÜ SES TANIMADA REGRESYON KULLANIMI. Dumlupınar Üniversitesi Sosyal Bilimler Dergisi, 495-502. https://izlik.org/JA59DK76LY

AMA

1.Ergin S, Arapoğlu RA. GÜRÜLTÜLÜ SES TANIMADA REGRESYON KULLANIMI. Dumlupınar Üniversitesi Sosyal Bilimler Dergisi. Published online October 1, 2014:495-502. https://izlik.org/JA59DK76LY

Chicago

Ergin, Semih, and Rifat Aykut Arapoğlu. 2014. “GÜRÜLTÜLÜ SES TANIMADA REGRESYON KULLANIMI”. Dumlupınar Üniversitesi Sosyal Bilimler Dergisi, October 1, 495-502. https://izlik.org/JA59DK76LY.

EndNote

Ergin S, Arapoğlu RA (October 1, 2014) GÜRÜLTÜLÜ SES TANIMADA REGRESYON KULLANIMI. Dumlupınar Üniversitesi Sosyal Bilimler Dergisi 495–502.

IEEE

[1]S. Ergin and R. A. Arapoğlu, “GÜRÜLTÜLÜ SES TANIMADA REGRESYON KULLANIMI”, Dumlupınar Üniversitesi Sosyal Bilimler Dergisi, pp. 495–502, Oct. 2014, [Online]. Available: https://izlik.org/JA59DK76LY

ISNAD

Ergin, Semih - Arapoğlu, Rifat Aykut. “GÜRÜLTÜLÜ SES TANIMADA REGRESYON KULLANIMI”. Dumlupınar Üniversitesi Sosyal Bilimler Dergisi. October 1, 2014. 495-502. https://izlik.org/JA59DK76LY.

JAMA

1.Ergin S, Arapoğlu RA. GÜRÜLTÜLÜ SES TANIMADA REGRESYON KULLANIMI. Dumlupınar Üniversitesi Sosyal Bilimler Dergisi. 2014;:495–502.

MLA

Ergin, Semih, and Rifat Aykut Arapoğlu. “GÜRÜLTÜLÜ SES TANIMADA REGRESYON KULLANIMI”. Dumlupınar Üniversitesi Sosyal Bilimler Dergisi, Oct. 2014, pp. 495-02, https://izlik.org/JA59DK76LY.

Vancouver

1.Semih Ergin, Rifat Aykut Arapoğlu. GÜRÜLTÜLÜ SES TANIMADA REGRESYON KULLANIMI. Dumlupınar Üniversitesi Sosyal Bilimler Dergisi [Internet]. 2014 Oct. 1;495-502. Available from: https://izlik.org/JA59DK76LY