Research Article

Co-occurrence Weight Selection for Word Embeddings to Enhance Test Performance

Volume: 23 Number: 1 April 5, 2018
EN TR

Co-occurrence Weight Selection for Word Embeddings to Enhance Test Performance

Abstract

This study revisits the problem of maximizing the performance of mathematical word representations for a given task. It is aimed to improve performance in analogy and similarity tasks by suggesting innovative weights instead of the counting weights used conventionally in counting-based methods of generating word representations (adding the statistics of word co-occurrences to the account). The language of study was selected as Turkish. The root structures of Turkish words were managed during the compilation of corpus such that each word having a suffix was considered as a new word. The performance of the proposed co-occurrence weights are analyzed with respect to the varying parameter and the results are presented within the paper.

Keywords

References

  1. Bahdanau, D., Cho, K. and Bengio, Y. (2014). Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv: 1409.0473.
  2. Bengio, Y., Ducharme, R., Vincent, P., and Jauvin C. (2003). A neural probabilistic language model. Journal of machine learning research, 1137 – 1155. doi: 10.1162/153244303322533223
  3. Bojanowski, P., Grave, E., Joulin, A., ve Mikolov, T. (2016). Enriching word vectors with subword information. arXiv preprint arXiv:1607.04606.
  4. Faruqui, M., Dodge, J. , Jauhar, S. K., Dyer, C., Hovy, E. ve Smith, N. A. (2014) Retrofitting word vectors to semantic lexicons, arXiv preprint arXiv:1411.4166. doi: 10.3115/v1/N15-1184
  5. Firth, J. R., (1957). A synopsis of linguistic theory 1930-1955. In Studies in linguistic analysis, 1-32. Oxford:Blackwell.
  6. Huth, A.G., de Heer, W.A., Griffiths, T.L., Theunissen, F.E. and Gallant, J.L. (2016) Natural speech reveals the semantic map that tile human cerebral cortex. Nature, vol. 532, no. 7600, 453 – 458. doi:10.1038/nature17637
  7. Karpathy, A. and Fei-Fei, L. (2016). Deep visual-semantic alignments for generating image descriptions. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39 (4), 664-676. doi: 10.1109/TPAMI.2016.2598339
  8. Krizhevsky, A., Sutskever, I., and Hinton, G. E. (2012). Imagenet classification with deep convolutional neural netwroks. In Advances in neural information processing systems, 1097-1105. doi: 10.1145/3065386

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Authors

Aykut Koç
ASELSAN
Türkiye

Veysel Yücesoy
ASELSAN
Türkiye

Publication Date

April 5, 2018

Submission Date

June 5, 2017

Acceptance Date

February 7, 2018

Published in Issue

Year 2018 Volume: 23 Number: 1

APA
Koç, A., & Yücesoy, V. (2018). Co-occurrence Weight Selection for Word Embeddings to Enhance Test Performance. Uludağ Üniversitesi Mühendislik Fakültesi Dergisi, 23(1), 31-40. https://doi.org/10.17482/uumfd.318615
AMA
1.Koç A, Yücesoy V. Co-occurrence Weight Selection for Word Embeddings to Enhance Test Performance. UUJFE. 2018;23(1):31-40. doi:10.17482/uumfd.318615
Chicago
Koç, Aykut, and Veysel Yücesoy. 2018. “Co-Occurrence Weight Selection for Word Embeddings to Enhance Test Performance”. Uludağ Üniversitesi Mühendislik Fakültesi Dergisi 23 (1): 31-40. https://doi.org/10.17482/uumfd.318615.
EndNote
Koç A, Yücesoy V (April 1, 2018) Co-occurrence Weight Selection for Word Embeddings to Enhance Test Performance. Uludağ Üniversitesi Mühendislik Fakültesi Dergisi 23 1 31–40.
IEEE
[1]A. Koç and V. Yücesoy, “Co-occurrence Weight Selection for Word Embeddings to Enhance Test Performance”, UUJFE, vol. 23, no. 1, pp. 31–40, Apr. 2018, doi: 10.17482/uumfd.318615.
ISNAD
Koç, Aykut - Yücesoy, Veysel. “Co-Occurrence Weight Selection for Word Embeddings to Enhance Test Performance”. Uludağ Üniversitesi Mühendislik Fakültesi Dergisi 23/1 (April 1, 2018): 31-40. https://doi.org/10.17482/uumfd.318615.
JAMA
1.Koç A, Yücesoy V. Co-occurrence Weight Selection for Word Embeddings to Enhance Test Performance. UUJFE. 2018;23:31–40.
MLA
Koç, Aykut, and Veysel Yücesoy. “Co-Occurrence Weight Selection for Word Embeddings to Enhance Test Performance”. Uludağ Üniversitesi Mühendislik Fakültesi Dergisi, vol. 23, no. 1, Apr. 2018, pp. 31-40, doi:10.17482/uumfd.318615.
Vancouver
1.Aykut Koç, Veysel Yücesoy. Co-occurrence Weight Selection for Word Embeddings to Enhance Test Performance. UUJFE. 2018 Apr. 1;23(1):31-40. doi:10.17482/uumfd.318615

Announcements:

30.03.2021-Beginning with our April 2021 (26/1) issue, in accordance with the new criteria of TR-Dizin, the Declaration of Conflict of Interest and the Declaration of Author Contribution forms fulfilled and signed by all authors are required as well as the Copyright form during the initial submission of the manuscript. Furthermore two new sections, i.e. ‘Conflict of Interest’ and ‘Author Contribution’, should be added to the manuscript. Links of those forms that should be submitted with the initial manuscript can be found in our 'Author Guidelines' and 'Submission Procedure' pages. The manuscript template is also updated. For articles reviewed and accepted for publication in our 2021 and ongoing issues and for articles currently under review process, those forms should also be fulfilled, signed and uploaded to the system by authors.