This study revisits
the problem of optimizing the performance of mathematical word representations
for a given task. The aim is to improve performance on analogy and similarity
tasks by proposing innovative weights in place of the counting weights
conventionally used in count-based methods of generating word representations
(methods that take word co-occurrence statistics into account). Turkish was
chosen as the language of study. While compiling the corpus, the root-suffix
structure of Turkish was handled by treating each suffixed form of a word as a
new word. The performance of the proposed co-occurrence weights is analyzed as
a function of the varying parameter, and the results are presented in the
paper.
This study revisits the problem of optimizing the performance of mathematical
word representations for a specific task. By proposing innovative weights in
place of the counting weights classically used in count-based methods of
building word representations (methods that take the co-occurrence statistics
of words into account), it aims to improve performance on analogy and
similarity-finding tasks. Turkish was chosen as the language of study, and
during corpus construction the suffix-root structures particular to Turkish
were interpreted so that every word taking a suffix is treated as a new word.
The performance of the proposed co-occurrence weights is analyzed with respect
to the varying parameter, and the results are reported in the study.
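To make the idea of replacing plain counting weights concrete, the following is a minimal sketch of windowed co-occurrence counting with a pluggable weight. It is not the weighting proposed in the paper, which the abstract does not specify. The function name `cooccurrence`, the `window` size, the toy tokens, and the `1/d` distance decay are all assumptions chosen for illustration. Each suffixed surface form (for example `ev` and `evde`) is kept as its own token, in line with the corpus treatment described above.

```python
from collections import defaultdict


def cooccurrence(tokens, window=2, weight=lambda d: 1.0):
    """Accumulate weighted co-occurrence counts in a symmetric window.

    weight(d) maps the token distance d (1..window) to the amount added
    for that pair; the default of 1.0 reproduces plain counting.
    (Hypothetical helper, not taken from the paper.)
    """
    counts = defaultdict(float)
    for i, target in enumerate(tokens):
        for d in range(1, window + 1):
            j = i + d
            if j >= len(tokens):
                break
            w = weight(d)
            # Store both directions so the table is symmetric.
            counts[(target, tokens[j])] += w
            counts[(tokens[j], target)] += w
    return counts


# Toy corpus: suffixed forms are treated as distinct words.
tokens = "ev evde evler ev".split()

plain = cooccurrence(tokens)                              # classical counting
decayed = cooccurrence(tokens, weight=lambda d: 1.0 / d)  # assumed 1/d decay
```

Swapping the `weight` argument is the only change needed to compare a candidate weighting against plain counting on the same corpus.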
Primary Language | English |
---|---|
Subjects | Engineering |
Journal Section | Research Articles |
Authors | |
Publication Date | April 5, 2018 |
Submission Date | June 5, 2017 |
Acceptance Date | February 7, 2018 |
Published in Issue | Year 2018 Volume: 23 Issue: 1 |