Research Article

Analysis of Word Similarities in Tax Laws Using the Word2Vec Method

Volume: 1 Number: 1 January 30, 2025
EN

Analysis of Word Similarities in Tax Laws Using the Word2Vec Method

Abstract

This paper describes word similarity analysis in tax law using the Word2Vec model. By similarity analysis, we mean identifying relationships between similar terms in tax terminology. The Word2Vec model represents the meanings of words with vectors and identifies the semantic relationships of words through the proximity between these vectors. This article analyzes the semantic proximity of terms frequently used in tax law and visualises the relationships between these words. For example, the close relationships of the word ‘mükellef’ with words such as ‘kişi’, ‘tam’, ‘dar’, ‘firma’, and ‘imalatçı’ are represented through vectors. The paper also explains the mathematical structure of the models. Then, the features of the NumPy, Gensim, Scikit-learn, and Matplotlib libraries of the Python programming language are explained and used for this paper. For the visualisation of the similarity analysis, the t-SNE algorithm, which allows the visualisation of highdimensional data on a two-dimensional plane, was used. The main purpose of this paper is to enable AI systems that can be used as tax advisors to better understand tax law by modelling the conceptual relationships between the terms of tax law, thus contributing to the provision of more accurate and consistent information by AI.

Keywords

References

  1. Mikolov, Tomas, Chen, Kai, Corrado, Greg, and Dean, Jeffrey. Efficient Estimation of Word Representations on Vector Space. arXiv preprint arXiv:1301.3781 (2013). google scholar
  2. Pervan, Nergis. DERİN ÖĞRENME YAKLAŞIMLARI KULLANARAK TÜRKÇE METİNLERDEN ANLAMSAL ÇIKARIM YAPMA. Ankara, 2019. google scholar
  3. Onan, Aytuğ. Evrişimli Sinir Ağı Mimarilerine Dayalı Türkçe Duygu Analizi. Avrupa Bilim ve Teknoloji Dergisi (Aug. 31,2020), 374-380. google scholar
  4. Tezgider, Murat, Yıldız, Beytullah, and Aydın, Galip. Improving Word Representation by Tuning Word2Vec Parameters with Deep Learning Model. İn International Artificial Intelligence and Data Processing Symposium (IDAP) (Malatya 2018), IEEE, 1-7. google scholar
  5. Arabacı, Mehmet Ali, Esen, Ersin, Atar, Muhammed Selim, Yılmaz, Eyüp, and Kaltalıoğlu, Batuhan. Kelime Gömevi Yöntemi Kullanarak Benzer Cümle Tespiti. In 2018 26th Signal Processing and Communications Applications Conference ( 2018). google scholar
  6. Aydoğan, Murat and Karcı, Ali. Kelime Temsil Yöntemleri ile Kelime Benzerliklerinin İncelenmesi. Çukurova Üniversitesi Mühendislik Mimarlık Fakültesi Dergisi, 34, 2 (June 2019), 181-195. google scholar
  7. Acı, Çiğdem İnan and Çırak, Adem. Türkçe Haber Metinlerinin Konvolüsyonel Sinir Ağları ve Word2Vec Kullanılarak Sınıflandırılması. BİLİŞİM TEKNOLOJİLERİ DERGİSİ, 12, 13 (July 31, 2019), 219-228. google scholar
  8. Xia, Chunyu, He, Tieke, Li, Wenlong, Qin, Zemin, and Zou, Zhipeng. Similarity Analysis of Law Documents Based on Word2vec. In 2019 IEEE 19th International Conference on Software Quality, Reliability and Security Companion (QRS-C) (Sofia 2019), IEEE, 354-357. google scholar

Details

Primary Language

English

Subjects

Artificial Intelligence (Other)

Journal Section

Research Article

Publication Date

January 30, 2025

Submission Date

December 6, 2024

Acceptance Date

January 3, 2025

Published in Issue

Year 2025 Volume: 1 Number: 1

APA
Çilingir, A. İ. Ö. (2025). Analysis of Word Similarities in Tax Laws Using the Word2Vec Method. Journal of Data Analytics and Artificial Intelligence Applications, 1(1), 84-109. https://izlik.org/JA98BH56JK
AMA
1.Çilingir AİÖ. Analysis of Word Similarities in Tax Laws Using the Word2Vec Method. Journal of Data Analytics and Artificial Intelligence Applications. 2025;1(1):84-109. https://izlik.org/JA98BH56JK
Chicago
Çilingir, Ali İhsan Özgür. 2025. “Analysis of Word Similarities in Tax Laws Using the Word2Vec Method”. Journal of Data Analytics and Artificial Intelligence Applications 1 (1): 84-109. https://izlik.org/JA98BH56JK.
EndNote
Çilingir AİÖ (January 1, 2025) Analysis of Word Similarities in Tax Laws Using the Word2Vec Method. Journal of Data Analytics and Artificial Intelligence Applications 1 1 84–109.
IEEE
[1]A. İ. Ö. Çilingir, “Analysis of Word Similarities in Tax Laws Using the Word2Vec Method”, Journal of Data Analytics and Artificial Intelligence Applications, vol. 1, no. 1, pp. 84–109, Jan. 2025, [Online]. Available: https://izlik.org/JA98BH56JK
ISNAD
Çilingir, Ali İhsan Özgür. “Analysis of Word Similarities in Tax Laws Using the Word2Vec Method”. Journal of Data Analytics and Artificial Intelligence Applications 1/1 (January 1, 2025): 84-109. https://izlik.org/JA98BH56JK.
JAMA
1.Çilingir AİÖ. Analysis of Word Similarities in Tax Laws Using the Word2Vec Method. Journal of Data Analytics and Artificial Intelligence Applications. 2025;1:84–109.
MLA
Çilingir, Ali İhsan Özgür. “Analysis of Word Similarities in Tax Laws Using the Word2Vec Method”. Journal of Data Analytics and Artificial Intelligence Applications, vol. 1, no. 1, Jan. 2025, pp. 84-109, https://izlik.org/JA98BH56JK.
Vancouver
1.Ali İhsan Özgür Çilingir. Analysis of Word Similarities in Tax Laws Using the Word2Vec Method. Journal of Data Analytics and Artificial Intelligence Applications [Internet]. 2025 Jan. 1;1(1):84-109. Available from: https://izlik.org/JA98BH56JK