Research Article

CLASSIFICATION OF TURKISH TWEETS BY DOCUMENT VECTORS AND INVESTIGATION OF THE EFFECTS OF PARAMETER CHANGES ON CLASSIFICATION SUCCESS

Volume: 38 Number: 3 October 5, 2021
  • Metin Bilgin

Abstract

Natural language processing is a field of artificial intelligence that has gained popularity in recent years. Given the growing volume of data, inferring sentiment from texts on a given topic and classifying documents have become increasingly important. Understanding and interpreting written text is a distinctly human ability, but natural language processing, a sub-branch of machine learning and artificial intelligence, makes it possible to draw inferences from texts and to classify them. In this study, Turkish tweets were classified using the two methods of the algorithm known in the literature as document vectors, and the effect of changes in the methods' parameters on classification success was investigated. The DBoW (Distributed Bag of Words) method yielded higher accuracy than the DM (Distributed Memory) method, and the DBoW-NS (Negative Sampling) architecture yielded higher accuracy than the other architectures.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Authors

Metin Bilgin

Publication Date

October 5, 2021

Submission Date

November 12, 2019

Acceptance Date

June 13, 2020

Published in Issue

Year 2020 Volume: 38 Number: 3

APA
Bilgin, M. (2021). CLASSIFICATION OF TURKISH TWEETS BY DOCUMENT VECTORS AND INVESTIGATION OF THE EFFECTS OF PARAMETER CHANGES ON CLASSIFICATION SUCCESS. Sigma Journal of Engineering and Natural Sciences, 38(3), 1581-1592. https://izlik.org/JA94NR39JF
AMA
1. Bilgin M. CLASSIFICATION OF TURKISH TWEETS BY DOCUMENT VECTORS AND INVESTIGATION OF THE EFFECTS OF PARAMETER CHANGES ON CLASSIFICATION SUCCESS. SIGMA. 2021;38(3):1581-1592. https://izlik.org/JA94NR39JF
Chicago
Bilgin, Metin. 2021. “CLASSIFICATION OF TURKISH TWEETS BY DOCUMENT VECTORS AND INVESTIGATION OF THE EFFECTS OF PARAMETER CHANGES ON CLASSIFICATION SUCCESS”. Sigma Journal of Engineering and Natural Sciences 38 (3): 1581-92. https://izlik.org/JA94NR39JF.
EndNote
Bilgin M (October 1, 2021) CLASSIFICATION OF TURKISH TWEETS BY DOCUMENT VECTORS AND INVESTIGATION OF THE EFFECTS OF PARAMETER CHANGES ON CLASSIFICATION SUCCESS. Sigma Journal of Engineering and Natural Sciences 38 3 1581–1592.
IEEE
[1] M. Bilgin, “CLASSIFICATION OF TURKISH TWEETS BY DOCUMENT VECTORS AND INVESTIGATION OF THE EFFECTS OF PARAMETER CHANGES ON CLASSIFICATION SUCCESS”, SIGMA, vol. 38, no. 3, pp. 1581–1592, Oct. 2021, [Online]. Available: https://izlik.org/JA94NR39JF
ISNAD
Bilgin, Metin. “CLASSIFICATION OF TURKISH TWEETS BY DOCUMENT VECTORS AND INVESTIGATION OF THE EFFECTS OF PARAMETER CHANGES ON CLASSIFICATION SUCCESS”. Sigma Journal of Engineering and Natural Sciences 38/3 (October 1, 2021): 1581-1592. https://izlik.org/JA94NR39JF.
JAMA
1. Bilgin M. CLASSIFICATION OF TURKISH TWEETS BY DOCUMENT VECTORS AND INVESTIGATION OF THE EFFECTS OF PARAMETER CHANGES ON CLASSIFICATION SUCCESS. SIGMA. 2021;38:1581–1592.
MLA
Bilgin, Metin. “CLASSIFICATION OF TURKISH TWEETS BY DOCUMENT VECTORS AND INVESTIGATION OF THE EFFECTS OF PARAMETER CHANGES ON CLASSIFICATION SUCCESS”. Sigma Journal of Engineering and Natural Sciences, vol. 38, no. 3, Oct. 2021, pp. 1581-92, https://izlik.org/JA94NR39JF.
Vancouver
1. Metin Bilgin. CLASSIFICATION OF TURKISH TWEETS BY DOCUMENT VECTORS AND INVESTIGATION OF THE EFFECTS OF PARAMETER CHANGES ON CLASSIFICATION SUCCESS. SIGMA [Internet]. 2021 Oct. 1;38(3):1581-92. Available from: https://izlik.org/JA94NR39JF
