BibTex RIS Cite

The Impact of NLP on Turkish Sentiment Analysis

Year 2014, Volume: 7 Issue: 1, 43 - 51, 02.11.2014

Abstract

Sentiment analysis on English texts is a highly popular and well-studied topic. On the other hand, the research in this field for morphologically rich languages is still in its infancy. Turkish is an agglutinative language with a very rich morphological structure. For the first time in the literature, this paper investigates and reports the impact of the natural language preprocessing layers on the sentiment analysis of Turkish social media texts. The experiments show that the sentiment analysis performance may be improved by nearly 5 percentage points yielding a success ratio of 78.83% on the used data set.

References

  • [1] Muhammad Abdul-Mageed, Mona Diab, and Sandra Kübler. 2014. Samar: Subjectivity and sentiment analysis for Arabic social media. Computer Speech & Language, 28(1):20–37.
  • [2] Alexandra Balahur, Rada Mihalcea, and Andrés Montoyo. 2014. Computational approaches to subjectivity and sentiment analysis: Present and envisaged methods and applications. Computer Speech & Language, 28(1):1–6.
  • [3] Zeynep Boynukalin. 2012. Emotion analysis of Turkish texts by using machine learning methods. Ms.
  • [4] Rebecca F Bruce and Janyce M Wiebe. 1999. Recognizing subjectivity: a case study in manual tagging. Natural Language Engineering, 5(2):187–205.
  • [5] Mahmut Çetin and M Fatih Amasyali. 2013. Active learning for Turkish sentiment analysis. In Innovations in Intelligent Systems and Applications (INISTA), 2013 IEEE International Symposium on, pages 1–4. IEEE.
  • [6] Mahmut Çetin and M Fatih Amasyali. 2013. Supervised and traditional term weighting methods for sentiment analysis. In Signal Processing and Communications Applications Conference (SIU), 2013 21st, pages 1–4. IEEE.
  • [7] Gülşen Eryiğit, Fatih Samet Çetin, Meltem Yanik, Tanel Temel, and Ilyas Çiçekli. 2013. Turksent: A sentiment annotation tool for social media. In Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse, pages 131– 134, Sofia, Bulgaria, August. Association for Computational Linguistics.
  • [8] Gülşen Eryiğit. 2014. ITU Turkish NLP web service. In Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Gothenburg, Sweden, April. Association for Computational Linguistics.
  • [9] Minqing Hu and Bing Liu. 2004. Mining and summarizing customer reviews. In Proceedings of tConference on Knowledge Discovery and Data Mining, KDD ’04, pages 168–177, New York, NY, USA. ACM.
  • [10] Hayeon Jang and Hyopil Shin. 2010. Language specific sentiment analysis in morphologically rich languages. In Proceedings of the 23rd International Conference on Computational Linguistics: Posters, COLING ’10, pages 498–506, Stroudsburg, PA, USA. Association for Computational Linguistics.
  • [11] Mesut Kaya, Guven Fidan, and I Hakkı Toroslu. 2013. Transfer learning using twitter data for improving sentiment classification of Turkish political news. In Information Sciences and Systems 2013, pages 139–148. Springer.
  • [12] Soo-Min Kim and Eduard Hovy. 2004. Determining the sentiment of opinions. In Proceedings of the 20th international conference on Computational Linguistics, page 1367. Association for Computational Linguistics.
  • [13] Bing Liu. 2012. Sentiment analysis and opinion mining. Synthesis Lectures on Human Language Technologies, 5(1):1–167.
  • [14] Satoshi Morinaga, Kenji Yamanishi, Kenji Tateishi, and Toshikazu Fukushima. 2002. Mining product reputations on the web. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 341–349. ACM.
  • [15] Sadi Evren Seker and Khaled Al-naami. 2013. Sentimental analysis on Turkish blogs via ensemble classifier. In Proceedings Of The 2013 International Conference On Data Mining. DMIN.
  • [16] Dilara Torunoğlu and Gülşen Eryiğit. 2014. A cascaded approach for social media text normalization of Turkish. In 5th Workshop on Language Analysis for Social Media (LASM) at EACL, Gothenburg, Sweden, April. Association for Computational Linguistics.
  • [17] A Gural Vural, B Barla Cambazoglu, Pinar Senkul, and Z Ozge Tokgoz. 2013. A framework for sentiment analysis in Turkish: Application to polarity detection of movie reviews in Turkish. In Computer and Information Sciences III, pages 437–445. Springer.
  • [18] Janyce M Wiebe, Rebecca F Bruce, and Thomas P O’Hara. 1999. Development and use of a gold standard data set for subjectivity classifications. In Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics, pages 246–253. Association for Computational Linguistics.
  • [19] Janyce Wiebe. 2000. Learning subjective adjectives from corpora. In AAAI/IAAI, pages 735–740.
  • [20] Michael Wiegand, Alexandra Balahur, Benjamin Roth, Dietrich Klakow, and Andrés Montoyo. 2010. A survey on the role of negation in sentiment analysis. In Proceedings of the Workshop on Negation and Speculation in Natural Language Processing, NeSp-NLP’10, pages 60–68, Stroudsburg, PA, USA. Association for Computational Linguistics.
  • [21] Theresa Wilson, Janyce Wiebe, and Paul Hoffmann. 2005. Recognizing contextual polarity in phrase level sentiment analysis. In Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, HLT ’05, pages 347–354, Stroudsburg, PA, USA. Association for Computational Linguistics.
  • [22] Jeonghee Yi, Tetsuya Nasukawa, Razvan Bunescu, and Wayne Niblack. 2003. Sentiment analyzer: Extracting sentiments about a given topic using natural language processing techniques. In Data Mining, 2003. ICDM 2003. Third IEEE International Conference on, pages 427–434. IEEE.
  • [23] Hong Yu and Vasileios Hatzivassiloglou. 2003. Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences. In Proceedings of the 2003 conference on Empirical methods in natural language processing, pages 129–136. Association for Computational Linguistics. 13TÜRKİYE BİLİŞİM VAKFI BİLGİSAYAR he Tenth ACM SIGKDD International
Year 2014, Volume: 7 Issue: 1, 43 - 51, 02.11.2014

Abstract

References

  • [1] Muhammad Abdul-Mageed, Mona Diab, and Sandra Kübler. 2014. Samar: Subjectivity and sentiment analysis for Arabic social media. Computer Speech & Language, 28(1):20–37.
  • [2] Alexandra Balahur, Rada Mihalcea, and Andrés Montoyo. 2014. Computational approaches to subjectivity and sentiment analysis: Present and envisaged methods and applications. Computer Speech & Language, 28(1):1–6.
  • [3] Zeynep Boynukalin. 2012. Emotion analysis of Turkish texts by using machine learning methods. Ms.
  • [4] Rebecca F Bruce and Janyce M Wiebe. 1999. Recognizing subjectivity: a case study in manual tagging. Natural Language Engineering, 5(2):187–205.
  • [5] Mahmut Çetin and M Fatih Amasyali. 2013. Active learning for Turkish sentiment analysis. In Innovations in Intelligent Systems and Applications (INISTA), 2013 IEEE International Symposium on, pages 1–4. IEEE.
  • [6] Mahmut Çetin and M Fatih Amasyali. 2013. Supervised and traditional term weighting methods for sentiment analysis. In Signal Processing and Communications Applications Conference (SIU), 2013 21st, pages 1–4. IEEE.
  • [7] Gülşen Eryiğit, Fatih Samet Çetin, Meltem Yanik, Tanel Temel, and Ilyas Çiçekli. 2013. Turksent: A sentiment annotation tool for social media. In Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse, pages 131– 134, Sofia, Bulgaria, August. Association for Computational Linguistics.
  • [8] Gülşen Eryiğit. 2014. ITU Turkish NLP web service. In Proceedings of the Demonstrations at the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL), Gothenburg, Sweden, April. Association for Computational Linguistics.
  • [9] Minqing Hu and Bing Liu. 2004. Mining and summarizing customer reviews. In Proceedings of tConference on Knowledge Discovery and Data Mining, KDD ’04, pages 168–177, New York, NY, USA. ACM.
  • [10] Hayeon Jang and Hyopil Shin. 2010. Language specific sentiment analysis in morphologically rich languages. In Proceedings of the 23rd International Conference on Computational Linguistics: Posters, COLING ’10, pages 498–506, Stroudsburg, PA, USA. Association for Computational Linguistics.
  • [11] Mesut Kaya, Guven Fidan, and I Hakkı Toroslu. 2013. Transfer learning using twitter data for improving sentiment classification of Turkish political news. In Information Sciences and Systems 2013, pages 139–148. Springer.
  • [12] Soo-Min Kim and Eduard Hovy. 2004. Determining the sentiment of opinions. In Proceedings of the 20th international conference on Computational Linguistics, page 1367. Association for Computational Linguistics.
  • [13] Bing Liu. 2012. Sentiment analysis and opinion mining. Synthesis Lectures on Human Language Technologies, 5(1):1–167.
  • [14] Satoshi Morinaga, Kenji Yamanishi, Kenji Tateishi, and Toshikazu Fukushima. 2002. Mining product reputations on the web. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 341–349. ACM.
  • [15] Sadi Evren Seker and Khaled Al-naami. 2013. Sentimental analysis on Turkish blogs via ensemble classifier. In Proceedings Of The 2013 International Conference On Data Mining. DMIN.
  • [16] Dilara Torunoğlu and Gülşen Eryiğit. 2014. A cascaded approach for social media text normalization of Turkish. In 5th Workshop on Language Analysis for Social Media (LASM) at EACL, Gothenburg, Sweden, April. Association for Computational Linguistics.
  • [17] A Gural Vural, B Barla Cambazoglu, Pinar Senkul, and Z Ozge Tokgoz. 2013. A framework for sentiment analysis in Turkish: Application to polarity detection of movie reviews in Turkish. In Computer and Information Sciences III, pages 437–445. Springer.
  • [18] Janyce M Wiebe, Rebecca F Bruce, and Thomas P O’Hara. 1999. Development and use of a gold standard data set for subjectivity classifications. In Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics, pages 246–253. Association for Computational Linguistics.
  • [19] Janyce Wiebe. 2000. Learning subjective adjectives from corpora. In AAAI/IAAI, pages 735–740.
  • [20] Michael Wiegand, Alexandra Balahur, Benjamin Roth, Dietrich Klakow, and Andrés Montoyo. 2010. A survey on the role of negation in sentiment analysis. In Proceedings of the Workshop on Negation and Speculation in Natural Language Processing, NeSp-NLP’10, pages 60–68, Stroudsburg, PA, USA. Association for Computational Linguistics.
  • [21] Theresa Wilson, Janyce Wiebe, and Paul Hoffmann. 2005. Recognizing contextual polarity in phrase level sentiment analysis. In Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, HLT ’05, pages 347–354, Stroudsburg, PA, USA. Association for Computational Linguistics.
  • [22] Jeonghee Yi, Tetsuya Nasukawa, Razvan Bunescu, and Wayne Niblack. 2003. Sentiment analyzer: Extracting sentiments about a given topic using natural language processing techniques. In Data Mining, 2003. ICDM 2003. Third IEEE International Conference on, pages 427–434. IEEE.
  • [23] Hong Yu and Vasileios Hatzivassiloglou. 2003. Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences. In Proceedings of the 2003 conference on Empirical methods in natural language processing, pages 129–136. Association for Computational Linguistics. 13TÜRKİYE BİLİŞİM VAKFI BİLGİSAYAR he Tenth ACM SIGKDD International
There are 23 citations in total.

Details

Other ID JA37MA33FU
Journal Section Makaleler(Araştırma)
Authors

Ezgi Yıldırım This is me

Fatih Samet Çetin This is me

Gülşen Eryiğit This is me

Tanel Temel This is me

Publication Date November 2, 2014
Published in Issue Year 2014 Volume: 7 Issue: 1

Cite

APA Yıldırım, E., Çetin, F. S., Eryiğit, G., Temel, T. (2014). The Impact of NLP on Turkish Sentiment Analysis. Türkiye Bilişim Vakfı Bilgisayar Bilimleri Ve Mühendisliği Dergisi, 7(1), 43-51.
AMA Yıldırım E, Çetin FS, Eryiğit G, Temel T. The Impact of NLP on Turkish Sentiment Analysis. TBV-BBMD. November 2014;7(1):43-51.
Chicago Yıldırım, Ezgi, Fatih Samet Çetin, Gülşen Eryiğit, and Tanel Temel. “The Impact of NLP on Turkish Sentiment Analysis”. Türkiye Bilişim Vakfı Bilgisayar Bilimleri Ve Mühendisliği Dergisi 7, no. 1 (November 2014): 43-51.
EndNote Yıldırım E, Çetin FS, Eryiğit G, Temel T (November 1, 2014) The Impact of NLP on Turkish Sentiment Analysis. Türkiye Bilişim Vakfı Bilgisayar Bilimleri ve Mühendisliği Dergisi 7 1 43–51.
IEEE E. Yıldırım, F. S. Çetin, G. Eryiğit, and T. Temel, “The Impact of NLP on Turkish Sentiment Analysis”, TBV-BBMD, vol. 7, no. 1, pp. 43–51, 2014.
ISNAD Yıldırım, Ezgi et al. “The Impact of NLP on Turkish Sentiment Analysis”. Türkiye Bilişim Vakfı Bilgisayar Bilimleri ve Mühendisliği Dergisi 7/1 (November 2014), 43-51.
JAMA Yıldırım E, Çetin FS, Eryiğit G, Temel T. The Impact of NLP on Turkish Sentiment Analysis. TBV-BBMD. 2014;7:43–51.
MLA Yıldırım, Ezgi et al. “The Impact of NLP on Turkish Sentiment Analysis”. Türkiye Bilişim Vakfı Bilgisayar Bilimleri Ve Mühendisliği Dergisi, vol. 7, no. 1, 2014, pp. 43-51.
Vancouver Yıldırım E, Çetin FS, Eryiğit G, Temel T. The Impact of NLP on Turkish Sentiment Analysis. TBV-BBMD. 2014;7(1):43-51.

Article Acceptance

Use user registration/login to upload articles online.

The acceptance process of the articles sent to the journal consists of the following stages:

1. Each submitted article is sent to at least two referees at the first stage.

2. Referee appointments are made by the journal editors. There are approximately 200 referees in the referee pool of the journal and these referees are classified according to their areas of interest. Each referee is sent an article on the subject he is interested in. The selection of the arbitrator is done in a way that does not cause any conflict of interest.

3. In the articles sent to the referees, the names of the authors are closed.

4. Referees are explained how to evaluate an article and are asked to fill in the evaluation form shown below.

5. The articles in which two referees give positive opinion are subjected to similarity review by the editors. The similarity in the articles is expected to be less than 25%.

6. A paper that has passed all stages is reviewed by the editor in terms of language and presentation, and necessary corrections and improvements are made. If necessary, the authors are notified of the situation.

0

.   This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.