Research Article

Gender Bias and Contextual Sensitivity in Machine Translation: A Focus on Turkish Subject-Dropped Sentences

Volume: 7 Number: 2 December 31, 2024
  • Şeyda Portillo - Palma *
  • Sergi Alvarez - Vidal
EN

Gender Bias and Contextual Sensitivity in Machine Translation: A Focus on Turkish Subject-Dropped Sentences

Abstract

Turkish, a language that does not explicitly mark gender in pronouns, poses a unique challenge for machine translation systems, particularly in cases of gender-neutral or ambiguous context. This study investigates the performance of neural machine translation (NMT) and large language models (LLMs) in resolving gender ambiguity when translating Turkish subject-dropped sentences into English. The analysis examines four prominent models—Google Translate, DeepL, ChatGPT, and Gemini—evaluating their pronoun selection and the extent of gender bias, especially in emotionally charged or contextually nuanced sentences. A primarily quantitative evaluation reveals a persistent gender bias across all models, with LLMs demonstrating relatively better performance than NMTs when clearer contextual information is present. However, all models exhibit limitations in managing the complexities of cross-linguistic gender representation. This research highlights the pressing need for gender-neutral solutions and advancements in context-sensitive translation. Furthermore, we introduce a moderately sized annotated Turkish corpus, designed to facilitate future studies on gender ambiguity in machine translation (MT). This dataset provides a valuable resource for enhancing the accuracy of gendered pronoun resolution and fostering more inclusive, bias-reduced translation systems. Overall, the study contributes to the growing discourse on reducing bias in language models while addressing the challenges of nuanced linguistic diversity in translation.

Keywords

References

  1. Aleçakır, Hüseyin, Necva Bölücü, and Burcu Can. 2022. “TurkishDelightNLP: A Neural Turkish NLP Toolkit.” In Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: System Demonstrations, 17–26. Association for Computational Linguistics. doi:10.18653/v1/2022.naacl-demo.3.
  2. Asghar, Muhammad Zubair, Fazli Subhan, Muhammad Imran, Fazal Masud Kundi, Adil Khan, Shahboddin Shamshirband, Amir Mosavi, Peter Csiba, and Annamaria R. Varkonyi Koczy. 2020. “Performance Evaluation of Supervised Machine Learning Techniques for Efficient Detection of Emotions from Online Content.” Computers, Materials & Continua 63 (3): 1093–1118. doi:10.32604/cmc.2020.07709.
  3. Basta, Christine Raouf Saad. 2022 “Gender Bias in Natural Language Processing.” PhD diss., Universitat Politècnica de Catalunya. doi:10.5821/dissertation-2117-379361.
  4. Bentz, Christian, Ximena Gutierrez-Vasques, Olga Sozinova, and Tanja Samardžić. 2023. “Complexity Trade-Offs and Equi-Complexity in Natural Languages: A Meta-Analysis.” In “Measuring Language Complexity,” edited by Katharina Ehret, Aleksandrs Berdicevskis, Christian Bentz, and Alice Blumenthal-Dramé. Special Issue, Linguistics Vanguard 9 (1): 9–25. doi:10.1515/lingvan-2021-0054.
  5. Brown, Tom, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, et al. 2020. “Language Models Are Few-Shot Learners.” In Advances in Neural Information Processing Systems 33 (NeurIPS 2020), edited by Hugo Larochelle, Marc’Aurelio Ranzato, Raia Hadsell, and Maria-Florina Balcan. Curran Associates. doi:10.48550/arXiv.2005.14165.
  6. Castilho, Sheila, Clodagh Quinn Mallon, Rahel Meister, and Shengya Yue. 2023. “Do Online Machine Translation Systems Care for Context? What About a GPT Model?” In Proceedings of 24th Annual Conference of the European Association for Machine Translation, edited by Mary Nurminen, Judith Brenner, Maarit Koponen, Sirkku Latomaa, Mikhail Mikhailov, Frederike Schierl, Tharindu Ranasinghe, et al., 393–417. European Association for Machine Translation. https://aclanthology.org/2023.eamt-1.39.
  7. Ciora, Chloe, Nur Iren, and Malihe Alikhani. 2021. “Examining Covert Gender Bias: A Case Study in Turkish and English Machine Translation Models.” In Proceedings of 14th International Conference on Natural Language Generation, edited by Anya Belz, Angela Fan, Ehud Reiter, and Yaji Sripada, 55–63. Association for Computational Linguistics. doi:10.18653/v1/2021.inlg-1.7.
  8. Costa-jussà, Marta R., Carlos Escolano, Christine Basta, Javier Ferrando, Roser Batlle, and Ksenia Kharitonova. 2022. “Interpreting Gender Bias in Neural Machine Translation: Multilingual Architecture Matters.” In Proceedings of the AAAI Conference on Artificial Intelligence 36 (11): 11855–11863. Association for the Advancement of Artificial Intelligence. California: AAAI Press. doi:10.1609/aaai.v36i11.21442.

Details

Primary Language

English

Subjects

Translation and Interpretation Studies

Journal Section

Research Article

Authors

Şeyda Portillo - Palma * This is me
0000-0001-6986-231X
Spain

Sergi Alvarez - Vidal This is me
0000-0002-6355-4559
Spain

Publication Date

December 31, 2024

Submission Date

October 19, 2024

Acceptance Date

December 12, 2024

Published in Issue

Year 2024 Volume: 7 Number: 2

APA
Portillo - Palma, Ş., & Alvarez - Vidal, S. (2024). Gender Bias and Contextual Sensitivity in Machine Translation: A Focus on Turkish Subject-Dropped Sentences. TransLogos Translation Studies Journal, 7(2), 1-28. https://doi.org/10.29228/transLogos.67
AMA
1.Portillo - Palma Ş, Alvarez - Vidal S. Gender Bias and Contextual Sensitivity in Machine Translation: A Focus on Turkish Subject-Dropped Sentences. transLogos Translation Studies Journal. 2024;7(2):1-28. doi:10.29228/transLogos.67
Chicago
Portillo - Palma, Şeyda, and Sergi Alvarez - Vidal. 2024. “Gender Bias and Contextual Sensitivity in Machine Translation: A Focus on Turkish Subject-Dropped Sentences”. TransLogos Translation Studies Journal 7 (2): 1-28. https://doi.org/10.29228/transLogos.67.
EndNote
Portillo - Palma Ş, Alvarez - Vidal S (December 1, 2024) Gender Bias and Contextual Sensitivity in Machine Translation: A Focus on Turkish Subject-Dropped Sentences. transLogos Translation Studies Journal 7 2 1–28.
IEEE
[1]Ş. Portillo - Palma and S. Alvarez - Vidal, “Gender Bias and Contextual Sensitivity in Machine Translation: A Focus on Turkish Subject-Dropped Sentences”, transLogos Translation Studies Journal, vol. 7, no. 2, pp. 1–28, Dec. 2024, doi: 10.29228/transLogos.67.
ISNAD
Portillo - Palma, Şeyda - Alvarez - Vidal, Sergi. “Gender Bias and Contextual Sensitivity in Machine Translation: A Focus on Turkish Subject-Dropped Sentences”. transLogos Translation Studies Journal 7/2 (December 1, 2024): 1-28. https://doi.org/10.29228/transLogos.67.
JAMA
1.Portillo - Palma Ş, Alvarez - Vidal S. Gender Bias and Contextual Sensitivity in Machine Translation: A Focus on Turkish Subject-Dropped Sentences. transLogos Translation Studies Journal. 2024;7:1–28.
MLA
Portillo - Palma, Şeyda, and Sergi Alvarez - Vidal. “Gender Bias and Contextual Sensitivity in Machine Translation: A Focus on Turkish Subject-Dropped Sentences”. TransLogos Translation Studies Journal, vol. 7, no. 2, Dec. 2024, pp. 1-28, doi:10.29228/transLogos.67.
Vancouver
1.Şeyda Portillo - Palma, Sergi Alvarez - Vidal. Gender Bias and Contextual Sensitivity in Machine Translation: A Focus on Turkish Subject-Dropped Sentences. transLogos Translation Studies Journal. 2024 Dec. 1;7(2):1-28. doi:10.29228/transLogos.67