Research Article

Detecting Personal Health Data Disclosures in Turkish Social Data

Volume: 11 Number: 2 June 30, 2022
EN

Detecting Personal Health Data Disclosures in Turkish Social Data

Abstract

The number of users of social networking environments is increasing day by day. In parallel with the number of users, new social networking platforms are also taking place on the internet according to the wishes and needs of the users. Social networking environments, which are in an indispensable position with the instinct of socialization, also provide an environment for unconscious personal data disclosures. In this study, the health data disclosed by users in social networks due to lack of awareness has been focused on. By using the data collected from Twitter, it is aimed to identify the tweets that disclose health data. To achieve this purpose tweets collected from Twitter in accordance with search keywords about personal health experiences and annotated by a group of computer engineers. Created corpus preprocessed with natural language processing tool for Turkic languages, named Zemberek, and classified with Fasttext library. With language model created, tweets containing personal health data disclosure were detected with %88 accuracy. The main contributions in this paper are mainly; being the first study to detect personal health data disclosures in Turkish language, creation of Turkish search keywords that will serve as a reference for obtaining data to meet the health data domain, instead of disease-specific approach seen frequently in literature a holistic perspective implemented by collecting tweets containing many distinct keywords about health experiences, and creation of Turkish data corpus by manually annotating around 4.500 tweets in personal health data domain.

Keywords

References

  1. S. Kemp, “Digital 2021 global overview report,” Accessed May. 01, 2022, 2021. [Online]. Avail- able: https://wearesocial- cn.s3.cn- north- 1.amazonaws.com.cn/ common/digital2021/digital-2021-global.pdf
  2. M. Timothy, F. Theodore, and S. Allison. Customer data: Designing for transparency and trust. Accessed May. 01, 2022. [Online]. Available: https://www.hipaajournal.com/ december-2021-healthcare-data-breach-report/
  3. C. Zhang, J. Sun, X. Zhu, and Y. Fang, “Privacy and security for online social networks: Challenges and opportunities,” IEEE Network, vol. 24, no. 4, pp. 13–18, 2010.
  4. S. E. Erol and S. Sagiroglu, “Privacy awareness in social networks,” in 2021 International Conference on Information Security and Cryptology (ISCTURKEY), 2021, pp. 57–62.
  5. “Cost of a data breach report,” Accessed May. 01, 2022, 2021. [Online]. Available: https://www.ibm.com/downloads/ cas/OJDVQGRY
  6. S. Alder. December 2021 healthcare data breach report. Accessed May. 01, 2022. [Online]. Available: https://www.hipaajournal.com/ december-2021-healthcare-data-breach-report/
  7. I. Lella, M. Theocharidou, E. Tsekmezoglou, and A. Malatras, “Enisa threat lanscape 2021,” Accessed May. 01, 2022, 2021. [Online]. Available: https://www.enisa.europa.eu/publications/ enisa-threat-landscape-2021.
  8. “Kamuoyu duyurusu (Veri ihlali bildirimi) – Yonca Sağlık Hizmetleri Ltd. Şti.” Accessed May. 01, 2022. [Online]. Available: https://www.kvkk.gov.tr/Icerik/7199/ Kamuoyu-Duyurusu-Veri-Ihlali-Bildirimi-Yonca-Saglik-Hizmetleri-Ltd-Sti-

Details

Primary Language

English

Subjects

Software Engineering (Other)

Journal Section

Research Article

Publication Date

June 30, 2022

Submission Date

May 9, 2022

Acceptance Date

June 16, 2022

Published in Issue

Year 2022 Volume: 11 Number: 2

APA
Erol, S. E., Sağıroğlu, Ş., & Demirezen, U. (2022). Detecting Personal Health Data Disclosures in Turkish Social Data. International Journal of Information Security Science, 11(2), 69-84. https://izlik.org/JA75XB44DL
AMA
1.Erol SE, Sağıroğlu Ş, Demirezen U. Detecting Personal Health Data Disclosures in Turkish Social Data. IJISS. 2022;11(2):69-84. https://izlik.org/JA75XB44DL
Chicago
Erol, Salih Erdem, Şeref Sağıroğlu, and Umut Demirezen. 2022. “Detecting Personal Health Data Disclosures in Turkish Social Data”. International Journal of Information Security Science 11 (2): 69-84. https://izlik.org/JA75XB44DL.
EndNote
Erol SE, Sağıroğlu Ş, Demirezen U (June 1, 2022) Detecting Personal Health Data Disclosures in Turkish Social Data. International Journal of Information Security Science 11 2 69–84.
IEEE
[1]S. E. Erol, Ş. Sağıroğlu, and U. Demirezen, “Detecting Personal Health Data Disclosures in Turkish Social Data”, IJISS, vol. 11, no. 2, pp. 69–84, June 2022, [Online]. Available: https://izlik.org/JA75XB44DL
ISNAD
Erol, Salih Erdem - Sağıroğlu, Şeref - Demirezen, Umut. “Detecting Personal Health Data Disclosures in Turkish Social Data”. International Journal of Information Security Science 11/2 (June 1, 2022): 69-84. https://izlik.org/JA75XB44DL.
JAMA
1.Erol SE, Sağıroğlu Ş, Demirezen U. Detecting Personal Health Data Disclosures in Turkish Social Data. IJISS. 2022;11:69–84.
MLA
Erol, Salih Erdem, et al. “Detecting Personal Health Data Disclosures in Turkish Social Data”. International Journal of Information Security Science, vol. 11, no. 2, June 2022, pp. 69-84, https://izlik.org/JA75XB44DL.
Vancouver
1.Salih Erdem Erol, Şeref Sağıroğlu, Umut Demirezen. Detecting Personal Health Data Disclosures in Turkish Social Data. IJISS [Internet]. 2022 Jun. 1;11(2):69-84. Available from: https://izlik.org/JA75XB44DL