This study aims to compare the performance of different ensemble learning algorithms using Natural Language Processing (NLP) approach for the detection of depression status from text. Depression is a common mental health problem worldwide and early diagnosis and intervention are of critical importance.
A dataset consisting of 124,017 tweets collected between 2019-2020 was used in the study. These tweets were classified according to their depressive and non-depressive content. Five different ensemble learning algorithms, namely Random Forest, AdaBoost, Gradient Boosting, XGBoost and Voting Classifier, were applied in the study.
The performance of the models was evaluated using various metrics such as accuracy, precision, recall and F1-score. In addition, ROC curves and learning curves were analyzed.
The results revealed that all ensemble learning algorithms showed high performance, but Random Forest gave the best results in all metrics. Voting Classifier performed the second best, while Gradient Boosting performed relatively poorly. This study shows that ensemble learning algorithms are effective in detecting depression from text, and Random Forest in particular can be used as a potential screening tool in this area. The findings may contribute to the development of technology-supported approaches for early diagnosis and intervention in the field of mental health.
Depression Detection Natural Language Processing (NLP) Ensemble Learning Algorithms Social Media Analysis
Primary Language | English |
---|---|
Subjects | Information Systems Development Methodologies and Practice |
Journal Section | Research Article |
Authors | |
Publication Date | December 31, 2024 |
Submission Date | August 1, 2024 |
Acceptance Date | December 20, 2024 |
Published in Issue | Year 2024 Volume: 9 Issue: 2 |
Copyright © 2025. AA. All rights reserved