<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20241031//EN"
        "https://jats.nlm.nih.gov/publishing/1.4/JATS-journalpublishing1-4.dtd">
<article  article-type="research-article"        dtd-version="1.4">
            <front>

                <journal-meta>
                                                                <journal-id>opus jsr</journal-id>
            <journal-title-group>
                                                                                    <journal-title>OPUS Journal of Society Research</journal-title>
            </journal-title-group>
                                        <issn pub-type="epub">2791-9862</issn>
                                                                                            <publisher>
                    <publisher-name>İdeal Kent Yayınları</publisher-name>
                </publisher>
                    </journal-meta>
                <article-meta>
                                        <article-id pub-id-type="doi">10.26466/opusjsr.1837461</article-id>
                                                                <article-categories>
                                            <subj-group  xml:lang="en">
                                                            <subject>Labor Sociology</subject>
                                                            <subject>Labor and Organisition Sociology</subject>
                                                            <subject>Strategy, Management and Organisational Behaviour (Other)</subject>
                                                    </subj-group>
                                            <subj-group  xml:lang="tr">
                                                            <subject>Çalışma Sosyolojisi</subject>
                                                            <subject>İş ve Örgüt Sosyolojisi</subject>
                                                            <subject>Strateji, Yönetim ve Örgütsel Davranış (Diğer)</subject>
                                                    </subj-group>
                                    </article-categories>
                                                                                                                                                        <title-group>
                                                                                                                        <trans-title-group xml:lang="tr">
                                    <trans-title>Topluluk öğrenme yöntemleri ve SHAP tabanlı açıklamalar kullanılarak çalışan ayrılmalarında azınlık sınıfının tespit performansının artırılması</trans-title>
                                </trans-title-group>
                                                                                                                                                                                                <article-title>Improving minority-class detection in employee attrition with ensemble learning and SHAP-Based explanations</article-title>
                                                                                                    </title-group>
            
                                                    <contrib-group content-type="authors">
                                                                        <contrib contrib-type="author">
                                                                    <contrib-id contrib-id-type="orcid">
                                        https://orcid.org/0000-0002-8445-4629</contrib-id>
                                                                <name>
                                    <surname>Özden</surname>
                                    <given-names>Cevher</given-names>
                                </name>
                                                                    <aff>ÇUKUROVA ÜNİVERSİTESİ</aff>
                                                            </contrib>
                                                                                </contrib-group>
                        
                                        <pub-date pub-type="pub" iso-8601-date="20260331">
                    <day>03</day>
                    <month>31</month>
                    <year>2026</year>
                </pub-date>
                                        <volume>23</volume>
                                        <issue>2026</issue>
                                        <fpage>1</fpage>
                                        <lpage>14</lpage>
                        
                        <history>
                                    <date date-type="received" iso-8601-date="20251206">
                        <day>12</day>
                        <month>06</month>
                        <year>2025</year>
                    </date>
                                                    <date date-type="accepted" iso-8601-date="20260330">
                        <day>03</day>
                        <month>30</month>
                        <year>2026</year>
                    </date>
                            </history>
                                        <permissions>
                    <copyright-statement>Copyright © 2011, OPUS Journal of Society Research</copyright-statement>
                    <copyright-year>2011</copyright-year>
                    <copyright-holder>OPUS Journal of Society Research</copyright-holder>
                </permissions>
            
                                                                                                <trans-abstract xml:lang="tr">
                            <p>Bu çalışma, özellikle küçük ve dengesiz tablosal veri setlerinde, geleneksel topluluk (ensemble) yöntemleri ile güncel derin öğrenme yaklaşımlarının çalışan yıpranmasını (işten ayrılma) tahmin etmedeki performansını incelemekte; ayrıca yalnızca teknik değerlendirmeye değil, pratik İK kullanımına da uygun, yorumlanabilir bir karar destek yapısı önermektedir. Analiz, veri sızıntısını önlemek amacıyla özel olarak yapılandırılmış bir doğrulama tasarımıyla IBM İK Analitiği veri seti üzerinde yürütülmüştür. Sınıf dengesizliğini gidermek için SMOTE, doğrulama verilerine müdahale edilmeden yalnızca eğitim katmanlarında uygulanmıştır. Karşılaştırılan modeller Rastgele Orman, XGBoost, CatBoost, Yapay Sinir Ağları (ANN) ve TabNet’tir. Varsayılan olasılık eşikleri yerine, azınlık sınıfına duyarlılığı artırmak amacıyla dinamik eşik ayarı kullanılmıştır. Model performansı Recall, Precision, F1-skoru ve hata matrisleriyle değerlendirilmiş; yorumlanabilirliği güçlendirmek için SHAP analizinden yararlanılmıştır. Güvenilirliği artırmak üzere Tabakalı 5-Katlı Çapraz Doğrulama uygulanmıştır. Bulgular, CatBoost’un 0,097 Brier skoru ve 0,468 ± 0,053 ortalama F1-skoruyla en dengeli sonuçları verdiğini göstermektedir. Dinamik eşik ayarı sonrasında TabNet en yüksek duyarlılığı (Recall: 0,573 ± 0,064) sağlamış, bu da onu erken risk tespitinde öne çıkarmıştır. SHAP sonuçlarına göre en etkili değişkenler Fazla Mesai ve Hisse Senedi Opsiyon Seviyesidir.</p></trans-abstract>
                                                                                                                                    <abstract><p>This study examines how traditional ensemble techniques and recent deep learning approaches perform in predicting employee attrition, particularly when working with small and imbalanced tabular datasets. In addition, it proposes an interpretable decision-support framework designed not only for technical evaluation but also for practical HR use. The analysis was conducted on the IBM HR Analytics dataset using a specially structured validation design to prevent data leakage. To address class imbalance, SMOTE was applied strictly within the training folds, without touching the validation data. The models compared include Random Forest, XGBoost, CatBoost, Artificial Neural Networks (ANN), and TabNet. Rather than relying on default probability thresholds, dynamic threshold adjustment was introduced to improve sensitivity to the minority class. Model performance was evaluated using Recall, Precision, F1-score, and confusion matrices, while SHAP analysis was employed to enhance interpretability and support transparent decision-making in HR contexts. To strengthen the reliability of the evaluation, a Stratified 5-Fold Cross-Validation scheme was adopted. The findings show that CatBoost produced the most balanced and consistent results, achieving a mean F1-score of 0.468 ± 0.053 together with a Brier score of 0.097. After dynamic threshold adjustment, TabNet demonstrated the highest sensitivity (Recall: 0.573 ± 0.064), making it particularly effective for early risk detection. According to the SHAP-based interpretation, OverTime and Stock Option Level emerged as the most influential predictors.</p></abstract>
                                                            
            
                                                                                        <kwd-group>
                                                    <kwd>tabnet</kwd>
                                                    <kwd>  xgboost</kwd>
                                                    <kwd>  explainable ai (xai)</kwd>
                                                    <kwd>  shap</kwd>
                                                    <kwd>  Employee churn prediction</kwd>
                                                    <kwd>  tabnet</kwd>
                                                    <kwd>  xgboost</kwd>
                                                    <kwd>  explainable ai (xai)</kwd>
                                                    <kwd>  shap</kwd>
                                                    <kwd>  imbalanced classification</kwd>
                                            </kwd-group>
                            
                                                <kwd-group xml:lang="tr">
                                                    <kwd>tabnet</kwd>
                                                    <kwd>  xgboost</kwd>
                                                    <kwd>  açıklanabilir yapay zekâ</kwd>
                                                    <kwd>  shap</kwd>
                                                    <kwd>  Çalışan ayrılma tahmini</kwd>
                                                    <kwd>  tabnet</kwd>
                                                    <kwd>  xgboost</kwd>
                                                    <kwd>  açıklanabilir yapay zekâ</kwd>
                                                    <kwd>  shap</kwd>
                                                    <kwd>  sınıf dengesizliği problem</kwd>
                                            </kwd-group>
                                                                                                                                    <funding-group specific-use="FundRef">
                    <award-group>
                                                    <funding-source>
                                <named-content content-type="funder_name">No funding was received from any public, commercial, or non-profit organization for this study.</named-content>
                            </funding-source>
                                                                    </award-group>
                </funding-group>
                                </article-meta>
    </front>
    <back>
                            <ref-list>
                                    <ref id="ref1">
                        <label>1</label>
                        <mixed-citation publication-type="journal">Alao, D., &amp; Adeyemo, A. B. (2013). Analyzing employee attrition using decision tree algorithms. Computing, Information Systems &amp; Development Informatics, 4(1), 17-28.</mixed-citation>
                    </ref>
                                    <ref id="ref2">
                        <label>2</label>
                        <mixed-citation publication-type="journal">Arik, S. O., &amp; Pfister, T. (2021). TabNet: Attentive interpretable tabular learning. Proceedings of the AAAI Conference on Artificial Intelligence, 35(8), 6679-6687. 
https://doi.org/10.1609/aaai.v35i8.16826</mixed-citation>
                    </ref>
                                    <ref id="ref3">
                        <label>3</label>
                        <mixed-citation publication-type="journal">Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5-32. 
https://doi.org/10.1023/A:1010933404324</mixed-citation>
                    </ref>
                                    <ref id="ref4">
                        <label>4</label>
                        <mixed-citation publication-type="journal">Chawla, N. V., Bowyer, K. W., Hall, L. O., &amp; Kegelmeyer, W. P. (2002). SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16, 321-357. 
https://doi.org/10.1613/jair.953</mixed-citation>
                    </ref>
                                    <ref id="ref5">
                        <label>5</label>
                        <mixed-citation publication-type="journal">Chen, T., &amp; Guestrin, C. (2016). XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–794. 
https://doi.org/10.1145/2939672.2939785</mixed-citation>
                    </ref>
                                    <ref id="ref6">
                        <label>6</label>
                        <mixed-citation publication-type="journal">Fallucchi, F., Coladangelo, M., Giuliano, R., &amp; William De Luca, E. (2020). Predicting employee attrition using machine learning techniques. Computers, 9(4), 86. https://doi.org/10.3390/computers9040086</mixed-citation>
                    </ref>
                                    <ref id="ref7">
                        <label>7</label>
                        <mixed-citation publication-type="journal">Fang, Y., &amp; Zhang, Z. (2025). Employee turnover prediction model based on feature selection and imbalanced data handling. IEEE Access. https://doi.org/10.1109/ACCESS.2025.3589491</mixed-citation>
                    </ref>
                                    <ref id="ref8">
                        <label>8</label>
                        <mixed-citation publication-type="journal">Grinsztajn, L., Oyallon, E., &amp; Varoquaux, G. (2022). Why do tree-based models still outperform deep learning on typical tabular data? Advances in Neural Information Processing Systems, 35, 507-520. 
https://doi.org/10.48550/arXiv.2207.08815</mixed-citation>
                    </ref>
                                    <ref id="ref9">
                        <label>9</label>
                        <mixed-citation publication-type="journal">He, H., &amp; Garcia, E. A. (2009). Learning from imbalanced data. IEEE Transactions on Knowledge and Data Engineering, 21(9), 1263-1284. https://doi.org/10.1109/TKDE.2008.239</mixed-citation>
                    </ref>
                                    <ref id="ref10">
                        <label>10</label>
                        <mixed-citation publication-type="journal">Jain, R., &amp; Nayyar, A. (2019). Predicting employee attrition using XGBoost machine learning approach. Proceedings of the 2019 International Conference on System Modeling &amp; Advancement in Research Trends (SMART), 113-120. https://doi.org/10.1109/SYSMART.2018.8746940</mixed-citation>
                    </ref>
                                    <ref id="ref11">
                        <label>11</label>
                        <mixed-citation publication-type="journal">Lundberg, S. M., &amp; Lee, S. I. (2017). A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems, 30. 
https://doi.org/10.48550/arXiv.1705.07874</mixed-citation>
                    </ref>
                                    <ref id="ref12">
                        <label>12</label>
                        <mixed-citation publication-type="journal">Molnar, C. (2020). Interpretable machine learning: A guide for making black box models explainable. Retrieved from: https://christophm.github.io/interpretable-ml-book/</mixed-citation>
                    </ref>
                                    <ref id="ref13">
                        <label>13</label>
                        <mixed-citation publication-type="journal">Santos, M. S., Soares, J. P., Abreu, P. H., Araujo, H., &amp; Fujii, J. (2018). Cross-validation for imbalanced datasets: Avoiding overoptimistic performance estimation. IEEE Computational Intelligence Magazine, 13(4), 59-79. 
https://doi.org/10.1109/MCI.2018.2866730</mixed-citation>
                    </ref>
                                    <ref id="ref14">
                        <label>14</label>
                        <mixed-citation publication-type="journal">Saradhi, V. V., &amp; Palshikar, G. K. (2011). Employee churn prediction. Expert Systems with Applications, 38(3), 1999-2006.
 https://doi.org/10.1016/j.eswa.2010.07.134</mixed-citation>
                    </ref>
                                    <ref id="ref15">
                        <label>15</label>
                        <mixed-citation publication-type="journal">Shwartz-Ziv, R., &amp; Armon, A. (2022). Tabular data: Deep learning is not all you need. Information Fusion, 81, 84-90.
 https://doi.org/10.1016/j.inffus.2021.11.011</mixed-citation>
                    </ref>
                                    <ref id="ref16">
                        <label>16</label>
                        <mixed-citation publication-type="journal">Subhash, P. (2017). IBM HR analytics employee attrition &amp; performance [Data set]. Kaggle. https://www.kaggle.com/datasets/pavansubhasht/ibm-hr-analytics-attrition-dataset</mixed-citation>
                    </ref>
                                    <ref id="ref17">
                        <label>17</label>
                        <mixed-citation publication-type="journal">Thapliyal, N., Solanki, S., Pandey, N. K., &amp; Papola, S. (2024). Employee attrition analysis using XGBoost. In 2024 International Conference on Communication, Computer Sciences and Engineering (IC3SE) (pp. 1–6). IEEE. 
https://doi.org/10.1109/IC3SE62002.2024.10593326</mixed-citation>
                    </ref>
                                    <ref id="ref18">
                        <label>18</label>
                        <mixed-citation publication-type="journal">Wang, X., &amp; Zhi, J. (2021). A machine learning-based analytical framework for employee turnover prediction. Journal of Management Analytics, 8(3), 351–370. 
https://doi.org/10.1080/23270012.2021.1961318</mixed-citation>
                    </ref>
                                    <ref id="ref19">
                        <label>19</label>
                        <mixed-citation publication-type="journal">Yurtsever, M. (2024). Çalışan yıpranmasını tahmin etmede analitik bir yaklaşım: Topluluk öğrenme yöntemi. Afyon Kocatepe Üniversitesi İktisadi ve İdari Bilimler Fakültesi Dergisi, 26(Özel Sayı), 150-160. https://doi.org/10.33707/akuiibfd.1462567</mixed-citation>
                    </ref>
                            </ref-list>
                    </back>
    </article>
