<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20241031//EN"
        "https://jats.nlm.nih.gov/publishing/1.4/JATS-journalpublishing1-4.dtd">
<article  article-type="research-article"        dtd-version="1.4">
            <front>

                <journal-meta>
                                                                <journal-id>ehta</journal-id>
            <journal-title-group>
                                                                                    <journal-title>Eurasian Journal of Health Technology Assessment</journal-title>
            </journal-title-group>
                                        <issn pub-type="epub">2587-0122</issn>
                                                                                            <publisher>
                    <publisher-name>Sağlık Bakanlığı Sağlık Hizmetleri Genel Müdürlüğü</publisher-name>
                </publisher>
                    </journal-meta>
                <article-meta>
                                        <article-id pub-id-type="doi">10.52148/ehta.1768556</article-id>
                                                                <article-categories>
                                            <subj-group  xml:lang="en">
                                                            <subject>Health and Ecological Risk Assessment</subject>
                                                            <subject>Digital Health</subject>
                                                    </subj-group>
                                            <subj-group  xml:lang="tr">
                                                            <subject>Sağlık ve Ekolojik Risk Değerlendirmesi</subject>
                                                            <subject>Dijital Sağlık</subject>
                                                    </subj-group>
                                    </article-categories>
                                                                                                                                                        <title-group>
                                                                                                                                                            <article-title>A Balanced Machine Learning Approach to Obesity Risk Classification: Comparative Analysis and Feature Importance</article-title>
                                                                                                    </title-group>
            
                                                    <contrib-group content-type="authors">
                                                                        <contrib contrib-type="author">
                                                                    <contrib-id contrib-id-type="orcid">
                                        https://orcid.org/0000-0002-8568-4717</contrib-id>
                                                                <name>
                                    <surname>Koç</surname>
                                    <given-names>Haydar</given-names>
                                </name>
                                                                    <aff>CANKIRI KARATEKIN UNIVERSITY</aff>
                                                            </contrib>
                                                    <contrib contrib-type="author">
                                                                    <contrib-id contrib-id-type="orcid">
                                        https://orcid.org/0000-0001-5204-0846</contrib-id>
                                                                <name>
                                    <surname>Koc</surname>
                                    <given-names>Tuba</given-names>
                                </name>
                                                                    <aff>CANKIRI KARATEKIN UNIVERSITY</aff>
                                                            </contrib>
                                                                                </contrib-group>
                        
                                        <pub-date pub-type="pub" iso-8601-date="20251231">
                    <day>12</day>
                    <month>31</month>
                    <year>2025</year>
                </pub-date>
                                        <volume>9</volume>
                                        <issue>2</issue>
                                        <fpage>90</fpage>
                                        <lpage>107</lpage>
                        
                        <history>
                                    <date date-type="received" iso-8601-date="20250819">
                        <day>08</day>
                        <month>19</month>
                        <year>2025</year>
                    </date>
                                                    <date date-type="accepted" iso-8601-date="20251117">
                        <day>11</day>
                        <month>17</month>
                        <year>2025</year>
                    </date>
                            </history>
                                        <permissions>
                    <copyright-statement>Copyright © 2016, Eurasian Journal of Health Technology Assessment</copyright-statement>
                    <copyright-year>2016</copyright-year>
                    <copyright-holder>Eurasian Journal of Health Technology Assessment</copyright-holder>
                </permissions>
            
                                                                                                                        <abstract><p>Obesity is a growing public health concern, particularly among university students who are exposed to lifestyle changes, disordered eating habits, and reduced physical activity. The aim of this study is to classify obesity risk levels among university students using machine learning classification methods and to identify the most influential factors associated with this risk. The study sample consisted of data collected from 445 students studying at Çankırı Karatekin University. In this context, eight machine learning algorithms—Logistic Regression, Random Forest, Extra Trees, Support Vector Machines, K-Nearest Neighbor, Quadratic Discriminant Analysis, Naive Bayes, and Multilayer Perceptron—were compared to classify obesity risk. Class imbalance in the dataset was addressed using the Adaptive Synthetic Sampling (ADASYN) method applied exclusively to the training set. The models were evaluated using standard performance metrics, and the highest accuracy rate (96.26%) was achieved by the Random Forest model, followed by Logistic Regression with 94.77% accuracy. Variable importance analysis indicated that age, internet use scale score, and fast-food consumption frequency were the most influential factors in classification, while the low correlation between variables (|r| &amp;lt; 0.2) suggested that model performance was driven by the combined contribution of multiple features. Overall, the findings demonstrate that the balanced machine learning approach, particularly ensemble-based methods, can classify obesity risk with high accuracy and provide valuable insights for targeted prevention strategies among university students.</p></abstract>
                                                            
            
                                                                                        <kwd-group>
                                                    <kwd>Adaptive synthetic sampling</kwd>
                                                    <kwd>  machine learning</kwd>
                                                    <kwd>  obesity</kwd>
                                                    <kwd>  young adults.</kwd>
                                            </kwd-group>
                            
                                                                                                                                                    </article-meta>
    </front>
    <back>
                            <ref-list>
                                    <ref id="ref1">
                        <label>1</label>
                        <mixed-citation publication-type="journal">1.	Akın, P. (2023). A new hybrid approach based on genetic algorithm and support vector machine methods for hyperparameter optimization in synthetic minority over-sampling technique (SMOTE). AIMS Mathematics, 8(6), 9400–9415.</mixed-citation>
                    </ref>
                                    <ref id="ref2">
                        <label>2</label>
                        <mixed-citation publication-type="journal">2.	Alzahrani, S. H., Saeedi, A. A., Baamer, M. K., Shalabi, A. F., &amp; Alzahrani, A. M. (2020). Eating habits among medical students at king abdulaziz university, Jeddah, Saudi Arabia. International journal of general medicine, 77-88.</mixed-citation>
                    </ref>
                                    <ref id="ref3">
                        <label>3</label>
                        <mixed-citation publication-type="journal">3.	Bikku, T. (2020). Multi-layered deep learning perceptron approach for health risk prediction. Journal of Big Data, 7(1), 50.</mixed-citation>
                    </ref>
                                    <ref id="ref4">
                        <label>4</label>
                        <mixed-citation publication-type="journal">4.	Bishop, C. M., &amp; Nasrabadi, N. M. (2006). Pattern recognition and machine learning (Vol. 4, No. 4, p. 738). New York: springer.</mixed-citation>
                    </ref>
                                    <ref id="ref5">
                        <label>5</label>
                        <mixed-citation publication-type="journal">5.	Breiman, L. (2001). Random forests. Machine learning, 45(1), 5-32.</mixed-citation>
                    </ref>
                                    <ref id="ref6">
                        <label>6</label>
                        <mixed-citation publication-type="journal">6.	Brownlee, J. (2020). Imbalanced classification with Python: better metrics, balance skewed classes, cost-sensitive learning. Machine Learning Mastery.</mixed-citation>
                    </ref>
                                    <ref id="ref7">
                        <label>7</label>
                        <mixed-citation publication-type="journal">7.	Chatterjee, A., Gerdes, M. W., &amp; Martinez, S. G. (2020). Identification of risk factors associated with obesity and overweight—a machine learning overview. Sensors, 20(9), 2734.</mixed-citation>
                    </ref>
                                    <ref id="ref8">
                        <label>8</label>
                        <mixed-citation publication-type="journal">8.	Choudhuri, A. (2022). A hybrid machine learning model for estimation of obesity levels. In Data management, analytics and innovation conference (pp. 257–266). Springer. https://doi.org/10.1007/978-981-19-2600-6_22</mixed-citation>
                    </ref>
                                    <ref id="ref9">
                        <label>9</label>
                        <mixed-citation publication-type="journal">9.	Cortes, C., &amp; Vapnik, V. (1995). Support-vector networks. Machine learning, 20(3), 273-297.</mixed-citation>
                    </ref>
                                    <ref id="ref10">
                        <label>10</label>
                        <mixed-citation publication-type="journal">10.	Cover, T., &amp; Hart, P. (1967). Nearest neighbor pattern classification. IEEE transactions on information theory, 13(1), 21-27.</mixed-citation>
                    </ref>
                                    <ref id="ref11">
                        <label>11</label>
                        <mixed-citation publication-type="journal">11.	Dirik, M. (2023). Application of machine learning techniques for obesity prediction: a comparative study. Journal of complexity in Health Sciences, 6(2), 16-34.</mixed-citation>
                    </ref>
                                    <ref id="ref12">
                        <label>12</label>
                        <mixed-citation publication-type="journal">12.	Domingos, P., &amp; Pazzani, M. (1997). On the optimality of the simple Bayesian classifier under zero-one loss. Machine learning, 29(2), 103-130.</mixed-citation>
                    </ref>
                                    <ref id="ref13">
                        <label>13</label>
                        <mixed-citation publication-type="journal">13.	Dormann, C. F., Elith, J., Bacher, S., Buchmann, C., Carl, G., Carré, G., ... &amp; Lautenbach, S. (2013). Collinearity: a review of methods to deal with it and a simulation study evaluating their performance. Ecography, 36(1), 27-46.</mixed-citation>
                    </ref>
                                    <ref id="ref14">
                        <label>14</label>
                        <mixed-citation publication-type="journal">14.	Ferdowsy, F., Rahi, K. S. A., Jabiullah, M. I., &amp; Habib, M. T. (2021). A machine learning approach for obesity risk prediction. Current Research in Behavioral Sciences, 2, 100053.</mixed-citation>
                    </ref>
                                    <ref id="ref15">
                        <label>15</label>
                        <mixed-citation publication-type="journal">15.	Fernández, A., García, S., Galar, M., Prati, R. C., Krawczyk, B., &amp; Herrera, F. (2018). Learning from imbalanced data sets (Vol. 10, No. 2018, p. 4). Cham: Springer.</mixed-citation>
                    </ref>
                                    <ref id="ref16">
                        <label>16</label>
                        <mixed-citation publication-type="journal">16.	Fernández-Delgado, M., Cernadas, E., Barro, S., &amp; Amorim, D. (2014). Do we need hundreds of classifiers to solve real world classification problems?. The journal of machine learning research, 15(1), 3133-3181.</mixed-citation>
                    </ref>
                                    <ref id="ref17">
                        <label>17</label>
                        <mixed-citation publication-type="journal">17.	Géron, A. (2022). Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow. &quot; O&#039;Reilly Media, Inc.&quot;.</mixed-citation>
                    </ref>
                                    <ref id="ref18">
                        <label>18</label>
                        <mixed-citation publication-type="journal">18.	Geurts, P., Ernst, D., &amp; Wehenkel, L. (2006). Extremely randomized trees. Machine learning, 63(1), 3-42.</mixed-citation>
                    </ref>
                                    <ref id="ref19">
                        <label>19</label>
                        <mixed-citation publication-type="journal">19.	Friedman, J. (2009). The elements of statistical learning: Data mining, inference, and prediction. (No Title).</mixed-citation>
                    </ref>
                                    <ref id="ref20">
                        <label>20</label>
                        <mixed-citation publication-type="journal">20.	He, H., Bai, Y., Garcia, E. A., &amp; Li, S. (2008, June). ADASYN: Adaptive synthetic sampling approach for imbalanced learning. In 2008 IEEE international joint conference on neural networks (IEEE world congress on computational intelligence) (pp. 1322-1328). Ieee.</mixed-citation>
                    </ref>
                                    <ref id="ref21">
                        <label>21</label>
                        <mixed-citation publication-type="journal">21.	Helforoush, Z., &amp; Sayyad, H. (2024). Prediction and classification of obesity risk based on a hybrid metaheuristic machine learning approach. Frontiers in big Data, 7, 1469981.</mixed-citation>
                    </ref>
                                    <ref id="ref22">
                        <label>22</label>
                        <mixed-citation publication-type="journal">22.	Hosmer Jr, D. W., Lemeshow, S., &amp; Sturdivant, R. X. (2013). Applied logistic regression. John Wiley &amp; Sons.</mixed-citation>
                    </ref>
                                    <ref id="ref23">
                        <label>23</label>
                        <mixed-citation publication-type="journal">23.	Hruby, A., &amp; Hu, F. B. (2015). The epidemiology of obesity: a big picture. Pharmacoeconomics, 33(7), 673-689.</mixed-citation>
                    </ref>
                                    <ref id="ref24">
                        <label>24</label>
                        <mixed-citation publication-type="journal">24.	Kotsiantis, S., Kanellopoulos, D., &amp; Pintelas, P. (2006). Handling imbalanced datasets: A review. GESTS international transactions on computer science and engineering, 30(1), 25-36.</mixed-citation>
                    </ref>
                                    <ref id="ref25">
                        <label>25</label>
                        <mixed-citation publication-type="journal">25.	Musa, F., &amp; Basaky, F. (2022). Obesity prediction using machine learning techniques. Journal of Applied Artificial Intelligence, 3(1), 24–33.</mixed-citation>
                    </ref>
                                    <ref id="ref26">
                        <label>26</label>
                        <mixed-citation publication-type="journal">26.	Murtagh, F. (1991). Multilayer perceptrons for classification and regression. Neurocomputing, 2(5-6), 183-197.</mixed-citation>
                    </ref>
                                    <ref id="ref27">
                        <label>27</label>
                        <mixed-citation publication-type="journal">27.	Naidu, G., Zuva, T., Sibanda, E.M. (2023). A Review of Evaluation Metrics in Machine Learning Algorithms. In: Silhavy, R., Silhavy, P. (eds) Artificial Intelligence Application in Networks and Systems. CSOC 2023. Lecture Notes in Networks and Systems, vol 724. Springer, Cham. https://doi.org/10.1007/978-3-031-35314-7_2</mixed-citation>
                    </ref>
                                    <ref id="ref28">
                        <label>28</label>
                        <mixed-citation publication-type="journal">28.	Nelson, M. C., Story, M., Larson, N. I., Neumark-Sztainer, D., &amp; Lytle, L. A. (2008). Emerging adulthood and college-aged youth: an overlooked age for weight-related behavior change. Obesity.</mixed-citation>
                    </ref>
                                    <ref id="ref29">
                        <label>29</label>
                        <mixed-citation publication-type="journal">29.	Olagunju, M. T., Aleru, E. O., Abodunrin, O. R., Adedini, C. B., Ola, O. M., Abel, C., ... &amp; Akinsolu, F. T. (2024). Association between meal skipping and the double burden of malnutrition among university students. North African Journal of Food and Nutrition Research, 8(17), 167-177.</mixed-citation>
                    </ref>
                                    <ref id="ref30">
                        <label>30</label>
                        <mixed-citation publication-type="journal">30.	Şengul, S., Lopcu, K., &amp; Cam, S. (2020). Determinants of the obesity of adults in Turkey: An empirical study. Review of applied socio-economic research, 20(2), 60-71.</mixed-citation>
                    </ref>
                                    <ref id="ref31">
                        <label>31</label>
                        <mixed-citation publication-type="journal">31.	Pendergast, F. J., Livingstone, K. M., Worsley, A., &amp; McNaughton, S. A. (2016). Correlates of meal skipping in young adults: a systematic review. International Journal of Behavioral Nutrition and Physical Activity, 13(1), 125.</mixed-citation>
                    </ref>
                                    <ref id="ref32">
                        <label>32</label>
                        <mixed-citation publication-type="journal">32.	Rish, I. (2001, August). An empirical study of the naive Bayes classifier. In IJCAI 2001 workshop on empirical methods in artificial intelligence (Vol. 3, No. 22, pp. 41-46).</mixed-citation>
                    </ref>
                                    <ref id="ref33">
                        <label>33</label>
                        <mixed-citation publication-type="journal">33.	Şahin, C., &amp; Korkmaz, Ö. (2011). İnternet bağımlılığı ölçeğinin Türkçeye uyarlanması. Selçuk Üniversitesi Ahmet Keleşoğlu Eğitim Fakültesi Dergisi, 32(1), 101-115.</mixed-citation>
                    </ref>
                                    <ref id="ref34">
                        <label>34</label>
                        <mixed-citation publication-type="journal">34.	World Health Organization. (2024). Obesity and overweight. https://www.who.int/news-room/fact-sheets/detail/obesity-and-overweight</mixed-citation>
                    </ref>
                                    <ref id="ref35">
                        <label>35</label>
                        <mixed-citation publication-type="journal">35.	Yağmur, N. (2024). A hybrid approach to obesity level determination with decision tree and pelican optimization algorithm. Journal of Scientific Reports-A, 57, 97–109. https://doi.org/10.59313/jsr-a.1447814</mixed-citation>
                    </ref>
                            </ref-list>
                    </back>
    </article>
