<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20241031//EN"
        "https://jats.nlm.nih.gov/publishing/1.4/JATS-journalpublishing1-4.dtd">
<article  article-type="research-article"        dtd-version="1.4">
            <front>

                <journal-meta>
                                    <journal-id></journal-id>
            <journal-title-group>
                                                                                    <journal-title>Bitlis Eren Üniversitesi Fen Bilimleri Dergisi</journal-title>
            </journal-title-group>
                            <issn pub-type="ppub">2147-3129</issn>
                                        <issn pub-type="epub">2147-3188</issn>
                                                                                            <publisher>
                    <publisher-name>Bitlis Eren University</publisher-name>
                </publisher>
                    </journal-meta>
                <article-meta>
                                        <article-id pub-id-type="doi">10.17798/bitlisfen.1796956</article-id>
                                                                <article-categories>
                                            <subj-group  xml:lang="en">
                                                            <subject>Natural Language Processing</subject>
                                                    </subj-group>
                                            <subj-group  xml:lang="tr">
                                                            <subject>Doğal Dil İşleme</subject>
                                                    </subj-group>
                                    </article-categories>
                                                                                                                                                        <title-group>
                                                                                                                        <article-title>AI vs. Human Text Detection: A High-Accuracy Ensemble Approach Using Machine Learning</article-title>
                                                                                                    </title-group>
            
                                                    <contrib-group content-type="authors">
                                                                        <contrib contrib-type="author">
                                                                    <contrib-id contrib-id-type="orcid">
                                        https://orcid.org/0000-0002-9864-2866</contrib-id>
                                                                <name>
                                    <surname>Kökver</surname>
                                    <given-names>Yunus</given-names>
                                </name>
                                                                    <aff>ANKARA UNIVERSITY, ELMADAĞ VOCATIONAL SCHOOL, DEPARTMENT OF COMPUTER TECHNOLOGIES</aff>
                                                            </contrib>
                                                                                </contrib-group>
                        
                                        <pub-date pub-type="pub" iso-8601-date="20260324">
                    <day>03</day>
                    <month>24</month>
                    <year>2026</year>
                </pub-date>
                                        <volume>15</volume>
                                        <issue>1</issue>
                                        <fpage>245</fpage>
                                        <lpage>258</lpage>
                        
                        <history>
                                    <date date-type="received" iso-8601-date="20251004">
                        <day>10</day>
                        <month>04</month>
                        <year>2025</year>
                    </date>
                                                    <date date-type="accepted" iso-8601-date="20260209">
                        <day>02</day>
                        <month>09</month>
                        <year>2026</year>
                    </date>
                            </history>
                                        <permissions>
                    <copyright-statement>Copyright © 2012, Bitlis Eren Üniversitesi Fen Bilimleri Dergisi</copyright-statement>
                    <copyright-year>2012</copyright-year>
                    <copyright-holder>Bitlis Eren Üniversitesi Fen Bilimleri Dergisi</copyright-holder>
                </permissions>
            
                                                                                                <abstract><p>This study aims to develop and evaluate a machine learning (ML)-based classification model for distinguishing between texts generated by artificial intelligence (AI) and those written by humans. Utilizing a comprehensive dataset comprising 487235 text samples, various ML algorithms—including Multilayer Perceptron (MLP), Random Forest (RF), Gradient Boosting (GB), Logistic Regression (LR), Support Vector Machines (SVM), Decision Trees (DT), and an Ensemble Model—were trained and evaluated to classify AI-generated and human-generated texts. Ensemble Model, which combines the best-performing algorithms, achieved an accuracy rate of 99.90%, outperforming individual models. Additionally, the study presents a user-friendly interface that enables real-time classification of texts using the weights of the ensemble model. This interface holds potential as a practical tool for researchers and professionals in fields such as education, academia, and media. The model&#039;s generalization capability was also tested on a user-generated dataset through the user interface, and it was found to be consistent with the primary dataset, achieving an &quot;Almost Perfect&quot; level according to the Kappa statistic. This study highlights the necessity of robust tools to mitigate ethical and security risks associated with AI-generated content. Moreover, ensemble models show great promise in handling complex classification tasks.</p></abstract>
                                                            
            
                                                            <kwd-group>
                                                    <kwd>Natural Language Processing</kwd>
                                                    <kwd>  Artificial Intelligence and Ethics</kwd>
                                                    <kwd>  Machine Learning</kwd>
                                                    <kwd>  Ensemble Models</kwd>
                                                    <kwd>  Text Classification</kwd>
                                            </kwd-group>
                            
                                                                                                                        </article-meta>
    </front>
    <back>
                            <ref-list>
                                    <ref id="ref1">
                        <label>1</label>
                        <mixed-citation publication-type="journal">Y. Kökver, H. M. Pektaş, and H. Çelik, “Artificial intelligence applications in education: Natural language processing in detecting misconceptions,” Educ. Inf. Technol., pp. 1–32, Aug. 2024, doi: 10.1007/S10639-024-12919-1/FIGURES/2.</mixed-citation>
                    </ref>
                                    <ref id="ref2">
                        <label>2</label>
                        <mixed-citation publication-type="journal">T. B. Brown et al., “Language Models are Few-Shot Learners,” Adv. Neural Inf. Process. Syst., vol. 2020-December, May 2020, Accessed: Oct. 17, 2024. [Online]. Available: https://arxiv.org/abs/2005.14165v4</mixed-citation>
                    </ref>
                                    <ref id="ref3">
                        <label>3</label>
                        <mixed-citation publication-type="journal">J. Devlin, M.-W. Chang, K. Lee, K. T. Google, and A. I. Language, “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding,” Proc. 2019 Conf. North, pp. 4171–4186, 2019, doi: 10.18653/V1/N19-1423.</mixed-citation>
                    </ref>
                                    <ref id="ref4">
                        <label>4</label>
                        <mixed-citation publication-type="journal">C. D. Manning, “Introduction to information retrieval,” 2008, Cambridge university press.</mixed-citation>
                    </ref>
                                    <ref id="ref5">
                        <label>5</label>
                        <mixed-citation publication-type="journal">H. Alamleh, A. A. S. Alqahtani, and A. Elsaid, “Distinguishing Human-Written and ChatGPT-Generated Text Using Machine Learning,” 2023 Syst. Inf. Eng. Des. Symp. SIEDS 2023, pp. 154–158, 2023, doi: 10.1109/SIEDS58326.2023.10137767.</mixed-citation>
                    </ref>
                                    <ref id="ref6">
                        <label>6</label>
                        <mixed-citation publication-type="journal">M. Nour, B. Arabacı, H. Öcal, and K. Polat, “New approaches to epileptic seizure prediction based on EEG signals using hybrid CNNs,” Int. J. Intell. Eng. Informatics, vol. 12, no. 1, pp. 85–102, 2024, doi: 10.1504/IJIEI.2024.137706.</mixed-citation>
                    </ref>
                                    <ref id="ref7">
                        <label>7</label>
                        <mixed-citation publication-type="journal">M. T. Zamir, M. A. Ayub, A. Gul, N. Ahmad, and K. Ahmad, “Stylometry Analysis of Multi-authored Documents for Authorship and Author Style Change Detection,” Jan. 2024, Accessed: Oct. 15, 2024. [Online]. Available: https://arxiv.org/abs/2401.06752v1</mixed-citation>
                    </ref>
                                    <ref id="ref8">
                        <label>8</label>
                        <mixed-citation publication-type="journal">A. de Pablo, O. Araque, and C. A. Iglesias, “Radical Text Detection based on Stylometry,” in International Conference on Information Systems Security and Privacy, Science and Technology Publications, Lda, 2020, pp. 524–531. doi: 10.5220/0008971205240531.</mixed-citation>
                    </ref>
                                    <ref id="ref9">
                        <label>9</label>
                        <mixed-citation publication-type="journal">A. Pascucci, R. Manna, C. Caterino, V. Masucci, and J. Monti, “Is this hotel review truthful or deceptive? A platform for disinformation detection through computational stylometry,” in Proceedings for the First International Workshop on Social Threats in Online Conversations: Understanding and Management, 2020, pp. 35–40.</mixed-citation>
                    </ref>
                                    <ref id="ref10">
                        <label>10</label>
                        <mixed-citation publication-type="journal">H. El-Fiqi, E. Petraki, and H. A. Abbass, “A computational linguistic approach for the identification of translator stylometry using Arabic-English text,” IEEE Int. Conf. Fuzzy Syst., pp. 2039–2045, 2011, doi: 10.1109/FUZZY.2011.6007535.</mixed-citation>
                    </ref>
                                    <ref id="ref11">
                        <label>11</label>
                        <mixed-citation publication-type="journal">T. Kumarage, J. Garland, A. Bhattacharjee, K. Trapeznikov, S. Ruston, and H. Liu, “Stylometric detection of ai-generated text in twitter timelines,” arXiv Prepr. arXiv2303.03697, 2023.</mixed-citation>
                    </ref>
                                    <ref id="ref12">
                        <label>12</label>
                        <mixed-citation publication-type="journal">G. K. Mikros, A. Koursaris, D. Bilianos, and G. Markopoulos, “AI-Writing Detection Using an Ensemble of Transformers and Stylometric Features.,” in IberLEF@ SEPLN, 2023.</mixed-citation>
                    </ref>
                                    <ref id="ref13">
                        <label>13</label>
                        <mixed-citation publication-type="journal">H. Wang, J. Li, and Z. Li, “AI-generated text detection and classification based on BERT deep learning algorithm,” Theor. Nat. Sci., vol. 39, no. 1, pp. 312–317, Jul. 2024, doi: 10.54254/2753-8818/39/20240625.</mixed-citation>
                    </ref>
                                    <ref id="ref14">
                        <label>14</label>
                        <mixed-citation publication-type="journal">D. Trandabat and D. Gifu, “Discriminating AI-generated Fake News,” Procedia Comput. Sci., vol. 225, pp. 3822–3831, Jan. 2023, doi: 10.1016/J.PROCS.2023.10.378.</mixed-citation>
                    </ref>
                                    <ref id="ref15">
                        <label>15</label>
                        <mixed-citation publication-type="journal">R. Kumar and M. Mindzak, “Who wrote this? Detecting artificial intelligence–generated text from human-written text,” Can. Perspect. Acad. Integr., vol. 7, no. 1, 2024.</mixed-citation>
                    </ref>
                                    <ref id="ref16">
                        <label>16</label>
                        <mixed-citation publication-type="journal">A. Akram, “An empirical study of ai generated text detection tools,” arXiv Prepr. arXiv2310.01423, 2023, doi: https://doi.org/10.48550/arXiv.2310.01423.</mixed-citation>
                    </ref>
                                    <ref id="ref17">
                        <label>17</label>
                        <mixed-citation publication-type="journal">R. Gaggar, A. Bhagchandani, and H. Oza, “Machine-generated text detection using deep learning,” arXiv Prepr. arXiv2311.15425, 2023.</mixed-citation>
                    </ref>
                                    <ref id="ref18">
                        <label>18</label>
                        <mixed-citation publication-type="journal">Y. Mo, H. Qin, Y. Dong, Z. Zhu, and Z. Li, “Large Language Model (LLM) AI text generation detection based on transformer deep learning algorithm,” Apr. 2024, doi: 10.48550/arxiv.2405.06652.</mixed-citation>
                    </ref>
                                    <ref id="ref19">
                        <label>19</label>
                        <mixed-citation publication-type="journal">Y. Zhang et al., “Enhancing Text Authenticity: A Novel Hybrid Approach for AI-Generated Text Detection,” Jun. 2024, Accessed: Oct. 15, 2024. [Online]. Available: https://arxiv.org/abs/2406.06558v1</mixed-citation>
                    </ref>
                                    <ref id="ref20">
                        <label>20</label>
                        <mixed-citation publication-type="journal">A. Najee-Ullah, L. Landeros, Y. Balytskyi, and S. Y. Chang, “Towards Detection of AI-Generated Texts and Misinformation,” Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), vol. 13176 LNCS, pp. 194–205, 2022, doi: 10.1007/978-3-031-10183-0_10/FIGURES/4.</mixed-citation>
                    </ref>
                                    <ref id="ref21">
                        <label>21</label>
                        <mixed-citation publication-type="journal">T. T. Nguyen, A. Hatua, and A. H. Sung, “How to Detect AI-Generated Texts?,” 2023 IEEE 14th Annu. Ubiquitous Comput. Electron. Mob. Commun. Conf. UEMCON 2023, pp. 464–471, 2023, doi: 10.1109/UEMCON59035.2023.10316132.</mixed-citation>
                    </ref>
                                    <ref id="ref22">
                        <label>22</label>
                        <mixed-citation publication-type="journal">P. C. Theocharopoulos, P. Anagnostou, A. Tsoukala, S. V. Georgakopoulos, S. K. Tasoulis, and V. P. Plagianakos, “Detection of Fake Generated Scientific Abstracts,” Proc. - IEEE 9th Int. Conf. Big Data Comput. Serv. Appl. BigDataService 2023, pp. 33–39, 2023, doi: 10.1109/BIGDATASERVICE58306.2023.00011.</mixed-citation>
                    </ref>
                                    <ref id="ref23">
                        <label>23</label>
                        <mixed-citation publication-type="journal">M. A. Quidwai, C. Li, and P. Dube, “Beyond black box ai-generated plagiarism detection: From sentence to document level,” arXiv Prepr. arXiv2306.08122, 2023.</mixed-citation>
                    </ref>
                                    <ref id="ref24">
                        <label>24</label>
                        <mixed-citation publication-type="journal">Y. Zhang, T. Zhou, H. Qiao, and T. Li, “Ethical Issues in AI-Generated Texts: A Systematic Review and Analysis,” Int. J. Human–Computer Interact., pp. 1–28, 2025.</mixed-citation>
                    </ref>
                                    <ref id="ref25">
                        <label>25</label>
                        <mixed-citation publication-type="journal">Z. Ji et al., “Survey of hallucination in natural language generation,” ACM Comput. Surv., vol. 55, no. 12, pp. 1–38, 2023.</mixed-citation>
                    </ref>
                                    <ref id="ref26">
                        <label>26</label>
                        <mixed-citation publication-type="journal">R. M. Branch, “Instructional design: The ADDIE approach,” Instr. Des. ADDIE Approach, pp. 1–203, 2010, doi: 10.1007/978-0-387-09506-6/COVER.</mixed-citation>
                    </ref>
                                    <ref id="ref27">
                        <label>27</label>
                        <mixed-citation publication-type="journal">O. Karamustafaoğlu and H. M. Pektaş, “Developing students’ creative problem solving skills with inquiry-based STEM activity in an out-of-school learning environment,” Educ. Inf. Technol., vol. 28, no. 6, pp. 7651–7669, Jun. 2023, doi: 10.1007/S10639-022-11496-5/TABLES/4.</mixed-citation>
                    </ref>
                                    <ref id="ref28">
                        <label>28</label>
                        <mixed-citation publication-type="journal">T. Trust and E. Pektas, “Using the ADDIE Model and Universal Design for Learning Principles to Develop an Open Online Course for Teacher Professional Development,” J. Digit. Learn. Teach. Educ., vol. 34, no. 4, pp. 219–233, Oct. 2018, doi: 10.1080/21532974.2018.1494521.</mixed-citation>
                    </ref>
                                    <ref id="ref29">
                        <label>29</label>
                        <mixed-citation publication-type="journal">U. Shafique and H. Qaiser, “A comparative study of data mining process models (KDD, CRISP-DM and SEMMA),” Int. J. Innov. Sci. Res., vol. 12, no. 1, pp. 217–222, 2014.</mixed-citation>
                    </ref>
                                    <ref id="ref30">
                        <label>30</label>
                        <mixed-citation publication-type="journal">S. Gerami, “AI Vs Human Text.” Accessed: Oct. 17, 2024. [Online]. Available: https://www.kaggle.com/datasets/shanegerami/ai-vs-human-text</mixed-citation>
                    </ref>
                                    <ref id="ref31">
                        <label>31</label>
                        <mixed-citation publication-type="journal">“Better language models and their implications | OpenAI.” Accessed: Dec. 22, 2024. [Online]. Available: https://openai.com/index/better-language-models/</mixed-citation>
                    </ref>
                                    <ref id="ref32">
                        <label>32</label>
                        <mixed-citation publication-type="journal">F. Martinelli, F. Mercaldo, L. Petrillo, and A. Santone, “A Method for AI-generated sentence detection through Large Language Models,” Procedia Comput. Sci., vol. 246, no. C, pp. 4853–4862, Jan. 2024, doi: 10.1016/J.PROCS.2024.09.351.</mixed-citation>
                    </ref>
                                    <ref id="ref33">
                        <label>33</label>
                        <mixed-citation publication-type="journal">“andythetechnerd03/AI-human-text · Datasets at Hugging Face.” Accessed: Jan. 03, 2025. [Online]. Available: https://huggingface.co/datasets/andythetechnerd03/AI-human-text</mixed-citation>
                    </ref>
                                    <ref id="ref34">
                        <label>34</label>
                        <mixed-citation publication-type="journal">W. Y. Wang, “‘Liar, Liar Pants on Fire’: A New Benchmark Dataset for Fake News Detection,” ACL 2017 - 55th Annu. Meet. Assoc. Comput. Linguist. Proc. Conf. (Long Pap., vol. 2, pp. 422–426, 2017, doi: 10.18653/V1/P17-2067.</mixed-citation>
                    </ref>
                                    <ref id="ref35">
                        <label>35</label>
                        <mixed-citation publication-type="journal">“Data Science, Machine Learning, AI &amp; Analytics - KDnuggets.” Accessed: Jan. 03, 2025. [Online]. Available: https://www.kdnuggets.com/</mixed-citation>
                    </ref>
                            </ref-list>
                    </back>
    </article>
