<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20241031//EN"
        "https://jats.nlm.nih.gov/publishing/1.4/JATS-journalpublishing1-4.dtd">
<article  article-type="research-article"        dtd-version="1.4">
            <front>

                <journal-meta>
                                                                <journal-id>adv. artif. intell. res.</journal-id>
            <journal-title-group>
                                                                                    <journal-title>Advances in Artificial Intelligence Research</journal-title>
            </journal-title-group>
                            <issn pub-type="ppub">2757-7422</issn>
                                                                                                        <publisher>
                    <publisher-name>Osman ÖZKARACA</publisher-name>
                </publisher>
                    </journal-meta>
                <article-meta>
                                        <article-id pub-id-type="doi">10.54569/aair.1549781</article-id>
                                                                <article-categories>
                                            <subj-group  xml:lang="en">
                                                            <subject>Machine Learning (Other)</subject>
                                                            <subject>Natural Language Processing</subject>
                                                    </subj-group>
                                            <subj-group  xml:lang="tr">
                                                            <subject>Makine Öğrenme (Diğer)</subject>
                                                            <subject>Doğal Dil İşleme</subject>
                                                    </subj-group>
                                    </article-categories>
                                                                                                                                                        <title-group>
                                                                                                                        <trans-title-group xml:lang="tr">
                                    <trans-title>A Comparative Study of Machine Learning Classifiers for Different Language Spam SMS Detection: Performance Evaluation and Analysis</trans-title>
                                </trans-title-group>
                                                                                                                                                                                                <article-title>A Comparative Study of Machine Learning Classifiers for Different Language Spam SMS Detection: Performance Evaluation and Analysis</article-title>
                                                                                                    </title-group>
            
                                                    <contrib-group content-type="authors">
                                                                        <contrib contrib-type="author">
                                                                    <contrib-id contrib-id-type="orcid">
                                        https://orcid.org/0009-0009-7647-0731</contrib-id>
                                                                <name>
                                    <surname>Dev Sharma</surname>
                                    <given-names>Samrat Kumar</given-names>
                                </name>
                                                                    <aff>Jagannath University</aff>
                                                            </contrib>
                                                                                </contrib-group>
                        
                                        <pub-date pub-type="pub" iso-8601-date="20241230">
                    <day>12</day>
                    <month>30</month>
                    <year>2024</year>
                </pub-date>
                                        <volume>4</volume>
                                        <issue>2</issue>
                                        <fpage>69</fpage>
                                        <lpage>77</lpage>
                        
                        <history>
                                    <date date-type="received" iso-8601-date="20240913">
                        <day>09</day>
                        <month>13</month>
                        <year>2024</year>
                    </date>
                                                    <date date-type="accepted" iso-8601-date="20241228">
                        <day>12</day>
                        <month>28</month>
                        <year>2024</year>
                    </date>
                            </history>
                                        <permissions>
                    <copyright-statement>Copyright © 2020, Advances in Artificial Intelligence Research</copyright-statement>
                    <copyright-year>2020</copyright-year>
                    <copyright-holder>Advances in Artificial Intelligence Research</copyright-holder>
                </permissions>
            
                                                                                                <trans-abstract xml:lang="tr">
                            <p>With the continuous rise in the number of mobile device users, SMS (Short Message Service) remains a prevalent communication tool accessible on both smartphones and basic phones. Consequently, SMS traffic has experienced a significant surge. This increase has also led to a rise in spam messages, as spammers seek financial or business gains through activities like marketing promotions, lottery scams, and credit card information theft. Consequently, spam classification has become a focal point of research. In this paper, we explore the effectiveness of 11 machine learning algorithms for SMS spam detection, including multinomial Naïve Bayes, K-Nearest Neighbors (KNN), and Random Forest, among others. Utilizing datasets from UCI and Bangla SMS collections, our experimental results reveal that the multinomial Naïve Bayes algorithm surpasses previous models in spam detection, achieving accuracies of 98.65% and 89.10% in the respective datasets.</p></trans-abstract>
                                                                                                                                    <abstract><p>With the continuous rise in the number of mobile device users, SMS (Short Message Service) remains a prevalent communication tool accessible on both smartphones and basic phones. Consequently, SMS traffic has experienced a significant surge. This increase has also led to a rise in spam messages, as spammers seek financial or business gains through activities like marketing promotions, lottery scams, and credit card information theft. Consequently, spam classification has become a focal point of research. In this paper, we explore the effectiveness of 11 machine learning algorithms for SMS spam detection, including multinomial Naïve Bayes, K-Nearest Neighbors (KNN), and Random Forest, among others. Utilizing datasets from UCI and Bangla SMS collections, our experimental results reveal that the multinomial Naïve Bayes algorithm surpasses previous models in spam detection, achieving accuracies of 98.65% and 89.10% in the respective datasets.</p></abstract>
                                                            
            
                                                                                        <kwd-group>
                                                    <kwd>Spam SMS Detection</kwd>
                                                    <kwd>  NLP</kwd>
                                                    <kwd>  Machine Learning</kwd>
                                                    <kwd>  Deep Learning</kwd>
                                                    <kwd>  Naïve Bayes</kwd>
                                            </kwd-group>
                            
                                                <kwd-group xml:lang="tr">
                                                    <kwd>Spam SMS Detection</kwd>
                                                    <kwd>  NLP</kwd>
                                                    <kwd>  Machine Learning</kwd>
                                                    <kwd>  Deep Learning</kwd>
                                                    <kwd>  Naïve Bayes</kwd>
                                            </kwd-group>
                                                                                                                                        </article-meta>
    </front>
    <back>
                            <ref-list>
                                    <ref id="ref1">
                        <label>1</label>
                        <mixed-citation publication-type="journal">A. Alli and S. Misra, &quot;A deep learning method for automatic SMS spam classification: Performance of learning algorithms on indigenous dataset,&quot; Concurrency and Computation: Practice and Experience, vol. 34, p. 34, 2022.</mixed-citation>
                    </ref>
                                    <ref id="ref2">
                        <label>2</label>
                        <mixed-citation publication-type="journal">S. D. Gupta, S. Saha and S. K. Das, &quot;SMS spam detection using machine learning,&quot; in Journal of Physics: Conference Series, 2021.</mixed-citation>
                    </ref>
                                    <ref id="ref3">
                        <label>3</label>
                        <mixed-citation publication-type="journal">T. Almeida and J. Hidalgo, &quot;SMS Spam Collection,&quot; 2011.</mixed-citation>
                    </ref>
                                    <ref id="ref4">
                        <label>4</label>
                        <mixed-citation publication-type="journal">X. Liu, H. Lu and A. Nayak, &quot;A Spam Transformer Model for SMS Spam Detection,&quot; IEEE Access, vol. 9, pp. 80253-80263, 2021.</mixed-citation>
                    </ref>
                                    <ref id="ref5">
                        <label>5</label>
                        <mixed-citation publication-type="journal">S. Gadde, A. Lakshmanarao and S. Satyanarayana, &quot;SMS Spam Detection using Machine Learning and Deep Learning Techniques,&quot; 2021 7th International Conference on Advanced Computing and Communication Systems (ICACCS), vol. 1, pp. 358-362, 2021.</mixed-citation>
                    </ref>
                                    <ref id="ref6">
                        <label>6</label>
                        <mixed-citation publication-type="journal">P. J. Yerima and S, &quot;A comparative study of word embedding techniques for SMS spam detection,&quot; 2022 14th International Conference on Computational Intelligence and Communication Networks (CICN), pp. 149-155, 2022.</mixed-citation>
                    </ref>
                                    <ref id="ref7">
                        <label>7</label>
                        <mixed-citation publication-type="journal">D. Suleiman and G. Al-Naymat, &quot;SMS spam detection using H2O framework,&quot; Procedia computer science 113, pp. 154-161, 2017.</mixed-citation>
                    </ref>
                                    <ref id="ref8">
                        <label>8</label>
                        <mixed-citation publication-type="journal">G. L. Haq, S. Nazir and H. U. Khan, &quot;Spam Detection Approach for Secure Mobile Message Communication Using Machine Learning Algorithms,&quot; Secur. Commun. Networks, vol. 2020, pp. 8873639:1-8873639:6, 2020.</mixed-citation>
                    </ref>
                                    <ref id="ref9">
                        <label>9</label>
                        <mixed-citation publication-type="journal">L. P. Lim and M. M. Singh, &quot;Resolving the imbalance issue in short messaging service spam dataset using cost-sensitive techniques,&quot; Journal of Information Security and Applications, vol. 54, p. 102558, 2020.</mixed-citation>
                    </ref>
                                    <ref id="ref10">
                        <label>10</label>
                        <mixed-citation publication-type="journal">E. Wijaya, G. Noveliora, K. D. Utami, Rojali and G. Z. Nabiilah, &quot;Spam Detection in Short Message Service (SMS) Using Naïve Bayes, SVM, LSTM, and CNN,&quot; 2023 10th International Conference on Information Technology, Computer, and Electrical Engineering (ICITACEE), pp. 431-436, 2023.</mixed-citation>
                    </ref>
                                    <ref id="ref11">
                        <label>11</label>
                        <mixed-citation publication-type="journal">E. Sankar, &quot;Sms Spam Detection Using Machine Learning,&quot; Interantional Journal Of Scientific Research In Engineering And Management, 2023.</mixed-citation>
                    </ref>
                                    <ref id="ref12">
                        <label>12</label>
                        <mixed-citation publication-type="journal">Mahadev and H. Jain, &quot;An Analysis of SMS Spam Detection using Machine Learning Model,&quot; 2022 Fifth International Conference on Computational Intelligence and Communication Technologies (CCICT), pp. 151-156, 2022.</mixed-citation>
                    </ref>
                                    <ref id="ref13">
                        <label>13</label>
                        <mixed-citation publication-type="journal">A. A. M. Tasmia, A. A. N. ,. Jidney and Z. M. A. M. Haque, &quot;Ensemble Approach to Classify Spam SMS from Bengali Text,&quot; in Springer Nature, kolkata, 2023.</mixed-citation>
                    </ref>
                                    <ref id="ref14">
                        <label>14</label>
                        <mixed-citation publication-type="journal">F. Khan, R. Mustafa, F. Tasnim, T. Mahmud, M. S. Hossain and K. Andersson, &quot;Exploring BERT and ELMo for Bangla Spam SMS Dataset Creation and Detection,&quot; in 2023 26th International Conference on Computer and Information Technology (ICCIT), 2023.</mixed-citation>
                    </ref>
                                    <ref id="ref15">
                        <label>15</label>
                        <mixed-citation publication-type="journal">R. G. d. Luna, V. C. Magnaye, R. A. L. Reaño, K. L. Enriquez, D. Astorga, T. Celestial, A. M. Española, B. A. Lanting, D. Mugar, M. Ramos and J. Redondo, &quot;A Machine Learning Approach for Efficient Spam Detection in Short Messaging System (SMS),&quot; TENCON 2023 - 2023 IEEE Region 10 Conference (TENCON), pp. 53-58, 2023.</mixed-citation>
                    </ref>
                                    <ref id="ref16">
                        <label>16</label>
                        <mixed-citation publication-type="journal">R. G. d. L. Redondo, V. C. Magnaye, R. A. L. Reaño, K. L. E. a. D. Astorga, T. Celestial, A. M. Española, B. A. Lanting, D. Mugar, M. Ramos and Jenjazel, &quot;A Machine Learning Approach for Efficient Spam Detection in Short Messaging System (SMS),&quot; TENCON 2023 - 2023 IEEE Region 10 Conference (TENCON), pp. 53-58, 2023.</mixed-citation>
                    </ref>
                                    <ref id="ref17">
                        <label>17</label>
                        <mixed-citation publication-type="journal">Ojo and D. A. Oyeyemi, &quot;SMS Spam Detection and Classification to Combat Abuse in Telephone Networks Using Natural Language Processing,&quot; Journal of Advances in Mathematics and Computer Science, 2023.</mixed-citation>
                    </ref>
                                    <ref id="ref18">
                        <label>18</label>
                        <mixed-citation publication-type="journal">S. Alghazzawi and D. Alqahtani, &quot;A survey of Emerging Techniques in Detecting SMS Spam,&quot; Transactions on Machine Learning and Artificial Intelligence, 2019.</mixed-citation>
                    </ref>
                                    <ref id="ref19">
                        <label>19</label>
                        <mixed-citation publication-type="journal">U. M. Kundi, S. Rehman, T. Ali, K. Mahmood and T. Alsaedi, &quot;An Intelligent Framework Based on Deep Learning for SMS and e-mail Spam Detection,&quot; Appl. Comput. Intell. Soft Comput., vol. 2023, pp. 6648970:1-6648970:16, 2023.</mixed-citation>
                    </ref>
                                    <ref id="ref20">
                        <label>20</label>
                        <mixed-citation publication-type="journal">M. Gupta, A. Bakliwal, S. Agarwal and P. Mehndiratta, &quot;A comparative study of spam SMS detection using machine learning classifiers,&quot; in IEEE, 2018.</mixed-citation>
                    </ref>
                                    <ref id="ref21">
                        <label>21</label>
                        <mixed-citation publication-type="journal">T. A. H. Almeida and Y. A. Jos&#039;e Maria G, &quot;Contributions to the study of SMS spam filtering: new collection and results,&quot; in Association for Computing Machinery, New York, NY, USA, 2011.</mixed-citation>
                    </ref>
                                    <ref id="ref22">
                        <label>22</label>
                        <mixed-citation publication-type="journal">Bashar and S. Yerima, &quot;Semi-supervised novelty detection with one class SVM for SMS spam detection,&quot; 2022 29th International Conference on Systems, Signals and Image Processing (IWSSIP), Vols. CFP2255E-ART, pp. 1-4, 2022.</mixed-citation>
                    </ref>
                                    <ref id="ref23">
                        <label>23</label>
                        <mixed-citation publication-type="journal">S. Gadde, A. Lakshmanarao and S. Satyanarayana, &quot;SMS spam detection using machine learning and deep learning techniques,&quot; 2021 7th international conference on advanced computing and communication systems (ICACCS), vol. 1, pp. 358-362, 2021.</mixed-citation>
                    </ref>
                                    <ref id="ref24">
                        <label>24</label>
                        <mixed-citation publication-type="journal">E. W. Nabiilah, G. Noveliora, K. D. Utami, Rojali and G. Zain, &quot;Spam Detection in Short Message Service (SMS) Using Naïve Bayes, SVM, LSTM, and CNN,&quot; 2023 10th International Conference on Information Technology, Computer, and Electrical Engineering (ICITACEE), pp. 431-436, 2023.</mixed-citation>
                    </ref>
                                    <ref id="ref25">
                        <label>25</label>
                        <mixed-citation publication-type="journal">P. K. Roy, J. P. Singh and S. Banerjee, &quot;Deep learning to filter SMS Spam,&quot; Future Generation Computer Systems, vol. 102, pp. 524-533, 2020.</mixed-citation>
                    </ref>
                                    <ref id="ref26">
                        <label>26</label>
                        <mixed-citation publication-type="journal">S. Yadav and A., &quot;Mobile SMS Spam Filtering for Nepali Text Using Naïve Bayesian and Support Vector Machine,&quot; International Journal of Intelligent Systems, vol. 04, pp. 24-28, 2014.</mixed-citation>
                    </ref>
                            </ref-list>
                    </back>
    </article>
