<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20241031//EN"
        "https://jats.nlm.nih.gov/publishing/1.4/JATS-journalpublishing1-4.dtd">
<article  article-type="research-article"        dtd-version="1.4">
            <front>

                <journal-meta>
                                                                <journal-id>jnrs</journal-id>
            <journal-title-group>
                                                                                    <journal-title>Journal of New Results in Science</journal-title>
            </journal-title-group>
                                        <issn pub-type="epub">1304-7981</issn>
                                                                                            <publisher>
                    <publisher-name>Tokat Gaziosmanpasa University</publisher-name>
                </publisher>
                    </journal-meta>
                <article-meta>
                                        <article-id pub-id-type="doi">10.54187/jnrs.1129440</article-id>
                                                                <article-categories>
                                            <subj-group  xml:lang="en">
                                                            <subject>Statistics</subject>
                                                    </subj-group>
                                            <subj-group  xml:lang="tr">
                                                            <subject>İstatistik</subject>
                                                    </subj-group>
                                    </article-categories>
                                                                                                                                                        <title-group>
                                                                                                                                                            <article-title>A novel data processing approach to detect fraudulent insurance claims for physical damage to cars</article-title>
                                                                                                    </title-group>
            
                                                    <contrib-group content-type="authors">
                                                                        <contrib contrib-type="author">
                                                                    <contrib-id contrib-id-type="orcid">
                                        https://orcid.org/0000-0002-2364-9449</contrib-id>
                                                                <name>
                                    <surname>Yücel</surname>
                                    <given-names>Ahmet</given-names>
                                </name>
                                                                    <aff>ANKARA YILDIRIM BEYAZIT ÜNİVERSİTESİ</aff>
                                                            </contrib>
                                                                                </contrib-group>
                        
                                        <pub-date pub-type="pub" iso-8601-date="20220831">
                    <day>08</day>
                    <month>31</month>
                    <year>2022</year>
                </pub-date>
                                        <volume>11</volume>
                                        <issue>2</issue>
                                        <fpage>120</fpage>
                                        <lpage>131</lpage>
                        
                        <history>
                                    <date date-type="received" iso-8601-date="20220611">
                        <day>06</day>
                        <month>11</month>
                        <year>2022</year>
                    </date>
                                                    <date date-type="accepted" iso-8601-date="20220822">
                        <day>08</day>
                        <month>22</month>
                        <year>2022</year>
                    </date>
                            </history>
                                        <permissions>
                    <copyright-statement>Copyright © 2012, Journal of New Results in Science</copyright-statement>
                    <copyright-year>2012</copyright-year>
                    <copyright-holder>Journal of New Results in Science</copyright-holder>
                </permissions>
            
                                                                                                                        <abstract><p>Some automobile insurance companies use computerized auto-detection systems to expedite claims payment decisions for insured vehicles. Claims suspected of fraud are evaluated using empirical data from previously investigated claims. The main objective of this manuscript is to demonstrate a novel data processing system and its potential for use in data classification. The data processing approach was used to develop a machine learning-based sentiment classification model to describe property damage fraud in vehicle accidents and the indicators of fraudulent claims. To this end, Singular Value Decomposition-based components and correlation-based composite variables were created. Machine learning models were then developed, with predictors and composite variables selected based on standard feature selection procedures. Five machine learning models were used: Boosted Trees, Classification and Regression Trees, Random Forests, Artificial Neural Networks, and Support Vector Machines. For all models, the models with composite variables achieved higher accuracy rates, and among these models, the artificial neural network was the model with the highest accuracy performance at 76.56%.</p></abstract>
                                                            
            
                                                                                        <kwd-group>
                                                    <kwd>Artificial neural network</kwd>
                                                    <kwd>  Tree-based decision systems</kwd>
                                                    <kwd>  Support vector machines</kwd>
                                                    <kwd>  Singular value decomposition</kwd>
                                                    <kwd>  Data processing</kwd>
                                                    <kwd>  Natural language
processing</kwd>
                                            </kwd-group>
                            
                                                                                                                                                    </article-meta>
    </front>
    <back>
                            <ref-list>
                                    <ref id="ref1">
                        <label>1</label>
                        <mixed-citation publication-type="journal">S. Viaene, M. Ayuso, M. Guillen, D. V. Gheel, G.  Dedene, Strategies for detecting fraudulent claims in the automobile insurance industry, European Journal of Operational Research, 176(1), (2007) 565–583.</mixed-citation>
                    </ref>
                                    <ref id="ref2">
                        <label>2</label>
                        <mixed-citation publication-type="journal">T. Baldock, Insurance fraud. Australian Institute of Criminology: Trends and issues in crime and criminal justice, 66, (1997).</mixed-citation>
                    </ref>
                                    <ref id="ref3">
                        <label>3</label>
                        <mixed-citation publication-type="journal">I. Akomea-Frimpong, C. Andoh, E. Ofosu-Hene, Causes, effects and deterrence of insurance fraud: evidence from Ghana, Journal of Financial Crime, 23(4), (2016) 678–699.</mixed-citation>
                    </ref>
                                    <ref id="ref4">
                        <label>4</label>
                        <mixed-citation publication-type="journal">G. Baader, H. Krcmar, Reducing false positives in fraud detection: Combining the red flag approach with process mining, International Journal of Accounting Information Systems, 31, (2018) 1–16.</mixed-citation>
                    </ref>
                                    <ref id="ref5">
                        <label>5</label>
                        <mixed-citation publication-type="journal">J. Nahr, H. Nozari, M. E. Sadeghi, Artificial intelligence and machine learning for real-world problems (A survey), International journal of innovation in Engineering, 1(3), (2021) 38–47.</mixed-citation>
                    </ref>
                                    <ref id="ref6">
                        <label>6</label>
                        <mixed-citation publication-type="journal">H. Ma, Y. Wang, K. Wang, Automatic detection of false positive RFID readings using machine learning algorithms, Expert Systems with Applications, 91, (2018) 442–451.</mixed-citation>
                    </ref>
                                    <ref id="ref7">
                        <label>7</label>
                        <mixed-citation publication-type="journal">S. Chand, Y. Zhang, Learning from machines to close the gap between funding and expenditure in the Australian National Disability Insurance Scheme, International Journal of Information Management Data Insights, 2(1), (2022) 1–15.</mixed-citation>
                    </ref>
                                    <ref id="ref8">
                        <label>8</label>
                        <mixed-citation publication-type="journal">M. K. Mishra, R. Dash, A comparative study of Chebyshev functional link artificial neural network, multi-layer perceptron and decision tree for credit card fraud detection, in: S. P. Mohanty, R. K. Patnaik, M. Gomathisankaran, B. S. Panda (Eds.) International Conference on Information Technology 2014, Bhubaneswar, India, 2014, pp. 228–233.</mixed-citation>
                    </ref>
                                    <ref id="ref9">
                        <label>9</label>
                        <mixed-citation publication-type="journal">G. van Capelleveen, M. Poel, R. M. Mueller, D. Thornton, J. van Hillegersberg, Outlier detection in healthcare fraud: A case study in the Medicaid dental domain, International Journal of Accounting Information Systems, 21, (2016) 18–31.</mixed-citation>
                    </ref>
                                    <ref id="ref10">
                        <label>10</label>
                        <mixed-citation publication-type="journal">L. Sabetti, R. Heijmans, Shallow or deep? Training an autoencoder to detect anomalous flows in a retail payment system, Latin American Journal of Central Banking, 2(2), (2021) 1–14.</mixed-citation>
                    </ref>
                                    <ref id="ref11">
                        <label>11</label>
                        <mixed-citation publication-type="journal">J. Jiang, P. Trundle, J. Ren, Medical image analysis with artificial neural networks, Computerized Medical Imaging and Graphics, 34(8), (2010) 617–631.</mixed-citation>
                    </ref>
                                    <ref id="ref12">
                        <label>12</label>
                        <mixed-citation publication-type="journal">A. Ansari, A. Riasi, Modelling and evaluating customer loyalty using neural networks: Evidence from startup insurance companies, Future Business Journal, 2(1), (2016) 15–30.</mixed-citation>
                    </ref>
                                    <ref id="ref13">
                        <label>13</label>
                        <mixed-citation publication-type="journal">N. K. Frempong, N. Nicholas, M. A. Boateng, Decision tree as a predictive modeling tool for auto insurance claims, International Journal of Statistics and Applications, 7(2), (2017) 117–120.</mixed-citation>
                    </ref>
                                    <ref id="ref14">
                        <label>14</label>
                        <mixed-citation publication-type="journal">N. K. Gyamfi, J. D. Abdulai, Bank fraud detection using support vector machine, in: V. Leung, S. Vuong, S. Chakrabarti (Eds.), IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON) 2018, Vancouver, BC, Canada, 2018, pp. 37–41.</mixed-citation>
                    </ref>
                                    <ref id="ref15">
                        <label>15</label>
                        <mixed-citation publication-type="journal">E. Badr, S. Almotairi, M. A. Salam, H. Ahmed, New sequential and parallel support vector machine with grey wolf optimizer for breast cancer diagnosis. Alexandria Engineering Journal, 61(3), (2022) 2520–2534.</mixed-citation>
                    </ref>
                                    <ref id="ref16">
                        <label>16</label>
                        <mixed-citation publication-type="journal">G. Tolan, T. Abou-El-Enien, M. Khorshid, A comparison among support vector machine and other machine learning classification algorithms, IPASJ International Journal of Computer Science (IIJCS), 3(5), (2015) 25–35.</mixed-citation>
                    </ref>
                                    <ref id="ref17">
                        <label>17</label>
                        <mixed-citation publication-type="journal">A. Kao, S. R. Poteet, Natural language processing and text mining, Springer Publishing Company, 2006.</mixed-citation>
                    </ref>
                                    <ref id="ref18">
                        <label>18</label>
                        <mixed-citation publication-type="journal">N. Chintalapudi, G. Battineni, M. D. Canio, G. G. Sagaro, F. Amenta, Text mining with sentiment analysis on seafarers’ medical documents, International Journal of Information Management Data Insights, 1(1), (2021) 1–9.</mixed-citation>
                    </ref>
                                    <ref id="ref19">
                        <label>19</label>
                        <mixed-citation publication-type="journal">R. Alfrjani, T. Osman, G. Cosma, A hybrid semantic knowledgebase-machine learning approach for opinion mining, Data and Knowledge Engineering, 121, (2019) 88–108.</mixed-citation>
                    </ref>
                                    <ref id="ref20">
                        <label>20</label>
                        <mixed-citation publication-type="journal">E. Teso, M. Olmedilla, M. Martínez-Torres, S. Toral, Application of text mining techniques to the analysis of discourse in eWOM communications from a gender perspective, Technological Forecasting and Social Change, 129, (2018) 131–142.</mixed-citation>
                    </ref>
                                    <ref id="ref21">
                        <label>21</label>
                        <mixed-citation publication-type="journal">O. Rouane, H. Belhadef, M. Bouakkaz, Combine clustering and frequent itemsets mining to enhance biomedical text summarisation, Expert Systems with Applications, 135, (2019) 362–373.</mixed-citation>
                    </ref>
                                    <ref id="ref22">
                        <label>22</label>
                        <mixed-citation publication-type="journal">Y. Zhang, A. Hu, J. Wang, Y.  Zhang, Detection of fraud statement based on word vector: Evidence from financial companies in China, Finance Research Letters, 46, (2022) 1–7.</mixed-citation>
                    </ref>
                                    <ref id="ref23">
                        <label>23</label>
                        <mixed-citation publication-type="journal">S. Fu, C. C. Wyles, D. R. Osmon, M. L. Carvour, E. Sagheb, T. Ramazanian, H. M. Kremers, Automated detection of periprosthetic joint infections and data elements using natural language processing, The Journal of Arthroplasty, 36(2), (2021) 688–692.</mixed-citation>
                    </ref>
                                    <ref id="ref24">
                        <label>24</label>
                        <mixed-citation publication-type="journal">V. Nourani, M. Sayyah-Fard, M. T. Alami, E. Sharghi, Data pre-processing effect on ANN-based prediction intervals construction of the evaporation process at different climate regions in Iran, Journal of Hydrology, 588, (2020) 1–15.</mixed-citation>
                    </ref>
                                    <ref id="ref25">
                        <label>25</label>
                        <mixed-citation publication-type="journal">W. Zhang, T. Liu, L. Ye, M. Ueland, S. L. Forbes, S. W. Su, A novel data pre-processing method for odour detection and identification system, Sensors and Actuators A: Physical, 287, (2019) 113–120.</mixed-citation>
                    </ref>
                                    <ref id="ref26">
                        <label>26</label>
                        <mixed-citation publication-type="journal">C. Chilipirea, A. C. Petre, L. M. Groza, C. Dobre, F. Pop, An integrated architecture for future studies in data processing for smart cities, Microprocessors and Microsystems, 52, (2017) 335–342.</mixed-citation>
                    </ref>
                                    <ref id="ref27">
                        <label>27</label>
                        <mixed-citation publication-type="journal">M. Hanafy, R. Ming, Using machine learning models to compare various resampling methods in predicting insurance fraud, Journal of Theoretical and Applied Information Technology, 99(12), (2021), 2819–2833.</mixed-citation>
                    </ref>
                                    <ref id="ref28">
                        <label>28</label>
                        <mixed-citation publication-type="journal">M. K. Severino, Y. Peng, Machine learning algorithms for fraud prediction in property insurance: Empirical evidence using real-world microdata, Machine Learning with Applications, 5, (2021) 1–14.</mixed-citation>
                    </ref>
                                    <ref id="ref29">
                        <label>29</label>
                        <mixed-citation publication-type="journal">R. Roy, K. T. George, Detecting insurance claims fraud using machine learning techniques, in: K. P. Isaac, A. Rahiman, G. P. Padmakumar (Eds.), International Conference on Circuit, Power and Computing Technologies (ICCPCT) 2017, Kollam, India, 2017, pp. 1–6.</mixed-citation>
                    </ref>
                                    <ref id="ref30">
                        <label>30</label>
                        <mixed-citation publication-type="journal">G. Miner, D. Delen, J. Elder, A. Fast, T. Hill, R. A. Nisbet, Conceptual foundations of text mining and pre-processing steps, practical text mining and statistical analysis for non-structured text data applications, Academic Press. (2012) 43–51.</mixed-citation>
                    </ref>
                                    <ref id="ref31">
                        <label>31</label>
                        <mixed-citation publication-type="journal">A. K. Menon, C. Elkan, Fast algorithms for approximating the singular value decomposition, ACM Transactions on Knowledge Discovery from Data, 5(2), (2011) 1–36.</mixed-citation>
                    </ref>
                                    <ref id="ref32">
                        <label>32</label>
                        <mixed-citation publication-type="journal">TIBCO product documentation, Data Science Textbook, https://docs.tibco.com/data-science/GUID-4C6F72C1-F4F8-48A9-83C7-D4C72A66A3AC.html (Accessed on 14.08.2022)</mixed-citation>
                    </ref>
                                    <ref id="ref33">
                        <label>33</label>
                        <mixed-citation publication-type="journal">C. Peña-Bautista, T. Durand, C. Oger, M. Baquero, M. Vento, C. Cháfer-Pericás, Assessment of lipid peroxidation and artificial neural network models in early Alzheimer disease diagnosis, Clinical Biochemistry, 72, (2019) 64–70.</mixed-citation>
                    </ref>
                                    <ref id="ref34">
                        <label>34</label>
                        <mixed-citation publication-type="journal">R. Azadnia, K. Kheiralipour, Recognition of leaves of different medicinal plant species using a robust image processing algorithm and artificial neural networks classifier, Journal of Applied Research on Medicinal and Aromatic Plants, 25, (2021) 1–10.</mixed-citation>
                    </ref>
                                    <ref id="ref35">
                        <label>35</label>
                        <mixed-citation publication-type="journal">C. Li, R. Chen, C. Moutafis, S. Furber, Robustness to noisy synaptic weights in spiking neural networks, in: A. Roy (Ed.), International Joint Conference on Neural Networks (IJCNN) 2020, Glasgow, UK, 2020, pp. 1–8.</mixed-citation>
                    </ref>
                            </ref-list>
                    </back>
    </article>
