<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20241031//EN"
        "https://jats.nlm.nih.gov/publishing/1.4/JATS-journalpublishing1-4.dtd">
<article  article-type="research-article"        dtd-version="1.4">
            <front>

                <journal-meta>
                                                                <journal-id>saucis</journal-id>
            <journal-title-group>
                                                                                    <journal-title>Sakarya University Journal of Computer and Information Sciences</journal-title>
            </journal-title-group>
                                        <issn pub-type="epub">2636-8129</issn>
                                                                                            <publisher>
                    <publisher-name>Sakarya University</publisher-name>
                </publisher>
                    </journal-meta>
                <article-meta>
                                        <article-id pub-id-type="doi">10.35377/saucis...1579599</article-id>
                                                                <article-categories>
                                            <subj-group  xml:lang="en">
                                                            <subject>Computer Software</subject>
                                                    </subj-group>
                                            <subj-group  xml:lang="tr">
                                                            <subject>Bilgisayar Yazılımı</subject>
                                                    </subj-group>
                                    </article-categories>
                                                                                                                                                        <title-group>
                                                                                                                                                            <article-title>Enhanced Oil and Gas Production Forecasting Through Stacked Generalization Ensemble Learning Technique</article-title>
                                                                                                    </title-group>
            
                                                    <contrib-group content-type="authors">
                                                                        <contrib contrib-type="author">
                                                                    <contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-1220-0558</contrib-id>
                                                                <name>
                                    <surname>Çit</surname>
                                    <given-names>Gülüzar</given-names>
                                </name>
                                                                    <aff>SAKARYA ÜNİVERSİTESİ</aff>
                                                            </contrib>
                                                    <contrib contrib-type="author">
                                                                    <contrib-id contrib-id-type="orcid">https://orcid.org/0009-0002-6214-5179</contrib-id>
                                                                <name>
                                    <surname>Alyahya</surname>
                                    <given-names>Azhar</given-names>
                                </name>
                                                            </contrib>
                                                                                </contrib-group>
                        
                                        <pub-date pub-type="pub" iso-8601-date="20250630">
                    <day>30</day>
                    <month>06</month>
                    <year>2025</year>
                </pub-date>
                                        <volume>8</volume>
                                        <issue>2</issue>
                                        <fpage>212</fpage>
                                        <lpage>222</lpage>
                        
                        <history>
                                    <date date-type="received" iso-8601-date="20241105">
                        <day>05</day>
                        <month>11</month>
                        <year>2024</year>
                    </date>
                                                    <date date-type="accepted" iso-8601-date="20250421">
                        <day>21</day>
                        <month>04</month>
                        <year>2025</year>
                    </date>
                            </history>
                                        <permissions>
                    <copyright-statement>Copyright © 2018, Sakarya University Journal of Computer and Information Sciences</copyright-statement>
                    <copyright-year>2018</copyright-year>
                    <copyright-holder>Sakarya University Journal of Computer and Information Sciences</copyright-holder>
                </permissions>
            
                                                                                                                        <abstract><p>Strategic planning in the oil and gas sector depends on production forecasting. Accurate forecasts help estimate future output rates, streamline operations, and allocate resources effectively. Traditional techniques such as Decline Curve Analysis (DCA) and Numerical Reservoir Simulation (NRS) have drawbacks, including reliance on static models and being time-consuming. This work presents a stacked generalization ensemble learning method for predicting oil and gas production. Built in Python using data from wells in New York State, the model combines four machine learning techniques: Random Forest Regressor (RFR), Extremely Randomized Trees Regressor (ETR), K-Nearest Neighbors (KNN), and Gradient Boosting Regressor (GBR). In the experiments, the stacked model outperforms the individual models, achieving R² scores of 0.9709 for oil and 0.9998 for gas.</p></abstract>
                                                            
            
                                                                                        <kwd-group>
                                                    <kwd>Machine learning models</kwd>
                                                    <kwd>Random Forest Regressor</kwd>
                                                    <kwd>Extremely Randomized Trees Regressor</kwd>
                                                    <kwd>K-Nearest Neighbors</kwd>
                                                    <kwd>Gradient Boosting Regressor</kwd>
                                                    <kwd>Stacking model</kwd>
                                            </kwd-group>
                            
                                                                                                                                                <funding-group specific-use="FundRef">
                    <award-group>
                                                    <funding-source>
                                <named-content content-type="funder_name">Sakarya University</named-content>
                            </funding-source>
                                                                    </award-group>
                </funding-group>
                                </article-meta>
    </front>
    <back>
                            <ref-list>
                                    <ref id="ref1">
                        <label>1</label>
                        <mixed-citation publication-type="journal">British Petroleum, &quot;Statistical Review of World Energy,&quot; BP Global, 2021. [Online]. Available: https://www.bp.com</mixed-citation>
                    </ref>
                                    <ref id="ref2">
                        <label>2</label>
                        <mixed-citation publication-type="journal">J. G. Speight, Handbook of Petroleum Refining. 2014. [Online]. Available: https://www.academia.edu/63659108/Handbook_of_Petroleum_Refining</mixed-citation>
                    </ref>
                                    <ref id="ref3">
                        <label>3</label>
                        <mixed-citation publication-type="journal">D. Orodu, O. F. Aworinde, and A. F. Alayande, &quot;A hybrid machine learning framework for enhanced reservoir characterization,&quot; J. Petroleum Sci. Eng., vol. 207, p. 109114, 2021, doi: 10.1016/j.petrol.2021.109114.</mixed-citation>
                    </ref>
                                    <ref id="ref4">
                        <label>4</label>
                        <mixed-citation publication-type="journal">A. F. Khan and S. R. Alam, &quot;Adaptive Neuro-Fuzzy Inference System with metaheuristic tuning for petroleum production forecasting,&quot; Applied Soft Computing, vol. 114, p. 108050, 2022, doi: 10.1016/j.asoc.2021.108050.</mixed-citation>
                    </ref>
                                    <ref id="ref5">
                        <label>5</label>
                        <mixed-citation publication-type="journal">M. A. Ullah, S. M. Khaleque, and S. Sikder, &quot;Prediction of oil production using optimized machine learning models,&quot; Energies, vol. 14, no. 16, p. 4923, 2021, doi: 10.3390/en14164923.</mixed-citation>
                    </ref>
                                    <ref id="ref6">
                        <label>6</label>
                        <mixed-citation publication-type="journal">M. J. Fetkovich, &quot;Decline Curve Analysis Using Type Curves,&quot; J. Petroleum Technol., vol. 32, no. 6, pp. 1065–1077, 1980.</mixed-citation>
                    </ref>
                                    <ref id="ref7">
                        <label>7</label>
                        <mixed-citation publication-type="journal">M. J. Abhishek and V. Kumar, &quot;Gradient boosting regression tree model for enhanced oil production prediction,&quot; Processes, vol. 10, no. 2, p. 234, 2022, doi: 10.3390/pr10020234.</mixed-citation>
                    </ref>
                                    <ref id="ref8">
                        <label>8</label>
                        <mixed-citation publication-type="journal">K. M. Ali and J. Zhang, &quot;Application of metaheuristic optimization algorithms for predictive analysis in petroleum engineering,&quot; J. Petroleum Exploration Production Technol., vol. 12, no. 5, pp. 1325–1335, 2022, doi: 10.1007/s13202-021-01402-w.</mixed-citation>
                    </ref>
                                    <ref id="ref9">
                        <label>9</label>
                        <mixed-citation publication-type="journal">C. S. W. Ng, A. J. Ghahfarokhi, and M. N. Amar, &quot;Well production forecast in Volve field: Application of rigorous machine learning techniques and metaheuristic algorithm,&quot; J. Petroleum Sci. Eng., vol. 208, p. 109468, 2022, doi: 10.1016/j.petrol.2021.109468.</mixed-citation>
                    </ref>
                                    <ref id="ref10">
                        <label>10</label>
                        <mixed-citation publication-type="journal">S. D. Mohaghegh, &quot;Machine Learning Applications in Reservoir Engineering: Part 1,&quot; J. Petroleum Technol., vol. 69, no. 6, pp. 70–77, 2017, doi: 10.2118/0617-0070-JPT.</mixed-citation>
                    </ref>
                                    <ref id="ref11">
                        <label>11</label>
                        <mixed-citation publication-type="journal">J. X. Chen, H. L. Wang, and K. Zhao, &quot;Comparative evaluation of machine learning techniques for hydrocarbon reservoir prediction,&quot; Energies, vol. 14, no. 3, p. 806, 2021, doi: 10.3390/en14030806.</mixed-citation>
                    </ref>
                                    <ref id="ref12">
                        <label>12</label>
                        <mixed-citation publication-type="journal">A. S. Abou-Sayed, &quot;AI in the Petroleum Industry,&quot; Society of Petroleum Engineers AI Newsletter, 2021. [Online]. Available: https://www.spe.org</mixed-citation>
                    </ref>
                                    <ref id="ref13">
                        <label>13</label>
                        <mixed-citation publication-type="journal">S. Russell and P. Norvig, Artificial Intelligence: A Modern Approach, 4th ed. Pearson, 2021.</mixed-citation>
                    </ref>
                                    <ref id="ref14">
                        <label>14</label>
                        <mixed-citation publication-type="journal">I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning. MIT Press, 2016. [Online]. Available: http://www.deeplearningbook.org</mixed-citation>
                    </ref>
                                    <ref id="ref15">
                        <label>15</label>
                        <mixed-citation publication-type="journal">M. Kim, &quot;Deep Learning-Based Prediction of the Cumulative Gas Production of the Montney Formation, Canada,&quot; GeoConvention, 2020. [Online]. Available: https://geoconvention.com/wp-content/uploads/abstracts/2020/57980-deep-learning-based-prediction-of-the-cumulative-g.pdf</mixed-citation>
                    </ref>
                                    <ref id="ref16">
                        <label>16</label>
                        <mixed-citation publication-type="journal">M. S. Zanjani, M. A. Salam, and O. Kandara, &quot;Data-Driven Hydrocarbon Production Forecasting Using Machine Learning Techniques,&quot; Int. J. Comput. Sci. Inf. Security, vol. 18, no. 6, pp. 65–72, 2020.</mixed-citation>
                    </ref>
                                    <ref id="ref17">
                        <label>17</label>
                        <mixed-citation publication-type="journal">C. Tan et al., &quot;Fracturing productivity prediction model and optimization of the operation parameters of shale gas well based on machine learning,&quot; Lithosphere, vol. 2021, no. Special 4, p. 2884679, 2021, doi: 10.2113/2021/2884679.</mixed-citation>
                    </ref>
                                    <ref id="ref18">
                        <label>18</label>
                        <mixed-citation publication-type="journal">G. Hui, S. Chen, Y. He, H. Wang, and F. Gu, &quot;Machine learning-based production forecast for shale gas in unconventional reservoirs via integration of geological and operational factors,&quot; J. Natural Gas Sci. Eng., vol. 94, p. 104045, 2021, doi: 10.1016/j.jngse.2021.104045.</mixed-citation>
                    </ref>
                                    <ref id="ref19">
                        <label>19</label>
                        <mixed-citation publication-type="journal">N. M. Ibrahim et al., &quot;Well Performance Classification and Prediction: Deep Learning and Machine Learning Long Term Regression Experiments on Oil, Gas, and Water Production,&quot; Sensors, vol. 22, no. 14, p. 5326, 2022, doi: 10.3390/s22145326.</mixed-citation>
                    </ref>
                                    <ref id="ref20">
                        <label>20</label>
                        <mixed-citation publication-type="journal">S. Hosseini and T. Akilan, &quot;Advanced Deep Regression Models for Forecasting Time Series Oil Production,&quot; arXiv preprint arXiv:2308.16105, 2023.</mixed-citation>
                    </ref>
                                    <ref id="ref21">
                        <label>21</label>
                        <mixed-citation publication-type="journal">L. Song, C. Wang, C. Lu, S. Yang, and C. Tan, &quot;Machine Learning Model of Oilfield Productivity Prediction and Performance Evaluation,&quot; J. Physics: Conference Series, vol. 2468, no. 1, p. 012084, 2022, doi: 10.1088/1742-6596/2468/1/012084.</mixed-citation>
                    </ref>
                                    <ref id="ref22">
                        <label>22</label>
                        <mixed-citation publication-type="journal">N. Liu, H. Gao, Z. Zhao, Y. Hu, and L. Duan, &quot;A stacked generalization ensemble model for optimization and prediction of the gas well rate of penetration: a case study in Xinjiang,&quot; J. Petroleum Exploration Production Technol., vol. 11, pp. 3533–3546, 2021, doi: 10.1007/s13202-021-01402-z.</mixed-citation>
                    </ref>
                                    <ref id="ref23">
                        <label>23</label>
                        <mixed-citation publication-type="journal">F. Ye, X. Li, N. Zhang, and F. Xu, &quot;Prediction of Single-Well Production Rate after Hydraulic Fracturing in Unconventional Gas Reservoirs Based on Ensemble Learning Model,&quot; Processes, vol. 12, no. 6, p. 1194, 2024, doi: 10.3390/pr12061194.</mixed-citation>
                    </ref>
                                    <ref id="ref24">
                        <label>24</label>
                        <mixed-citation publication-type="journal">S. Ray, &quot;A quick review of machine learning algorithms,&quot; in Proc. Int. Conf. Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon), 2019, pp. 35–39, doi: 10.1109/comitcon.2019.8862451.</mixed-citation>
                    </ref>
                                    <ref id="ref25">
                        <label>25</label>
                        <mixed-citation publication-type="journal">M. I. Jordan and T. M. Mitchell, &quot;Machine learning: Trends, perspectives, and prospects,&quot; Science, vol. 349, no. 6245, pp. 255–260, 2015, doi: 10.1126/science.aaa8415.</mixed-citation>
                    </ref>
                                    <ref id="ref26">
                        <label>26</label>
                        <mixed-citation publication-type="journal">L. Breiman, &quot;Random Forests,&quot; Machine Learning, vol. 45, no. 1, pp. 5–32, 2001, doi: 10.1023/A:1010933404324.</mixed-citation>
                    </ref>
                                    <ref id="ref27">
                        <label>27</label>
                        <mixed-citation publication-type="journal">M. Y. Khan, &quot;Automated prediction of Good Dictionary EXamples (GDEX): a comprehensive experiment with distant supervision, machine learning, and word embedding-based deep learning techniques,&quot; Complexity, vol. 2021, pp. 1–18, 2021, doi: 10.1155/2021/2553199.</mixed-citation>
                    </ref>
                                    <ref id="ref28">
                        <label>28</label>
                        <mixed-citation publication-type="journal">L. Breiman, &quot;Bagging Predictors,&quot; Machine Learning, vol. 24, no. 2, pp. 123–140, 1996, doi: 10.1007/BF00058655.</mixed-citation>
                    </ref>
                                    <ref id="ref29">
                        <label>29</label>
                        <mixed-citation publication-type="journal">A. K. Ali and A. M. Abdullah, &quot;Fake accounts detection on social media using stack ensemble system,&quot; Int. J. Electrical Comput. Eng., vol. 12, no. 3, pp. 3013–3022, 2022.</mixed-citation>
                    </ref>
                                    <ref id="ref30">
                        <label>30</label>
                        <mixed-citation publication-type="journal">S. P. Rao and A. V. K. Shetty, &quot;Random forest-based predictive models for enhanced fluid flow estimation in pipelines,&quot; J. Petroleum Sci. Eng., vol. 199, p. 108382, 2021, doi: 10.1016/j.petrol.2021.108382.</mixed-citation>
                    </ref>
                                    <ref id="ref31">
                        <label>31</label>
                        <mixed-citation publication-type="journal">P. Geurts, D. Ernst, and L. Wehenkel, &quot;Extremely randomized trees,&quot; Machine Learning, vol. 63, no. 1, pp. 3–42, 2006, doi: 10.1007/s10994-006-6226-1.</mixed-citation>
                    </ref>
                                    <ref id="ref32">
                        <label>32</label>
                        <mixed-citation publication-type="journal">T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd ed. Springer, 2009.</mixed-citation>
                    </ref>
                                    <ref id="ref33">
                        <label>33</label>
                        <mixed-citation publication-type="journal">T. Aziz and M. R. Camana, &quot;REM-Based Indoor Localization with an Extra-Trees Regressor,&quot; Electronics, vol. 12, no. 20, p. 4350, 2023, doi: 10.3390/electronics12204350.</mixed-citation>
                    </ref>
                                    <ref id="ref34">
                        <label>34</label>
                        <mixed-citation publication-type="journal">R. K. Halder, &quot;Enhancing K-nearest neighbor algorithm: a comprehensive review and performance analysis of modifications,&quot; J. Big Data, vol. 11, no. 1, 2024, doi: 10.1186/s40537-024-00973-y.</mixed-citation>
                    </ref>
                                    <ref id="ref35">
                        <label>35</label>
                        <mixed-citation publication-type="journal">T. Timbers, T. Campbell, and M. Lee, &quot;Chapter 7 Regression I: K-nearest neighbors,&quot; in Data Science: A First Introduction, CRC Press, 2022. [Online]. Available: https://datasciencebook.ca/regression1.html</mixed-citation>
                    </ref>
                                    <ref id="ref36">
                        <label>36</label>
                        <mixed-citation publication-type="journal">C. Gkerekos, I. Lazakis, and G. Theotokatos, &quot;Machine learning models for predicting ship main engine Fuel Oil Consumption: A comparative study,&quot; Ocean Eng., vol. 188, p. 106282, 2019, doi: 10.1016/j.oceaneng.2019.106282.</mixed-citation>
                    </ref>
                                    <ref id="ref37">
                        <label>37</label>
                        <mixed-citation publication-type="journal">J. H. Friedman, &quot;Greedy Function Approximation: A Gradient Boosting Machine,&quot; The Annals of Statistics, vol. 29, no. 5, pp. 1189–1232, 2001.</mixed-citation>
                    </ref>
                                    <ref id="ref38">
                        <label>38</label>
                        <mixed-citation publication-type="journal">A. Ali, &quot;Gradient Boosting Machine Learning Algorithm,&quot; Dec. 2023, doi: 10.13140/RG.2.2.31609.65123.</mixed-citation>
                    </ref>
                                    <ref id="ref39">
                        <label>39</label>
                        <mixed-citation publication-type="journal">M. Kalirane, &quot;Ensemble Learning in Machine Learning: Bagging, Boosting and Stacking,&quot; Analytics Vidhya, Jan. 2024. [Online]. Available: https://www.analyticsvidhya.com/blog/2023/01/ensemble-learning-methods-bagging-boosting-and-stacking/</mixed-citation>
                    </ref>
                                    <ref id="ref40">
                        <label>40</label>
                        <mixed-citation publication-type="journal">&quot;Oil and Gas Annual Production: Beginning 2001,&quot; Data.gov. [Online]. Available: https://catalog.data.gov/dataset/oil-and-gas-annual-production-beginning-2001</mixed-citation>
                    </ref>
                            </ref-list>
                    </back>
    </article>
