Research Article
BibTex RIS Cite

Enhancing Crop Recommendation with SMOTE-Augmented Machine Learning Algorithms

Year 2025, Volume: 9 Issue: 4, 612 - 620

Abstract

AAs the agricultural landscape continues to evolve, the synergy between the crop recommendation and machine learning (ML) innovations holds the potential to revolutionize the way farmers make decisions, ultimately driving sustainable and efficient food production. This study investigates the efficacy of various machine learning algorithms for crop recommendation, a crucial aspect of precision agriculture. Six prominent algorithms are evaluated utilizing the Crop Recommendation Dataset, those are: Attentive Interpretable Tabular Learning, Extreme Gradient Boosting (XGBoost), K-Nearest Neighbors, Decision Tree, LightGBM, and Random Forest (RF). Synthetic Minority Oversampling Technique (SMOTE) is applied to each algorithm to increase the number of samples in the dataset. When datasets are small, models may not be able to learn enough features to classify problems effectively in real-time situations. Data augmentation, including techniques like SMOTE, appears as a way to surpass this limitation by increasing the amount of available data. Our results demonstrate the effectiveness of these algorithms in accurately predicting suitable crops based on environmental and soil parameters. Notably, XGBoost achieved an accuracy of 0.9909, while RF combined with SMOTE attained the highest accuracy of 0.9981. This superior performance is attributed to SMOTE’s ability to generate a balanced dataset by increasing the number of samples in each class based on the available data, thereby enhancing the model’s predictive capability for crop recommendation. The key contribution of this study is the demonstration that applying SMOTE for data augmentation can improve the predictive accuracy of various machine learning algorithms. This research underscores the potential of machine learning, particularly ensemble methods coupled with oversampling techniques, in driving data-driven agricultural practices for enhanced crop yield and resource optimization.

References

  • Ahmed, A. A., & Reddy, G. H. (2021). A mobile-based system for detecting plant leaf diseases using deep learning. AgriEngineering, 3(3), 478-493.
  • Linaza, M. T., Posada, J., Bund, J., Eisert, P., Quartulli, M., Döllner, J., ... & Lucat, L. (2021). Data-driven artificial intelligence applications for sustainable precision agriculture. Agronomy, 11(6), 1227.
  • Mema, B., Basholli, F., & Hyka, D. (2024). Learning transformation and virtual interaction through ChatGPT in Albanian higher education. Advanced Engineering Science, 4, 130-140.
  • Kocalar, A. C. (2023). Sinkholes caused by agricultural excess water using and administrative traces of the process. Advanced Engineering Science, 3, 15-20.
  • Wang, W., & Pai, T. W. (2023). Enhancing small tabular clinical trial dataset through hybrid data augmentation: combining SMOTE and WCGAN-GP. Data, 8(9), 135.
  • Wang, H., Yilihamu, Q., Yuan, M., Bai, H., Xu, H., & Wu, J. (2020). Prediction models of soil heavy metal (loid) s concentration for agricultural land in Dongli: A comparison of regression and random forest. Ecological Indicators, 119, 106801.
  • Gavahi, K., Abbaszadeh, P., & Moradkhani, H. (2021). DeepYield: A combined convolutional neural network with long short-term memory for crop yield forecasting. Expert Systems with Applications, 184, 115511.
  • Hariram, V., Godwin John, J., Saravanan, A., Sangeeth Kumar, E., Vinoth Kumar, M., Ramanathan, V., Balachandar, M & Baskar, S. (2025). Optimized Biodiesel Production from Dunaliella Salina, A Unicellular Green Algae through Artificial Neural Network. Turkish Journal of Engineering, 9 (2), 179-188.
  • Demir, S., & Sahin, E. K. (2023). An investigation of feature selection methods for soil liquefaction prediction based on tree-based ensemble algorithms using AdaBoost, gradient boosting, and XGBoost. Neural Computing and Applications, 35(4), 3173-3190.
  • Nwafor, E. O., & Akintayo, F. O. (2024). Predicting trip purposes of households in Makurdi using machine learning: a comparative analysis of decision tree, CatBoost, and XGBoost algorithms. Engineering Applications, 3(3), 260-274.
  • Meghraoui, K., Sebari, I., Bensiali, S., & Ait El Kadi, K. (2022). On behalf of an intelligent approach based on 3D CNN and multimodal remote sensing data for precise crop yield estimation: Case study of wheat in Morocco. Advanced Engineering Science, 2, 118-126.
  • Pajaziti, A., Basholli, F., & Zhaveli, Y. (2023). Identification and classification of fruits through robotic system by using artificial intelligence. Engineering Applications, 2 (2), 154-163.
  • Koşum, M. F., & Özkan, A. O. (2025). Analysis of the effect of environmental data collected through the WSN on plant development by the WEKA program. Turkish Journal of Engineering, 9(2), 202-210.
  • Sofuoğlu, C. İ., & Bırant, D. (2024). Potato plant leaf disease detection using deep learning method. Journal of Agricultural Sciences, 30(1), 153-165.
  • Hyder, U., & Talpur, M. R. H. (2024). Detection of cotton leaf disease with machine learning model. Turkish Journal of Engineering, 8 (2), 380-393
  • Sinap, V. (2024). Comparative analysis of machine learning techniques for credit card fraud detection: Dealing with imbalanced datasets. Turkish Journal of Engineering, 8(2), 196-208.
  • Börekci, A., & Sevli, O. (2024). A classification study for Turkish folk music makam recognition using machine learning with data augmentation techniques. Neural Computing and Applications, 36(4), 1621-1639.
  • Chabalala, Y., Adam, E., & Ali, K. A. (2023). Exploring the effect of balanced and imbalanced multi-class distribution data and sampling techniques on fruit-tree crop classification using different machine learning classifiers. Geomatics, 3(1), 70-92.
  • Chemchem, A., Alin, F., & Krajecki, M. (2019). Combining SMOTE sampling and machine learning for forecasting wheat yields in France. In Proceedings of the 2019 IEEE Second International Conference on Artificial Intelligence and Knowledge Engineering (AIKE), 9-14.
  • Hasmita, S., Nhita, F., Saepudin, D., & Aditsania, A. (2019). Chili commodity price forecasting in bandung regency using the adaptive synthetic sampling (ADASYN) and K-Nearest neighbor (KNN) algorithms. In Proceedings of the 2019 international Conference on Information and Communications Technology (ICOIACT) (pp. 434-438). IEEE.
  • Sovia, N. A., & Wardhani, N. W. S. (2024). Ensemble CNN with ADASYN for multiclass classification on cabbage pests. Barekeng: Jurnal Ilmu Matematika dan Terapan, 18(2), 1237-1248.
  • Taghizadeh‐Mehrjardi, R., Schmidt, K., Eftekhari, K., Behrens, T., Jamshidi, M., Davatgar, N., ... & Scholten, T. (2020). Synthetic resampling strategies and machine learning for digital soil mapping in Iran. European Journal of Soil Science, 71(3), 352-368.
  • Machado, P., Fernandes, B., & Novais, P. (2022). Benchmarking data augmentation techniques for tabular data. In International Conference on Intelligent Data Engineering and Automated Learning, 104-112. Cham: Springer International Publishing.
  • Kaggle. (2022). Crop recommendation dataset. https://www.kaggle.com/datasets/atharvaingle/crop-recommendation-dataset
  • Arik, S. Ö., & Pfister, T. (2021). Tabnet: Attentive interpretable tabular learning. In Proceedings of the AAAI Conference on Artificial Intelligence, 35(8), pp. 6679-6687.
  • Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785-794.
  • Hossain, E., Hossain, M. F., & Rahaman, M. A. (2019). A color and texture based approach for the detection and classification of plant leaf disease using KNN classifier. In Proceedings of the 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE), 1-6.
  • Freund, Y., & Mason, L. (1999). The alternating decision tree learning algorithm. In Proceedings of the International Conference on Machine Learning (ICML) 99, 124-133.
  • Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., ... & Liu, T. Y. (2017). LightGBM: A highly efficient gradient boosting decision tree. Advances in Neural Information Processing Systems, 30.
  • Maza, D., Ojo, J. O., & Akinlade, G. O. (2024). A predictive machine learning framework for diabetes. Turkish Journal of Engineering, 8(3), 583-592.
  • Ok, A. O., Akar, O., & Gungor, O. (2012). Evaluation of random forest method for agricultural crop classification. European Journal of Remote Sensing, 45(1), 421-432.
  • Saha, G., Suvo, S. H., Tonmoy, F. T., Asad, J. B., Imran, M. T., & Azad, A. A. M. (2024). Smart soil monitoring system with crop and fertilizer recommendation features. In Proceedings of the 2024 IEEE 13th International Conference on Communication Systems and Network Technologies (CSNT), 1190-1196.
  • Boykharayevich, F. J. (2024). Studies on the extraction of oil from watermelon seeds. Engineering Applications, 3(2), 106-116.
  • Palakshappa, A., Nanjappa, S. K., Mahadevappa, P., & Sinchana, S. (2024). Smart irrigation with crop recommendation using machine learning approach. Bulletin of Electrical Engineering and Informatics, 13(3), 1952-1960.
  • Parameswari, P., Rajathi, N., & Harshanaa, K. J. (2021). Machine learning approaches for crop recommendation. In Proceedings of the 2021 International Conference on Advancements in Electrical, Electronics, Communication, Computing and Automation (ICAECA), 1-5.
  • Dey, B., Ferdous, J., & Ahmed, R. (2024). Machine learning based recommendation of agricultural and horticultural crop farming in India under the regime of NPK, soil pH and three climatic variables. Heliyon, 10(3).
  • Wankhade, S. R., & Raut, A. B. (2024). Development of a model for estimation of soil parameters using deep learning. International Journal of Information Technology, 1-17.
  • Chhikara, S., & Kundu, N. (2022). Machine learning based smart crop recommender and yield predictor. In Proceedings of the 2022 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS), 474-478.
  • Bhatt, N., & Varma, S. (2023). Recommendation system for crops integrating with specific soil parameters by machine learning techniques. In Proceedings of the 2023 IEEE International Students' Conference on Electrical, Electronics and Computer Science (SCEECS), 1-8.
  • Ghosh, A., Mohapatra, S. K., Pattanaik, P., Dash, P. K., & Chakravarty, S. (2024). A comprehensive crop recommendation system integrating machine learning and deep learning models. In 2024 1st International Conference on Cognitive, Green and Ubiquitous Computing (IC-CGU), 1-6.
  • Gosai, D., Raval, C., Nayak, R., Jayswal, H., & Patel, A. (2021). Crop recommendation system using machine learning. International Journal of Scientific Research in Computer Science, Engineering and Information Technology, 7(3), 558-569.
  • Disha, M., Dash, S. R., Mohapatra, D., Jena, D., & Barra, P. (2024). Crop recommendation system using supervised learning techniques. In Proceedings of the 2024 International Conference on Emerging Systems and Intelligent Computing (ESIC), 319-323.
  • Thilakarathne, N. N., Bakar, M. S. A., Abas, P. E., & Yassin, H. (2022). A cloud enabled crop recommendation platform for machine learning-driven precision farming. Sensors, 22(16), 6299.
  • Jadav, N. K., Rathod, T., Gupta, R., Tanwar, S., Kumar, N., & Alkhayyat, A. (2023). Blockchain and artificial intelligence-empowered smart agriculture framework for maximizing human life expectancy. Computers and Electrical Engineering, 105, 108486.
  • Raju, C., Ashoka, D. V., & BV, A. P. (2024). CropCast: Harvesting the future with interfused machine learning and advanced stacking ensemble for precise crop prediction. Kuwait Journal of Science, 51(1), 100160.
  • Sharma, A., Tamrakar, A., Dewasi, S., & Naik, N. S. (2022). Early prediction of crop yield in India using machine learning. In Proceedings of the 2022 IEEE Region 10 Symposium (TENSYMP), 1-6.
  • Dey, T., Bera, S., Paul, B., De, D., Mukherjee, A., & Buyya, R. (2024). Fly: Femtolet‐based edge‐cloud framework for crop yield prediction using bidirectional long short‐term memory. Software: Practice and Experience.
  • Kumar, R., Gupta, M., & Singh, U. (2023). Precision agriculture crop recommendation system using KNN algorithm. In Proceedings of the 2023 International Conference on IoT, Communication and Automation Technology (ICICAT), 1-6.
  • Alkhudaydi, T., Albalawi, M. Q., Alanazi, J. S., Al-Anazi, W., & Alfarshouti, R. M. (2023). Deep learning for combined water quality testing and crop recommendation. International Journal of Advanced Computer Science and Applications, 14(4).
  • Nurcahyo, A., Heryadi, Y., Suparta, W., & Sonata, I. (2023). Interpretable machine learning for multi-class crop yield prediction. In Proceedings of the 2023 3rd International Conference on Intelligent Cybernetics Technology & Applications (ICICyTA), 194-200.
  • Terrissa, L., & Ayad, S. (2024). Interpretable crop selection for optimized farming decisions. International Journal of Computing and Digital Systems, 16(1), 189-201.
There are 51 citations in total.

Details

Primary Language English
Subjects Computer Software, Software Engineering (Other)
Journal Section Articles
Authors

Gözde Alp 0000-0002-6479-3500

Fatih Soygazi 0000-0001-8426-2283

Publication Date October 6, 2025
Submission Date April 23, 2025
Acceptance Date July 27, 2025
Published in Issue Year 2025 Volume: 9 Issue: 4

Cite

APA Alp, G., & Soygazi, F. (n.d.). Enhancing Crop Recommendation with SMOTE-Augmented Machine Learning Algorithms. Turkish Journal of Engineering, 9(4), 612-620.
AMA Alp G, Soygazi F. Enhancing Crop Recommendation with SMOTE-Augmented Machine Learning Algorithms. TUJE. 9(4):612-620.
Chicago Alp, Gözde, and Fatih Soygazi. “Enhancing Crop Recommendation With SMOTE-Augmented Machine Learning Algorithms”. Turkish Journal of Engineering 9, no. 4 n.d.: 612-20.
EndNote Alp G, Soygazi F Enhancing Crop Recommendation with SMOTE-Augmented Machine Learning Algorithms. Turkish Journal of Engineering 9 4 612–620.
IEEE G. Alp and F. Soygazi, “Enhancing Crop Recommendation with SMOTE-Augmented Machine Learning Algorithms”, TUJE, vol. 9, no. 4, pp. 612–620.
ISNAD Alp, Gözde - Soygazi, Fatih. “Enhancing Crop Recommendation With SMOTE-Augmented Machine Learning Algorithms”. Turkish Journal of Engineering 9/4 (n.d.), 612-620.
JAMA Alp G, Soygazi F. Enhancing Crop Recommendation with SMOTE-Augmented Machine Learning Algorithms. TUJE.;9:612–620.
MLA Alp, Gözde and Fatih Soygazi. “Enhancing Crop Recommendation With SMOTE-Augmented Machine Learning Algorithms”. Turkish Journal of Engineering, vol. 9, no. 4, pp. 612-20.
Vancouver Alp G, Soygazi F. Enhancing Crop Recommendation with SMOTE-Augmented Machine Learning Algorithms. TUJE. 9(4):612-20.
Flag Counter