Research Article

Deep Learning-Based Lung Cancer Diagnosis: Data Balancing, Model Optimisation and Performance Analysis

Volume: 12 Number: 2 June 30, 2025
EN

Deep Learning-Based Lung Cancer Diagnosis: Data Balancing, Model Optimisation and Performance Analysis

Abstract

Lung cancer (LC) is one of the most lethal malignancies worldwide, and early detection is essential. This study develops a deep learning (DL) based classification model for LC diagnosis using computed tomography (CT) images. In the experiments conducted on the IQ-OTHNCCD LC dataset, the Synthetic Minority Over-sampling Technique (SMOTE) method was applied to eliminate class imbalance, data augmentation techniques were used, and an early stopping mechanism was integrated to enhance the model's generalizability. Commonly used convolutional neural network (CNN) architectures, such as ResNet101, VGG19, and DenseNet121, are compared, and the model's performance is analyzed in detail. With an accuracy of 98%, the trial results demonstrate that the suggested ResNet101 model offers the best classification performance. the DenseNet121 model exhibited a relatively lower accuracy rate in distinguishing between benign and normal classes. The study conclusively demonstrates that an optimized ResNet101-based deep learning model, enhanced with data balancing and augmentation techniques, provides the most accurate and reliable classification performance for lung cancer detection using CT images. It not only outperforms traditional CNN architectures in terms of overall accuracy (98%) but also achieves perfect classification in malignant cases. These results validate the model’s potential as a robust diagnostic aid and highlight its superiority over existing methods in the literature, particularly in handling class imbalance and maintaining generalization across diverse image types.

Keywords

References

  1. AL-Huseiny, M. (2021). National Cancer Center Database (IQ-OTH/NCCD - Lung Cancer Dataset) [https://www.kaggle.com/datasets/adityamahimkar/iqothnccd-lung-cancer-dataset/data]. In Kaggle. https://doi.org/doi: 10.17632/bhmdr45bh2.2
  2. Aslan, E. (2025). Development of malaria diagnosis with convolutional neural network architectures: a CNN-based software for accurate cell image analysis. ITEGAM-JETIA, 11(51), 35–42. https://doi.org/10.5935/JETIA.V11I51.1392
  3. Asuntha, A., & Srinivasan, A. (2020). Deep learning for lung Cancer detection and classification. Multimedia Tools and Applications, 79(11–12), 7731–7762. https://doi.org/10.1007/s11042-019-08394-3
  4. Çelik, M., & İnik, Ö. (2023). Detection of Monkeypox Among Different Pox Diseases with Different Pre-Trained Deep Learning Models. Iğdır Üniversitesi Fen Bilimleri Enstitüsü Dergisi, 13(1), 10–21. https://doi.org/10.21597/JIST.1206453
  5. Crasta, L. J., Neema, R., & Pais, A. R. (2024). A novel Deep Learning architecture for lung cancer detection and diagnosis from Computed Tomography image analysis. Healthcare Analytics, 5(1) 100316. https://doi.org/10.1016/J.HEALTH.2024.100316
  6. Davri, A., Birbas, E., Kanavos, T., Ntritsos, G., Giannakeas, N., Tzallas, A. T., & Batistatou, A. (2023). Deep Learning for Lung Cancer Diagnosis, Prognosis and Prediction Using Histological and Cytological Images: A Systematic Review. Cancers 2023, 15(15), 3981. https://doi.org/10.3390/CANCERS15153981
  7. Devarajan, H. R., Balasubramanian, S., Kumar Swarnkar, S., Kumar, P., & Jallepalli, V. R. (2023). Deep Learning for Automated Detection of Lung Cancer from Medical Imaging Data. International Conference on Artificial Intelligence for Innovations in Healthcare Industries (ICAIIHI 2023). https://doi.org/10.1109/ICAIIHI57871.2023.10488962
  8. Eren, B., Fen, Ü., Dergisi, B., Aslan, E., & Özüpak, Y. (2024). Classification of Blood Cells with Convolutional Neural Network Model. Bitlis Eren Üniversitesi Fen Bilimleri Dergisi, 13(1), 314–326. https://doi.org/10.17798/BITLISFEN.1401294

Details

Primary Language

English

Subjects

Deep Learning

Journal Section

Research Article

Early Pub Date

May 20, 2025

Publication Date

June 30, 2025

Submission Date

February 28, 2025

Acceptance Date

March 25, 2025

Published in Issue

Year 2025 Volume: 12 Number: 2

APA
Alpsalaz, F. (2025). Deep Learning-Based Lung Cancer Diagnosis: Data Balancing, Model Optimisation and Performance Analysis. Gazi University Journal of Science Part A: Engineering and Innovation, 12(2), 373-391. https://doi.org/10.54287/gujsa.1648772
AMA
1.Alpsalaz F. Deep Learning-Based Lung Cancer Diagnosis: Data Balancing, Model Optimisation and Performance Analysis. GU J Sci, Part A. 2025;12(2):373-391. doi:10.54287/gujsa.1648772
Chicago
Alpsalaz, Feyyaz. 2025. “Deep Learning-Based Lung Cancer Diagnosis: Data Balancing, Model Optimisation and Performance Analysis”. Gazi University Journal of Science Part A: Engineering and Innovation 12 (2): 373-91. https://doi.org/10.54287/gujsa.1648772.
EndNote
Alpsalaz F (June 1, 2025) Deep Learning-Based Lung Cancer Diagnosis: Data Balancing, Model Optimisation and Performance Analysis. Gazi University Journal of Science Part A: Engineering and Innovation 12 2 373–391.
IEEE
[1]F. Alpsalaz, “Deep Learning-Based Lung Cancer Diagnosis: Data Balancing, Model Optimisation and Performance Analysis”, GU J Sci, Part A, vol. 12, no. 2, pp. 373–391, June 2025, doi: 10.54287/gujsa.1648772.
ISNAD
Alpsalaz, Feyyaz. “Deep Learning-Based Lung Cancer Diagnosis: Data Balancing, Model Optimisation and Performance Analysis”. Gazi University Journal of Science Part A: Engineering and Innovation 12/2 (June 1, 2025): 373-391. https://doi.org/10.54287/gujsa.1648772.
JAMA
1.Alpsalaz F. Deep Learning-Based Lung Cancer Diagnosis: Data Balancing, Model Optimisation and Performance Analysis. GU J Sci, Part A. 2025;12:373–391.
MLA
Alpsalaz, Feyyaz. “Deep Learning-Based Lung Cancer Diagnosis: Data Balancing, Model Optimisation and Performance Analysis”. Gazi University Journal of Science Part A: Engineering and Innovation, vol. 12, no. 2, June 2025, pp. 373-91, doi:10.54287/gujsa.1648772.
Vancouver
1.Feyyaz Alpsalaz. Deep Learning-Based Lung Cancer Diagnosis: Data Balancing, Model Optimisation and Performance Analysis. GU J Sci, Part A. 2025 Jun. 1;12(2):373-91. doi:10.54287/gujsa.1648772