EN
TB-SMGAN: A GAN Based Hybrid Data Augmentation Framework on Chest X-ray Images and Reports
Abstract
Data augmentation is a common practice in image classification, employing methods such as reflection, random cropping, re-scaling, and transformations to enhance training data. These techniques are prevalent when working with extended real-world datasets, focusing on improving classification accuracy through increased diversity. The use of Generative Adversarial Networks (GANs), known for their high representational power, enables learning the distribution of real data and generating samples with previously unseen discriminative features. However, intra-class imbalances in augmentations are problematic for conventional GAN augmentations. Hence, we propose a framework named Text-Based Style-Manipulated GAN augmentation framework (TB-SMGAN) aims to leverage the generative capabilities of StyleGAN2-ADA. In this framework, we utilize StyleCLIP to control disentangled feature manipulations and intra-class imbalances. We enhance the efficiency of StyleCLIP by fine-tuning CLIP with x-ray images and information extractions from corresponding medical reports. Our proposed framework demonstrates an improvement in terms of mean PR-AUC score when employing the text-based manipulated GAN augmentation technique compared to conventional GAN augmentation.
Keywords
References
- Alqahtani, H., Kavakli-Thorne, M., & Kumar, G. (2021). Applications of generative adversarial networks (gans): An updated review. Archives of Computational Methods in Engineering, 28, 525-552. https://doi.org/10.1007/s11831-019-09388-y
- Altwaijry, N. (2023). Probability-based synthetic minority oversampling technique. IEEE Access, 11, 28831-28839. https://doi.org/10.1109/ACCESS.2023.3260723
- Benčević, M., Habijan, M., Galić, I., & Pizurica, A. (2022, August 29 - September 02). Self-supervised Learning as a Means to Reduce the Need for Labeled Data in Medical Image Analysis. In: Proceedings of the 30th European Signal Processing Conference (EUSIPCO 2022) (pp. 1328-1332). Belgrade, Serbia. https://doi.org/10.23919/EUSIPCO55093.2022.9909542
- Bowles, C., Chen, L., Guerrero, R., Bentley, P., Gunn, R., Hammers, A., Dickie, D. A., Hernández, M. V., Wardlaw, J., & Rueckert, D. (2018). Gan augmentation: Augmenting training data using generative adversarial networks. https://doi.org/10.48550/arXiv.1810.10863
- Dao, H. N., Quang, T. N., & Paik, I. (2022, October 26-28). Transfer Learning for Medical Image Classification on Multiple Datasets using PubMedCLIP. In: Proceedings of the 2022 IEEE International Conference on Consumer Electronics-Asia (ICCE-Asia) (pp. 1-4). Yeosu, Korea, Republic of. https://doi.org/10.1109/ICCE-Asia57006.2022.9954669
- Dao, H. N., Nguyen, T., Mugisha, C., & Paik, I. (2023). A Multimodal Transfer Learning Approach Using PubMedCLIP for Medical Image Classification. IEEE Access, 12, 75496-75507. https://doi.org/10.1109/ACCESS.2024.3401777
- Deepshikha, K., & Naman, A. (2020). Removing Class Imbalance using Polarity-GAN: An Uncertainty Sampling Approach. https://doi.org/10.48550/arXiv.2012.04937
- Fetty, L., Bylund, M., Kuess, P., Heilemann, G., Nyholm, T., Georg, D., & Löfstedt, T. (2020). Latent space manipulation for high-resolution medical image synthesis via the StyleGAN. Zeitschrift für Medizinische Physik, 30(4), 305-314. https://doi.org/10.1016/j.zemedi.2020.05.001
Details
Primary Language
English
Subjects
Deep Learning
Journal Section
Research Article
Authors
Early Pub Date
September 28, 2024
Publication Date
September 30, 2024
Submission Date
June 13, 2024
Acceptance Date
September 12, 2024
Published in Issue
Year 2024 Volume: 11 Number: 3
APA
Özfidan, H. B., & Şimşek, M. U. (2024). TB-SMGAN: A GAN Based Hybrid Data Augmentation Framework on Chest X-ray Images and Reports. Gazi University Journal of Science Part A: Engineering and Innovation, 11(3), 497-506. https://doi.org/10.54287/gujsa.1501098
AMA
1.Özfidan HB, Şimşek MU. TB-SMGAN: A GAN Based Hybrid Data Augmentation Framework on Chest X-ray Images and Reports. GU J Sci, Part A. 2024;11(3):497-506. doi:10.54287/gujsa.1501098
Chicago
Özfidan, Hasan Berat, and Mehmet Ulvi Şimşek. 2024. “TB-SMGAN: A GAN Based Hybrid Data Augmentation Framework on Chest X-Ray Images and Reports”. Gazi University Journal of Science Part A: Engineering and Innovation 11 (3): 497-506. https://doi.org/10.54287/gujsa.1501098.
EndNote
Özfidan HB, Şimşek MU (September 1, 2024) TB-SMGAN: A GAN Based Hybrid Data Augmentation Framework on Chest X-ray Images and Reports. Gazi University Journal of Science Part A: Engineering and Innovation 11 3 497–506.
IEEE
[1]H. B. Özfidan and M. U. Şimşek, “TB-SMGAN: A GAN Based Hybrid Data Augmentation Framework on Chest X-ray Images and Reports”, GU J Sci, Part A, vol. 11, no. 3, pp. 497–506, Sept. 2024, doi: 10.54287/gujsa.1501098.
ISNAD
Özfidan, Hasan Berat - Şimşek, Mehmet Ulvi. “TB-SMGAN: A GAN Based Hybrid Data Augmentation Framework on Chest X-Ray Images and Reports”. Gazi University Journal of Science Part A: Engineering and Innovation 11/3 (September 1, 2024): 497-506. https://doi.org/10.54287/gujsa.1501098.
JAMA
1.Özfidan HB, Şimşek MU. TB-SMGAN: A GAN Based Hybrid Data Augmentation Framework on Chest X-ray Images and Reports. GU J Sci, Part A. 2024;11:497–506.
MLA
Özfidan, Hasan Berat, and Mehmet Ulvi Şimşek. “TB-SMGAN: A GAN Based Hybrid Data Augmentation Framework on Chest X-Ray Images and Reports”. Gazi University Journal of Science Part A: Engineering and Innovation, vol. 11, no. 3, Sept. 2024, pp. 497-06, doi:10.54287/gujsa.1501098.
Vancouver
1.Hasan Berat Özfidan, Mehmet Ulvi Şimşek. TB-SMGAN: A GAN Based Hybrid Data Augmentation Framework on Chest X-ray Images and Reports. GU J Sci, Part A. 2024 Sep. 1;11(3):497-506. doi:10.54287/gujsa.1501098