Research Article

Automatic generation of in-vehicle images: StyleGAN-ADA vs. MSG-GAN

Volume: 5 Number: 1 June 30, 2025
EN

Automatic generation of in-vehicle images: StyleGAN-ADA vs. MSG-GAN

Abstract

Deep learning-based methodologies are a key component towards the goal of autonomous driving. For a successful application, these models require a significant amount of training data, which is difficult, time-consuming, and expensive to collect. This study assesses the effectiveness of Generative Adversarial Networks (GANs) in generating high-quality training images for in-vehicle applications using a limited dataset. Two advanced GAN architectures were compared for their ability to produce realistic in-vehicle RGB images. The results showed that the StyleGAN-ADA outperformed the MSG-GAN, generating images with better fidelity and accuracy, making it more suitable for scenarios with limited data. However, challenges such as mode collapse and long training times, particularly for high-resolution images, were identified. The models’ reliance on the quality and diversity of the training dataset also limits their effectiveness in real-world applications. This research highlights the potential of GANs to reduce the lack of data in autonomous driving, pointing to future approaches for optimizing these models.

Keywords

Supporting Institution

FCT - Fundação para a Ciência e Tecnologia

Project Number

UIDB/00319/2020

Thanks

This work has been supported by FCT - Fundação para a Ciência e Tecnologia within the scope of the R&D Units project: UIDB/00319/2020.

References

  1. [1] Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial nets. In Proceedings of the 27th International Conference on Neural Information Processing Systems, Montreal, Canada, 8–13 December 2014; MIT Press: Cambridge, MA, USA, 2014; pp. 2672–2680.
  2. [2] Perez, L.; Wang, J. The effectiveness of data augmentation in image classification using deep learning. arXiv 2017, arXiv:1712.04621. https://doi.org/10.48550/arXiv.1712.04621.
  3. [3] Salimans, T.; Goodfellow, I.; Zaremba, W.; Cheung, V.; Radford, A.; Chen, X. Improved techniques for training GANs. In Proceedings of the 30th International Conference on Neural Information Processing Systems, Barcelona, Spain, 5–10 December 2016; Curran Associates Inc.: Red Hook, NY, USA, 2016; pp. 2234–2242.
  4. [4] Lin, Z.; Khetan, A.; Fanti, G.; Oh, S. PacGAN: The power of two samples in generative adversarial networks. In Proceedings of the 31st Conference on Neural Information Processing Systems, Montreal, Canada, 3–8 December 2018; Curran Associates Inc.: Red Hook, NY, USA, 2018; pp. 1498–1507.
  5. [5] Arjovsky, M.; Bottou, L. Towards principled methods for training generative adversarial networks. arXiv 2017, arXiv:1701.04862. https://doi.org/10.48550/arXiv.1701.04862.
  6. [6] Zhang, D.; Khoreva, A. PA-GAN: Improving GAN training by progressive augmentation. arXiv 2019, arXiv:1901.10422. https://doi.org/10.48550/arXiv.1901.10422.
  7. [7] LeCun, Y.; Cortes, C.; Burges, C.J. MNIST handwritten digit database. AT&T Labs. 2010. Available online: http://yann.lecun.com/exdb/mnist (accessed on 18 Dec 2024).
  8. [8] Liu, Z.; Luo, P.; Wang, X.; Tang, X. Deep learning face attributes in the wild. In Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 7–13 December 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 3730–3738. https://doi.org/10.1109/ICCV.2015.425.

Details

Primary Language

English

Subjects

Image Processing , Distributed Systems and Algorithms , Autonomous Agents and Multiagent Systems

Journal Section

Research Article

Early Pub Date

May 18, 2025

Publication Date

June 30, 2025

Submission Date

January 29, 2024

Acceptance Date

January 16, 2025

Published in Issue

Year 2025 Volume: 5 Number: 1

Vancouver
1.Sahar Azadi, Sandra Dixe, Joao Leite, Joao Borges, Sandro Queiros, Jeime Fonseca. Automatic generation of in-vehicle images: StyleGAN-ADA vs. MSG-GAN. Computers and Informatics. 2025 Jun. 1;5(1):23-31. doi:10.62189/ci.1261718

Cited By

Computers and Informatics is licensed under CC BY-NC 4.0