Researchers are showing great interest in Generative Adversarial Networks (GANs), which use deep learning techniques to mimic the content of datasets and are particularly adept at data generation. Despite their impressive performance, there is uncertainty about how GANs precisely map latent space vectors to realistic images and how the chosen dimensionality of the latent space affects the quality of the generated images. In this paper, we explored the potential of generative models in generating animal face images. For this purpose, we used the Deep Convolutional Generative Adversarial Network (DCGAN) model as a reference. To analyze the impact of selected latent space vectors, we synthesized animal face images by training data representations in the DCGAN model with the well-known AFHQ dataset from the literature. We compared the quantitative evaluation of the produced images using Fréchet Inception Distance (FID) and Inception Score (IS). As a result, we demonstrated that generative models can produce images with latent sizes significantly smaller and larger than the standard size of 100.
Researchers have shown great interest in Generative Adversarial Networks (GANs), which utilize deep learning techniques to mimic the content of datasets and particularly excel in data generation. Despite their impressive performance, there remains uncertainty about how GANs precisely map latent space vectors to realistic images and how the chosen dimensionality of the latent space affects the quality of the generated images. In this study, we analyze the potential of learned data representations to generate different animal face images and examine the impact of the selected latent space dimension on the synthesized image quality using a Deep Convolutional GAN (DCGAN). For quantitative evaluation of the generated synthetic images, we employ metrics such as Fr´echet Inception Distance (FID) and Inception Score (IS). In addition to quantitative assessment results, we also utilize qualitative evaluation methods to assess whether overfitting is present and to form intuitive perception about data samples extracted from the model and the ability to disseminate. Finally, we compare and evaluate the results of generative outputs by training the DCGAN on well-known AFHQ Cat, AFHQ Dog, and AFHQ Wild Animals datasets, measuring the impact of latent space dimensions and image feature quality through a comparative analysis.
Primary Language | English |
---|---|
Subjects | Image Processing, Deep Learning |
Journal Section | Articles |
Authors | |
Early Pub Date | March 29, 2024 |
Publication Date | March 29, 2024 |
Submission Date | November 21, 2023 |
Acceptance Date | March 19, 2024 |
Published in Issue | Year 2024 |