Exploring Semantic Consistency in Generative Artificial Intelligence via Text-to-Image and Image-to-Text Transformation
Öz
Anahtar Kelimeler
Kaynakça
- R. Rombach, A. Blattmann, D. Lorenz, P. Esser and B. Ommer, “High-resolution image synthesis with latent diffusion models”, In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp. 10684-10695, 2022.
- J. Betker, G. Goh, L. Jing, T. Brooks, J. Wang, L. Li ... and A. Ramesh, “Improving image generation with better captions”, Computer Science, 2(3), 8, https://cdn. openai. com/papers/dall-e-3.pdf, 2023.
- J. Li, D. Li, C. Xiong and S. Hoi, “Blip: Bootstrapping language-image pre-training for unified vision-language understanding and generation”, In International conference on machine learning, PMLR, pp. 12888-12900, 2022.
- S. Reed, Z. Akata, X. Yan, L. Logeswaran, B. Schiele and H. Lee, “Generative adversarial text to image synthesis”, In Proceedings of the 33rd International Conference on Machine Learning (ICML), PMLR, pp. 1060-1069, 2016.
- R. Kiros, R. Salakhutdinov and R. S. Zemel, “Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models”, arXiv preprint arXiv:1411.2539, 2014.
- A. Lin, L. Monteiro Paes, S. H. Tanneru, S. Srinivas and H. Lakkaraju, “Word-Level Explanations for Analyzing Bias in Text-to-Image Models”, arXiv preprint arXiv:2302.06578, 2023.
- L. Yang, Z. Zhang, Y. Song, S. Hong, R. Xu, Y. Zhao, and M. H. Yang, “Diffusion models: A comprehensive survey of methods and applications”. ACM Computing Surveys, 56(4), 1-39, 2023.
- J. Ho, A. Jain and P. Abbeel, “Denoising diffusion probabilistic models”, Advances in neural information processing systems, 33, 6840-6851, 2020.
Ayrıntılar
Birincil Dil
İngilizce
Konular
Multimodal Analiz ve Sentez, Derin Öğrenme, Doğal Dil İşleme
Bölüm
Araştırma Makalesi
Yazarlar
Onur Doğan
0009-0001-5083-0163
Türkiye
Almila Altıntaş
0009-0008-7955-3789
Türkiye
Buse Yücetürk
0009-0003-3078-4352
Türkiye
Doğa Aydın
0009-0000-7782-0830
Türkiye
Fatih Soygazi
*
0000-0001-8426-2283
Türkiye
Yılmaz Kılıçaslan
0000-0002-5020-6547
Türkiye
Yayımlanma Tarihi
27 Haziran 2025
Gönderilme Tarihi
31 Mayıs 2025
Kabul Tarihi
23 Haziran 2025
Yayımlandığı Sayı
Yıl 2025 Cilt: 5 Sayı: 1