LI, Liang. Overview of Multimodal Generative Models in Natural Language Processing and Computer Vision. Journal of Computer Technology and Applied Mathematics, [S. l.], v. 1, n. 4, p. 69–78, 2024. DOI: 10.5281/zenodo.13988327. Disponível em: https://www.suaspress.org/ojs/index.php/JCTAM/article/view/v1n4a09. Acesso em: 22 may. 2025.