Li, L. (2024) “Overview of Multimodal Generative Models in Natural Language Processing and Computer Vision”, Journal of Computer Technology and Applied Mathematics, 1(4), pp. 69–78. doi: 10.5281/zenodo.13988327.