The Technology of Face Synthesis and Editing Based on Generative Models

Shijie Lyu

doi:10.5281/zenodo.13853413

Authors

Shijie Lyu Georgia Institute of Technology

DOI:

https://doi.org/10.5281/zenodo.13853413

ARK:

https://n2t.net/ark:/40704/JCTAM.v1n4a03

Disciplines:

Computer Science

Subjects:

Computer Vision

References:

39

Keywords:

Generative Artificial Intelligence, Computer Vision, Ethical Issues

Abstract

This paper reviews the current state of research on generative AI both domestically and internationally, exploring its potential applications and challenges across various fields. In education, generative AI enhances students' academic writing skills and learning outcomes by providing personalized learning support. In design, it facilitates personalized and innovative creations, enabling designers to generate novel ideas through algorithms. Additionally, the application of generative AI in psychology reveals the complex relationship between emotion analysis and social behavior, while in computer vision, it advances facial recognition technology. However, with the widespread use of generative AI, ethical and social responsibility issues are increasingly prominent. This paper emphasizes the importance of establishing appropriate regulations and legal frameworks to ensure the authenticity and morality of generated content, making this a key focus for future research. Overall, generative AI is profoundly transforming research and practice in various fields, and future studies must pay greater attention to its social impact and technological responsibilities.

Author Biography

Shijie Lyu, Georgia Institute of Technology

Georgia Institute of Technology, USA.

References

Mo, Y. ., Qin, H., Dong, Y., Zhu, Z., & Li, Z. (2024). Large Language Model (LLM) AI Text Generation Detection based on Transformer Deep Learning Algorithm. International Journal of Engineering and Management Research, 14(2), 154–159. https://doi.org/10.5281/zenodo.11124440

Hao Qin, & Zhi Li. (2024). A Study on Enhancing Government Efficiency and Public Trust: The Transformative Role of Artificial Intelligence and Large Language Models. International Journal of Engineering and Management Research, 14(3), 57–61. https://doi.org/10.5281/zenodo.12619360

Hao Qin, & Li, Z. (2024). Precision in Practice: Enhancing Healthcare with Domain-Specific Language Models. Applied Science and Engineering Journal for Advanced Research, 3(4), 28–33. https://doi.org/10.5281/zenodo.13253336

Qu, M. (2024). High Precision Measurement Technology of Geometric Parameters Based on Binocular Stereo Vision Application and Development Prospect of The System in Metrology and Detection. Journal of Computer Technology and Applied Mathematics, 1(3), 23–29. https://doi.org/10.5281/zenodo.13366612

Wang, D. (Ed.). (2016). Information Science and Electronic Engineering: Proceedings of the 3rd

International Conference of Electronic Engineering and Information Science (ICEEIS 2016), January 4-5, 2016, Harbin, China. CRC Press.

Vivek Tomar & Nitin Kumar. (2024). Single-sample face and ear recognition using virtual sample generation with 2D local patches. The Journal of Supercomputing (prepublish), 1-26.

(2024). Contactless Smart Attendance System Using Facial Recognition and QR Code. International Journal of Engineering and Advanced Technology (IJEAT) (6)

(2024). Nabla Works Corp. Launches 'Face Recognition Access Solution'. Wireless News

Emma Bou Hanna, Sebastian Partarrieu, Arnaud Berenbaum, Stéphanie Allassonnière & Florent L. Besson. (2024). Exploring de-anonymization risks in PET imaging: Insights from a comprehensive analysis of 853 patient scans. Scientific Data (1), 932-932.

Sebastian Ludyga,Fabienne Bruggisser,Rahel Leuenberger,Toru Ishihara,Keita Kamijo,Mark Brotzmann... & Markus Gerber. (2024). Acute effects of exercise on gaze fixation and affective response inhibition in children with autism spectrum disorder: A randomized cross-over study.. Autism research : official journal of the International Society for Autism Research

Yang Wang,Fan Xie,Chuanxin Zhao,Ao Wang,Chang Ma,Shijia Song... & Lijun Zhao. (2024). Robust face recognition model based sample mining and loss functions. Knowledge-Based Systems 112330-112330.

Usman Muhammad, Mourad Oussalah & Jorma Laaksonen. (2024). Saliency-based video summarization for face anti-spoofing. Pattern Recognition Letters 190-196.

(2024). Nabla Works Corp. Introduces 'Face Recognition Access Solution'. Manufacturing Close - Up

Xiaoqun LIU, Mei LIU & Ning LI. (2024). Dual vision visual fusion improved YOLO-V7 intelligent elevator face recognition model. Journal of Optics (prepublish), 1-13.

Juan Cai, Qiangqiang Duan, Min Long, Le Bing Zhang & Xiangling Ding. (2024). Feature Interaction-Based Face De-Morphing Factor Prediction for Restoring Accomplice’s Facial Image. Sensors (17), 5504-5504

Parker L J ,Richard M V ,Acabá A , et al. Negotiating Meaning with Machines: AI's Role in Doctoral Writing Pedagogy [J]. International Journal of Artificial Intelligence in Education, 2024, (prepublish): 1-21.

Kalinin A ,Jafari A A ,Avots E , et al. Generative AI-based style recommendation using fashion item detection and classification [J]. Signal, Image and Video Processing, 2024, (prepublish): 1-11.

He Z ,Wang H Y ,Zhang J . Generative AIBIM: An automatic and intelligent structural design pipeline integrating BIM and generative AI [J]. Information Fusion, 2025, 114 102654-102654.

Rodrigo M ,Cuevas C ,García N . Comprehensive comparison between vision transformers and convolutional neural networks for face recognition tasks [J]. Scientific Reports, 2024, 14 (1): 21392-21392.

Gaur S ,Pandey M ,Himanshu . Realization of Facial Recognition Technology for Attendance Monitoring Through Biometric Modalities Employing MTCNN Integration [J]. SN Computer Science, 2024, 5 (7): 862-862.

Moore N K ,Nesmith L B ,Zwemer U D , et al. Search efforts and face recognition: the role of expectations of encounter and within-person variability in prospective person memory [J]. Cognitive Research: Principles and Implications, 2024, 9 (1): 63-63.

Śliwiak P ,Shah A A S . Text-to-text generative approach for enhanced complex word identification [J]. Neurocomputing, 2024, 610 128501-128501.

Qiao, Y., Li, K., Lin, J., Wei, R., Jiang, C., Luo, Y., & Yang, H. (2024). Robust domain generalization for multi-modal object recognition. arXiv preprint arXiv:2408.05831.

Li, K., Chen, J., Yu, D., Dajun, T., Qiu, X., Jieting, L., ... & Ni, F. (2024). Deep Reinforcement Learning-based Obstacle Avoidance for Robot Movement in Warehouse Environments. arXiv preprint arXiv:2409.14972.

Li, K., Wang, J., Wu, X., Peng, X., Chang, R., Deng, X., ... & Hong, B. (2024). Optimizing Automated Picking Systems in Warehouse Robots Using Machine Learning. arXiv preprint arXiv:2408.16633.Charlie H

He, C., Liu, M., Wang, Z., Chen, G., Zhang, Y., & Hsiang, S. M. (2022). Facilitating Smart Contract in Project Scheduling under Uncertainty—A Choquet Integral Approach. Construction Research Congress 2022, 930–939.

Chen, G., Liu, M., Zhang, Y., Wang, Z., Hsiang, S. M., & He, C. (2023). Using Images to Detect, Plan, Analyze, and Coordinate a Smart Contract in Construction. Journal of Management in Engineering, 39(2), 1–18.

Wu, T., Zaitian, G., Wang, Q., & Gao, F. (2024). Scalable Distance-based Multi-Agent Relative State Estimation via Block Multiconvex Optimization. arXiv preprint arXiv:2405.20883.

Zhao, G., Wu, T., Chen, Y., & Gao, F. (2024). Learning Speed Adaptation for Flight in Clutter. IEEE Robotics and Automation Letters.

Wu, T., Chen, Y., Chen, T., Zhao, G., & Gao, F. (2024). Whole-Body Control Through Narrow Gaps From Pixels To Action. arXiv preprint arXiv:2409.00895.

Bačić, B., Feng, C., & Li, W. (2024). JY61 IMU SENSOR EXTERNAL VALIDITY: A FRAMEWORK FOR ADVANCED PEDOMETER ALGORITHM PERSONALISATION. ISBS Proceedings Archive, 42(1), 60.

Yu, P., Cui, V. Y., & Guan, J. (2021, March). Text classification by using natural language processing. In Journal of Physics: Conference Series (Vol. 1802, No. 4, p. 042010). IOP Publishing.

Xu, Y., Lin, Y. S., Zhou, X., & Shan, X. (2024). Utilizing emotion recognition technology to enhance user experience in real-time. Computing and Artificial Intelligence, 2(1), 1388-1388.

Luo, M., Zhang, W., Song, T., Li, K., Zhu, H., Du, B., & Wen, H. (2021, January). Rebalancing expanding EV sharing systems with deep reinforcement learning. In Proceedings of the Twenty-Ninth International Conference on International Joint Conferences on Artificial Intelligence (pp. 1338-1344).

Luo, M., Du, B., Zhang, W., Song, T., Li, K., Zhu, H., ... & Wen, H. (2023). Fleet rebalancing for expanding shared e-Mobility systems: A multi-agent deep reinforcement learning approach. IEEE Transactions on Intelligent Transportation Systems, 24(4), 3868-3881.

Fan, H., Li, K., Li, X., Song, T., Zhang, W., Shi, Y., & Du, B. (2019). CoVSCode: a novel real-time collaborative programming environment for lightweight IDE. Applied Sciences, 9(21), 4642.

Sun, Y., & Ortiz, J. (2024). Machine Learning-Driven Pedestrian Recognition and Behavior Prediction for Enhancing Public Safety in Smart Cities. Journal of Artificial Intelligence and Information, 1, 51-57.