Article Open Access http://dx.doi.org/10.26855/acc.2023.06.011
Exploration and Improvement of the Stable Diffusion Model in the Field of Image Generation
Lei Liang
Faculty of Humanities and Arts, Macau University of Science and Technology, Avenida Wai Long, Taipa, Macau, China.
*Corresponding author: Lei Liang
Published: July 24,2023
Abstract
Image generation is an important research direction in the field of computer vision, which covers many tasks such as image synthesis, image conversion, image editing and other tasks. In recent years, the rapid development of deep learning technology has provided powerful tools for image generation, where the stable diffusion model has attracted much attention in the image generation field as a kind of image generation model. Stable diffusion model (Stable Diffusion Model) is a generative model based on the diffusion process. The basic principle is to gradually generate the target image by conducting the diffusion process on the noise image. Different from the traditional generation models such as generative adversarial networks (GANs) and variation auto-encoders (VAEs), the stable diffusion model can gradually control the details and quality of the image during the generation process, with good generation stability and sample quality. With the continuous exploration and application of stable diffusion model in the field of image generation, researchers have proposed many improvement methods, including the improvement of generation network structure, loss function and sam-ple optimization method, to further improve the generation effect and generation speed of stable diffusion model. This paper aims to explore and summarize the application status and improvement methods of stable diffusion model in the field of image generation. Through the study and improvement of stable diffusion model, it can provide new methods and ideas to achieve more realistic, diversified and controllable image generation effects.
References
[1] Li Zonglin, Zhang Shengping, Liu Yang, Zhang Zhaoxin, Zhang Weigang, Huang Qingming. Text-driven face image generation and editing based on a multi-level residual mapper [J / OL]. Journal of Software: 1-15 [2023-04-20].
DOI:10.13328/j.cnki.jos.006767.
[2] Lai Lina, Mi Yu, Zhou Longlong, Rao Jiyong, Xu Tianyang, Song Xiaoning. Summary of Generative adversarial networks and text image generation methods [J / OL]. Computer Engineering and application: 1-23 [2023-04-20].
http://kns.cnki.net/kcms/detail/11.2127.TP.20230314.1549.022.html.
[3] Yang Hongyu, Yang Fan. Adversarial sample detection method based on image denoising and image generation [J / OL]. Journal of Hunan University (Natural Science Edition): 1-10 [2023-04-20].
http://kns.cnki.net/kcms/detail/43.1061.n.20230308.1449.002.html.
[4] Jiang Nianyun. High technology and its industrial development mechanism are used to look at it from the history of computer development [J]. Technology think-tank, 2023(03):12-16.DOI:10.19881/j.cnki.1006-3676.2023.03.02.
[5] Li Chun. Computer Development based on Intelligent Information Processing [J]. China New Communications, 2023, 25 (04): 31-33.
How to cite this paper
Exploration and Improvement of the Stable Diffusion Model in the Field of Image Generation
How to cite this paper: Lei Liang. (2023) Exploration and Improvement of the Stable Diffusion Model in the Field of Image Generation. Advances in Computer and Communication, 4(3), 163-166.
DOI: http://dx.doi.org/10.26855/acc.2023.06.011