magazinelogo

Advances in Computer and Communication

ISSN Print: Downloads: 58603 Total View: 486205
Frequency: quarterly ISSN Online: 2767-2875 CODEN: ACCDC3
Email: acc@hillpublisher.com
Article Open Access http://dx.doi.org/10.26855/acc.2023.06.011

Exploration and Improvement of the Stable Diffusion Model in the Field of Image Generation

Lei Liang

Faculty of Humanities and Arts, Macau University of Science and Technology, Avenida Wai Long, Taipa, Macau, China.

*Corresponding author: Lei Liang

Published: July 24,2023

Abstract

Image generation is an important research direction in the field of computer vision, which covers many tasks such as image synthesis, image conversion, image editing and other tasks. In recent years, the rapid development of deep learning technology has provided powerful tools for image generation, where the stable diffusion model has attracted much attention in the image generation field as a kind of image generation model. Stable diffusion model (Stable Diffusion Model) is a generative model based on the diffusion process. The basic principle is to gradually generate the target image by conducting the diffusion process on the noise image. Different from the traditional generation models such as generative adversarial networks (GANs) and variation auto-encoders (VAEs), the stable diffusion model can gradually control the details and quality of the image during the generation process, with good generation stability and sample quality. With the continuous exploration and application of stable diffusion model in the field of image generation, researchers have proposed many improvement methods, including the improvement of generation network structure, loss function and sam-ple optimization method, to further improve the generation effect and generation speed of stable diffusion model. This paper aims to explore and summarize the application status and improvement methods of stable diffusion model in the field of image generation. Through the study and improvement of stable diffusion model, it can provide new methods and ideas to achieve more realistic, diversified and controllable image generation effects.

References

[1] Li Zonglin, Zhang Shengping, Liu Yang, Zhang Zhaoxin, Zhang Weigang, Huang Qingming. Text-driven face image generation and editing based on a multi-level residual mapper [J / OL]. Journal of Software: 1-15 [2023-04-20].

DOI:10.13328/j.cnki.jos.006767.

[2] Lai Lina, Mi Yu, Zhou Longlong, Rao Jiyong, Xu Tianyang, Song Xiaoning. Summary of Generative adversarial networks and text image generation methods [J / OL]. Computer Engineering and application: 1-23 [2023-04-20].

http://kns.cnki.net/kcms/detail/11.2127.TP.20230314.1549.022.html.

[3] Yang Hongyu, Yang Fan. Adversarial sample detection method based on image denoising and image generation [J / OL]. Journal of Hunan University (Natural Science Edition): 1-10 [2023-04-20].

http://kns.cnki.net/kcms/detail/43.1061.n.20230308.1449.002.html.

[4] Jiang Nianyun. High technology and its industrial development mechanism are used to look at it from the history of computer development [J]. Technology think-tank, 2023(03):12-16.DOI:10.19881/j.cnki.1006-3676.2023.03.02.

[5] Li Chun. Computer Development based on Intelligent Information Processing [J]. China New Communications, 2023, 25 (04): 31-33.

How to cite this paper

Exploration and Improvement of the Stable Diffusion Model in the Field of Image Generation

How to cite this paper: Lei Liang. (2023) Exploration and Improvement of the Stable Diffusion Model in the Field of Image Generation. Advances in Computer and Communication4(3), 163-166.

DOI: http://dx.doi.org/10.26855/acc.2023.06.011