109 lines
4.5 KiB
Markdown
109 lines
4.5 KiB
Markdown
# Stable Diffusion web UI
|
|
A browser interface based on Gradio library for Stable Diffusion.
|
|
|
|
Original script with Gradio UI was written by a kind anonymopus user. This is a modification.
|
|
|
|
![](screenshot.png)
|
|
## Installing and running
|
|
|
|
### Stable Diffusion
|
|
|
|
This script assumes that you already have main Stable Diffusion sutff installed, assumed to be in directory `/sd`.
|
|
If you don't have it installed, follow the guide:
|
|
|
|
- https://rentry.org/kretard
|
|
|
|
This repository's `webgui.py` is a replacement for `kdiff.py` from the guide.
|
|
|
|
Particularly, following files must exist:
|
|
|
|
- `/sd/configs/stable-diffusion/v1-inference.yaml`
|
|
- `/sd/models/ldm/stable-diffusion-v1/model.ckpt`
|
|
- `/sd/ldm/util.py`
|
|
- `/sd/k_diffusion/__init__.py`
|
|
|
|
### GFPGAN
|
|
|
|
If you want to use GFPGAN to improve generated faces, you need to install it separately.
|
|
Follow instructions from https://github.com/TencentARC/GFPGAN, but when cloning it, do so into Stable Diffusion main directory, `/sd`.
|
|
After that download [GFPGANv1.3.pth](https://github.com/TencentARC/GFPGAN/releases/download/v1.3.0/GFPGANv1.3.pth) and put it
|
|
into the `/sd/GFPGAN/experiments/pretrained_models` directory. If you're getting troubles with GFPGAN support, follow instructions
|
|
from the GFPGAN's repository until `inference_gfpgan.py` script works.
|
|
|
|
The following files must exist:
|
|
|
|
- `/sd/GFPGAN/inference_gfpgan.py`
|
|
- `/sd/GFPGAN/experiments/pretrained_models/GFPGANv1.3.pth`
|
|
|
|
If the GFPGAN directory does not exist, you will not get the option to use GFPGAN in the UI. If it does exist, you will either be able
|
|
to use it, or there will be a message in console with an error related to GFPGAN.
|
|
|
|
### Web UI
|
|
|
|
Run the script as:
|
|
|
|
`python webui.py`
|
|
|
|
When running the script, you must be in the main Stable Diffusion directory, `/sd`. If you cloned this repository into a subdirectory
|
|
of `/sd`, say, the `stable-diffusion-webui` directory, you will run it as:
|
|
|
|
`python stable-diffusion-webui/webui.py`
|
|
|
|
When launching, you may get a very long warning message related to some weights not being used. You may freely ignore it.
|
|
After a while, you will get a message like this:
|
|
|
|
```
|
|
Running on local URL: http://127.0.0.1:7860/
|
|
```
|
|
|
|
Open the URL in browser, and you are good to go.
|
|
|
|
## Features
|
|
The script creates a web UI for Stable Diffusion's txt2img and img2img scripts. Following are features added
|
|
that are not in original script.
|
|
|
|
### GFPGAN
|
|
Lets you improve faces in pictures using the GFPGAN model. There is a checkbox in every tab to use GFPGAN at 100%, and
|
|
also a separate tab that just allows you to use GFPGAN on any picture, with a slider that controls how strongthe effect is.
|
|
|
|
![](images/GFPGAN.png)
|
|
|
|
### Sampling method selection
|
|
Pick out of three sampling methods for txt2img: DDIM, PLMS, k-diffusion:
|
|
|
|
![](images/sampling.png)
|
|
|
|
### Prompt matrix
|
|
Separate multiple prompts using the `|` character, and the system will produce an image for every combination of them.
|
|
For example, if you use `a busy city street in a modern city|illustration|cinematic lighting` prompt, there are four combinations possible (first part of prompt is always kept):
|
|
|
|
- `a busy city street in a modern city`
|
|
- `a busy city street in a modern city, illustration`
|
|
- `a busy city street in a modern city, cinematic lighting`
|
|
- `a busy city street in a modern city, illustration, cinematic lighting`
|
|
|
|
Four images will be produced, in this order, all with same seed and each with corresponding prompt:
|
|
![](images/prompt-matrix.png)
|
|
|
|
Another example, this time with 5 prompts and 16 variations:
|
|
![](images/prompt_matrix.jpg)
|
|
|
|
If you use this feature, batch count will be ignored, because the number of pictures to produce
|
|
depends on your prompts, but batch size will still work (generating multiple pictures at the
|
|
same time for a small speed boost).
|
|
|
|
|
|
### Flagging
|
|
Click the Flag button under the output section, and generated images will be saved to `log/images` directory, and generation parameters
|
|
will be appended to a csv file `log/log.csv` in the `/sd` directory.
|
|
|
|
### Copy-paste generation parameters
|
|
A text output provides generation parameters in an easy to copy-paste form for easy sharing.
|
|
|
|
![](images/kopipe.png)
|
|
|
|
If you generate multiple pictures, the displayed seed will be the seed of the first one.
|
|
|
|
### Correct seeds for batches
|
|
If you use a seed of 1000 to generate two batches of two images each, four generated images will have seeds: `1000, 1001, 1002, 1003`.
|
|
Previous versions of the UI would produce `1000, x, 1001, x`, where x is an iamge that can't be generated by any seed.
|