Text to Img with Stable Cascade (on a Gradio interface), requiring less VRAM than the original example on the official Hugging Face page (https://huggingface.co/stabilityai/stable-cascade):
Answer: Stable Cascade is composed of two models, each several GB in size. The Stability AI example loads both models into GPU VRAM simultaneously, while this application loads the first one (the prior), creates the image embeddings, cleans the VRAM, sends the embeddings to the second model (the decoder), returns the final image, and then cleans the VRAM completely. For anyone with less than 16 GB of VRAM, both models would not fit in VRAM together without this "trick", forcing a fallback to system RAM with a huge drop in performance (generation time goes from 10 minutes down to 44 seconds at 1280x1536 on an NVIDIA RTX 3060 with 12 GB of VRAM).
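The "clean the VRAM" step boils down to dropping the only reference to the first pipeline, running Python's garbage collector, and releasing PyTorch's cached blocks. A minimal sketch of the idea, with a dummy tensor standing in for the prior pipeline (the full two-stage example is at the end of this README):

import gc
import torch

model = torch.zeros(256, 1024, 1024, device="cuda")  # ~1 GiB stand-in for the prior pipeline
print(torch.cuda.memory_allocated() // 2**20, "MiB allocated")

del model                  # drop the only reference...
gc.collect()               # ...let Python reclaim the object...
torch.cuda.empty_cache()   # ...and hand the cached blocks back to the driver
print(torch.cuda.memory_allocated() // 2**20, "MiB allocated")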
The diffusers branch is currently broken; in the meantime, you can install it from an older commit (--force is needed):
.\venv\Scripts\activate
pip install git+https://github.com/kashif/diffusers.git@a3dc21385b7386beb3dab3a9845962ede6765887 --force
- Install Python 3.10.6, checking "Add Python to PATH".
- Install git.
- In a terminal:
git clone https://github.com/shiroppo/stable_cascade_easy
cd stable_cascade_easy
py -m venv venv
.\venv\Scripts\activate
pip install -r requirements.txt
pip install git+https://github.com/kashif/diffusers.git@a3dc21385b7386beb3dab3a9845962ede6765887 --force
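After installation, an optional sanity check (with the venv still active) confirms that the pinned diffusers commit imports cleanly and that PyTorch sees your GPU:

import torch
import diffusers

print(diffusers.__version__)      # should import without errors from the pinned commit
print(torch.cuda.is_available())  # True if the CUDA build of PyTorch is installed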
Double-click app.bat in the stable_cascade_easy directory.
Or, in a terminal:
.\venv\Scripts\activate
py app.py
To update, in a terminal:
git pull
(if you get an error, run git stash and then git pull again)
.\venv\Scripts\activate
pip install -r requirements.txt
You can choose between DDPMWuerstchenScheduler (default), DPM++ 2M Karras, and LCM. Euler a and DPM++ SDE Karras cause errors, so they cannot be selected. The scheduler choice applies only to the prior model; the decoder model works only with the default scheduler.
If you select LCM, the prior model needs only about 6+ steps, so image creation is even faster; see the scheduler-swap sketch below.
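For reference, swapping the prior's scheduler in diffusers follows the usual from_config pattern. A minimal sketch, assuming LCMScheduler accepts the prior's scheduler config (this kind of compatibility is exactly what makes Euler a and DPM++ SDE Karras fail here):

import torch
from diffusers import LCMScheduler, StableCascadePriorPipeline

prior = StableCascadePriorPipeline.from_pretrained(
    "stabilityai/stable-cascade-prior", torch_dtype=torch.bfloat16
)
# LCM allows very low step counts on the prior (roughly 6+ steps)
prior.scheduler = LCMScheduler.from_config(prior.scheduler.config)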
Created images will be saved in the "image" folder
You can change the contrast of the final image with a value from 0.5 to 1.5; a value of 1 leaves the image unchanged (best results between 0.95 and 1.05). A sketch of such an adjustment follows.
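Those semantics (1.0 = no change) match PIL's ImageEnhance.Contrast, so the adjustment presumably looks something like this sketch (an assumption about the implementation, not the app's exact code):

from PIL import Image, ImageEnhance

image = Image.open("image.png")
# enhance(1.0) returns the image unchanged; 0.5 halves the contrast, 1.5 boosts it
image = ImageEnhance.Contrast(image).enhance(1.05)
image.save("image_contrast.png")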
Stable Cascade expects dimensions that are multiples of 128, but the app will resize the image for you, so you can use any size you want (a sketch of that rounding step follows).
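A plausible version of the resize step (an illustrative sketch, not necessarily the app's exact code): snap each requested dimension to the nearest multiple of 128 before calling the prior.

def snap_to_multiple(value: int, base: int = 128) -> int:
    # Round to the nearest multiple of `base`, never below one full step.
    return max(base, round(value / base) * base)

print(snap_to_multiple(1300), snap_to_multiple(1500))  # 1280 1536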
Choose whatever value you want for Guidance Scale (Prior); the Guidance Scale (Decoder) control is hidden for now because any value other than 0 causes errors and prevents the image from being created.
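For reference, the complete two-stage example, using the same sequential-loading pattern described at the top of this README: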
import torch
from diffusers import StableCascadeDecoderPipeline, StableCascadePriorPipeline
import gc
device = "cuda"
num_images_per_prompt = 1
prior = StableCascadePriorPipeline.from_pretrained("stabilityai/stable-cascade-prior", torch_dtype=torch.bfloat16).to(device)
prior.safety_checker = None
prior.requires_safety_checker = False
prompt = "a cat"
negative_prompt = ""
# Stage 1: the prior turns the prompt into image embeddings.
prior_output = prior(
    prompt=prompt,
    width=1280,
    height=1536,
    negative_prompt=negative_prompt,
    guidance_scale=4.0,
    num_images_per_prompt=num_images_per_prompt,
    num_inference_steps=20,
)
# Free the prior before loading the decoder so both never sit in VRAM together.
del prior
gc.collect()
torch.cuda.empty_cache()

# Stage 2: the decoder turns the image embeddings into the final image.
decoder = StableCascadeDecoderPipeline.from_pretrained("stabilityai/stable-cascade", torch_dtype=torch.float16).to(device)
decoder.safety_checker = None
decoder.requires_safety_checker = False
decoder_output = decoder(
    image_embeddings=prior_output.image_embeddings.half(),
    prompt=prompt,
    negative_prompt=negative_prompt,
    guidance_scale=0.0,  # values other than 0 currently cause errors (see above)
    output_type="pil",
    num_inference_steps=12,
)
decoder_output.images[0].save("image.png")
# Optional: free the decoder as well once the image has been saved.
# del decoder
# gc.collect()
# torch.cuda.empty_cache()
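Note the dtype split in the example: the prior runs in bfloat16 while the decoder runs in float16, which is why the image embeddings are converted with .half() before being handed to the decoder.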
- ko-fi: https://ko-fi.com/shiroppo