Fix for sdxl_gen_img.py unable to load in Google Colab Env and Distributed Inference Version (accel_sdxl_gen_img.py) #1901

Open
wants to merge 222 commits into dev

Conversation

DKnight54
Contributor

For the Colab environment loading issue, the fix was fairly straightforward: honoring --full_fp16 and --full_bf16 when loading the model.
#1887
Basically, it modifies the linked lines.
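A minimal sketch of the dtype-selection logic this implies (the function name and return values are illustrative, not the script's actual code; the real change is in the model-loading call of sdxl_gen_img.py):

```python
# Hedged sketch: pick the weight dtype for loading the SDXL checkpoint based
# on the --full_fp16 / --full_bf16 flags. Loading directly in half precision
# is what avoids the out-of-memory failure on Colab's limited RAM.

def select_load_dtype(full_fp16: bool, full_bf16: bool) -> str:
    """Return the dtype name to use when loading the model weights."""
    if full_fp16 and full_bf16:
        raise ValueError("--full_fp16 and --full_bf16 are mutually exclusive")
    if full_fp16:
        return "float16"   # half the load-time memory of float32
    if full_bf16:
        return "bfloat16"  # same size as float16, wider exponent range
    return "float32"       # default: full precision
```

The key point is simply that the dtype is decided before loading, so the full-precision weights are never materialized in RAM.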

For the distributed version accel_sdxl_gen_img.py, there are some major changes.

  1. Interactive mode may be broken; I was unable to test it without a powerful enough GPU.
  2. Instead of processing batches one at a time as it runs through the loop, the code now does the following:
    a. It now collects all prompts (prompts x number of images per prompt) in a list.
    b. The prompt preparation loop runs only on the main process to prevent messy double printing of prompts, especially when dynamic prompts are used.
    c. The list of prompts is then split into a list of batches of prompts.
    d. The batches are then distributed onto available GPUs.
  3. Similar to the previous distributed sample generation, the image generation order gets mixed up.
    a. Using a similar workaround, global_step is used as the image generation index.
    b. An integer variable global_count is added to the BatchData class.
    c. It is used during the inference to add an index to the filenames.
  4. Something else that is likely broken now is the section that overrides Deep Shrink and Gradual Latent, as it would only be updated in the main process, and I can't think of a way to push it to the subprocesses.
    a. I do have a question about the logic, though: it appears that if the arguments are added to any prompt in a list of prompts, they override and affect all prompts in that batch and in every batch after it.
    b. Is this intended behaviour? Especially since I don't see a way to unset it.
    c. Actually, maybe I do have a way to update all processes: using gather_object() to get a list of unets or pipes and iterating through them to apply the update. It needs testing, but since I don't fully understand Deep Shrink and Gradual Latent, I'm not sure I can evaluate it correctly.
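Steps 2a-2d above can be sketched in plain Python (the function names are illustrative, not the PR's actual helpers; the real code uses Accelerate's process index for the per-GPU split):

```python
# Hedged sketch of the new prompt pipeline: (a) expand prompts by images per
# prompt, (c) chunk the flat list into batches, (d) assign batches to
# processes round-robin so every GPU gets a share.

from typing import List

def expand_prompts(prompts: List[str], images_per_prompt: int) -> List[str]:
    # (a) collect all prompts: prompts x number of images per prompt
    return [p for p in prompts for _ in range(images_per_prompt)]

def make_batches(items: List[str], batch_size: int) -> List[List[str]]:
    # (c) split the flat prompt list into batches of at most batch_size
    return [items[i:i + batch_size] for i in range(0, len(items), batch_size)]

def batches_for_process(batches: List[List[str]],
                        process_index: int,
                        num_processes: int) -> List[List[str]]:
    # (d) each process takes every num_processes-th batch
    return batches[process_index::num_processes]
```

Step (b), running the prompt preparation only on the main process, would wrap the expansion in an `if accelerator.is_main_process:` check before distributing the result.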
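For item 3, the idea of carrying a global index with each batch entry so filenames stay unique despite out-of-order generation looks roughly like this (a simplified stand-in for the script's BatchData; the filename pattern is illustrative):

```python
# Hedged sketch: attach a global generation index to each batch entry and
# embed it in the output filename, so images from different processes never
# collide and still sort in generation order.

from dataclasses import dataclass

@dataclass
class BatchData:
    prompt: str
    global_count: int  # added field: global image generation index

def output_filename(b: BatchData, seed: int) -> str:
    # zero-padded index keeps lexicographic order equal to generation order
    return f"im_{b.global_count:06d}_{seed}.png"
```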
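An alternative to gathering every process's unet for item 4 might be the reverse direction: the main process packages the override values and broadcasts them (with Accelerate this would be something like broadcast_object_list()), and each process applies them to its own pipe. A plain-Python stand-in for just the apply step, with hypothetical key names:

```python
# Hedged sketch: each process applies broadcast Deep Shrink / Gradual Latent
# override values to its local pipe state. The keys ("ds_depth_1", etc.) are
# hypothetical placeholders, not the script's actual attribute names.

def apply_overrides(pipe_state: dict, overrides: dict) -> dict:
    """Return a copy of the local pipe state with the overrides applied."""
    updated = dict(pipe_state)
    updated.update(overrides)
    return updated
```

This sidesteps the one-way update problem, but it doesn't answer the unset question in 4b: a broadcast empty override would leave earlier values in place unless the defaults are re-broadcast too.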

I've also left in some testing and troubleshooting log outputs, as they're pretty useful for seeing what's going on.

I get that distributed inference is probably not critical, but I've found it useful in Jupyter notebook environments with more than one GPU.
