-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
addressing issue #5, SD3 training #11
base: main
Are you sure you want to change the base?
addressing issue #5, SD3 training #11
Conversation
merge branches
issue seems to stem from the load_dataset() func that the SD3 training script uses.
-possible that the issue lies in the process_train() func of the training script. tried passing both image path strings and PIL image objects |
|
debugging output -
|
I suggest you debug directly in this script: https://github.com/huggingface/diffusers/blob/main/examples/controlnet/train_controlnet_sd3.py#L644-L649 Specifically, where marked, you print the directories, and check they exist, and print the dataset, and print the error, etc. If there is an error loading the dataset, but the directory is correct, make sure the dataset is generated correctly in the |
here's the edited function (train_controlnet_sd3.py):
edited function (train_controlnet_sd3.py):
dataset generation check (dataset.py):
dataset generation check (dataset.py) output :(
|
Great. You see that the columns are:
No |
i have also tried that without success -
|
The ['_data_files', '_fingerprint', '_format_columns', '_format_kwargs', '_format_type', '_output_all_columns', '_split'] Maybe, the path to the dataset you are creating and loading is different? Either the path is wrong, or the specific method to load the dataset is wrong. Have you tried |
yes i've tried load_from_disk as well as verified the path and env variables. although not ideal, adding load_from_disk to the training script partially solves the issue of loading the data.
this is odd because the dataset creation already handles 3 channel RGB (here for res. 512). i have also checked
so the channel mismatch issue lies with ControlNet initialization for SD3 specifically. I'm looking into the control for sd3 from diffusers more closely atm |
Alright. It is to be expected that examples on their repo should work directly |
I'm attempting to use the existing training setup with Stable Diffusion 3. Here's the history of errors encountered, please let me know how to proceed with error 4 -
Error 1:
TypeError: argument of type 'PosixPath' is not iterable
Solution 1:
Added str conversion in dataset.py
output_path = str(output_path) # after mkdir
Error 2:
train_controlnet_sd3.py: error: unrecognized arguments: --enable_xformers_memory_efficient_attention
Solution 2:
Removed arg
Error 3:
Solution 3: Modified controlnet_SD3.py to add additional parameters
--dual_attention_layers
and--qk_norm
.Error 4:
P.S. i did try to include only the relevant files for this branch comparison but even with a local reset, file changes from other open PRs still show up in this request.