diff --git a/README.md b/README.md index 234301705..503427a5a 100644 --- a/README.md +++ b/README.md @@ -33,6 +33,7 @@ Summary of the feature: - The image generation during training is now available. However, the VAE for SDXL seems to produce NaNs in some cases when using `fp16`. The images will be black. Currently, the NaNs cannot be avoided even with `--no_half_vae` option. It works with `bf16` or without mixed precision. - `--weighted_captions` option is not supported yet. - `--min_timestep` and `--max_timestep` options are added to each training script. These options can be used to train U-Net with different timesteps. The default values are 0 and 1000. +- `sdxl_gen_img.py` is added. This script can be used to generate images with SDXL, including LoRA. See the help message for the usage. `requirements.txt` is updated to support SDXL training. @@ -47,6 +48,7 @@ Summary of the feature: - The LoRA training can be done with 12GB GPU memory. - `--network_train_unet_only` option is highly recommended for SDXL LoRA. Because SDXL has two text encoders, the result of the training will be unexpected. - PyTorch 2 seems to use slightly less GPU memory than PyTorch 1. +- `--bucket_reso_steps` can be set to 32 instead of the default value 64. Smaller values than 32 will not work for SDXL training. Example of the optimizer settings for Adafactor with the fixed learning rate: ``` @@ -57,6 +59,12 @@ lr_warmup_steps = 100 learning_rate = 4e-7 # SDXL original learning rate ``` +### TODO + +- [ ] Support Textual Inversion training. +- [ ] Support `--weighted_captions` option. +- [ ] Change `--output_config` option to continue the training. + ## About requirements.txt These files do not contain requirements for PyTorch. Because the versions of them depend on your environment. Please install PyTorch at first (see installation guide below.)