
Alpha masked loss #1339

Merged
kohya-ss merged 9 commits into dev from alpha-masked-loss on May 27, 2024
Conversation

kohya-ss (Owner)

original PR #1223

u-haru and others added 3 commits May 19, 2024 19:07
* Add alpha_mask parameter and apply masked loss

* Fix type hint in trim_and_resize_if_required function

* Refactor code to use keyword arguments in train_util.py

* Fix alpha mask flipping logic

* Fix alpha mask initialization

* Fix alpha_mask transformation

* Cache alpha_mask

* Update alpha_masks to be on CPU

* Set flipped_alpha_masks to Null if option disabled

* Check if alpha_mask is None

* Set alpha_mask to None if option disabled

* Add description of alpha_mask option to docs
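
In outline, the alpha_mask option above uses the image's alpha channel as a per-pixel weight on the training loss. A minimal sketch of the idea (the function name and shapes here are illustrative assumptions, not the actual train_util.py code):

import torch
import torch.nn.functional as F

def apply_alpha_masked_loss(loss, alpha_mask):
    # loss: per-pixel loss in latent space, shape (B, C, H, W)
    # alpha_mask: alpha channel scaled to [0, 1], shape (B, H, W)
    mask = alpha_mask.unsqueeze(1).to(loss.dtype)                 # (B, 1, H, W)
    mask = F.interpolate(mask, size=loss.shape[2:], mode="area")  # downscale to the latent resolution
    return loss * mask  # fully transparent pixels contribute nothing to the loss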

recris commented May 19, 2024

Is there a way we can support more than one "channel" in the mask tensor, like we currently have (even though we're only using the R channel)?

Context: I've been experimenting with dynamic masks, where the mask values/strength vary with the timestep. The idea is to make the training process focus on certain things at high timesteps, such as overall shapes and composition, then shift the focus at low timesteps to specific details like faces. To achieve this I use a "high" mask and a "low" mask and interpolate between the two; the high mask is stored in the R channel, the low mask in the G channel.

Here is my experimental apply_masked_loss version:

import torch

def apply_masked_loss(loss, batch, timesteps):
    # mask image channels are in [-1, 1]; convert them to [0, 1]
    mask_image_hi = batch["conditioning_images"].to(dtype=loss.dtype)[:, 0].unsqueeze(1)  # R channel: "high" mask
    mask_image_lo = batch["conditioning_images"].to(dtype=loss.dtype)[:, 1].unsqueeze(1)  # G channel: "low" mask
    mask_image_hi = mask_image_hi / 2 + 0.5
    mask_image_lo = mask_image_lo / 2 + 0.5

    # normalize timesteps to [0, 1] and interpolate between the low and high masks
    timesteps = (timesteps / 1000.0).reshape(timesteps.shape + (1,) * 3)
    mask = torch.lerp(mask_image_lo, mask_image_hi, timesteps)

    # resize to the same size as the loss and apply it as a per-pixel weight
    mask = torch.nn.functional.interpolate(mask, size=loss.shape[2:], mode="area")
    mask = mask.broadcast_to(loss.shape)
    loss = loss * mask
    return loss

I am getting some decent results with this technique. For this to work it would be necessary to have access to RGB channels in this function (or some structure compatible with multiple masks).
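
A hypothetical usage sketch of the function above (batch contents and shapes are made up for illustration):

import torch

loss = torch.rand(4, 4, 64, 64)  # per-pixel loss in latent space, (B, C, H, W)
batch = {"conditioning_images": torch.rand(4, 3, 512, 512) * 2 - 1}  # RGB mask image in [-1, 1]
timesteps = torch.randint(0, 1000, (4,))  # one diffusion timestep per sample

masked = apply_masked_loss(loss, batch, timesteps)
print(masked.shape)  # torch.Size([4, 4, 64, 64])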

kohya-ss (Owner, Author)

Is there a way we can support more than one "channel" in the mask tensor, like we currently have (even though we're only using the R channel)?

Thank you for your suggestion. The code is very interesting. I think it may be extended in the future (e.g., using the B channel for something), so I will add it as an experimental feature.

kohya-ss merged commit 0d96e10 into dev on May 27, 2024
2 checks passed
kohya-ss deleted the alpha-masked-loss branch on May 27, 2024 at 12:41

araleza commented May 28, 2024

Hello - I fetched the latest dev branch and tried passing --alpha_mask as a command-line parameter (I do not have any .toml files). I got this error message:

Traceback (most recent call last):
  File "/home/ara/m.2/Dev/sdxl/sd-scripts/./sdxl_train.py", line 944, in <module>
    train(args)
  File "/home/ara/m.2/Dev/sdxl/sd-scripts/./sdxl_train.py", line 607, in train
    for step, batch in enumerate(train_dataloader):
  File "/home/ara/m.2/Dev/sdxl/sd-scripts/venv/lib/python3.10/site-packages/accelerate/data_loader.py", line 448, in __iter__
    current_batch = next(dataloader_iter)
  File "/home/ara/m.2/Dev/sdxl/sd-scripts/venv/lib/python3.10/site-packages/torch/utils/data/dataloader.py", line 631, in __next__
    data = self._next_data()
  File "/home/ara/m.2/Dev/sdxl/sd-scripts/venv/lib/python3.10/site-packages/torch/utils/data/dataloader.py", line 675, in _next_data
    data = self._dataset_fetcher.fetch(index)  # may raise StopIteration
  File "/home/ara/m.2/Dev/sdxl/sd-scripts/venv/lib/python3.10/site-packages/torch/utils/data/_utils/fetch.py", line 51, in fetch
    data = [self.dataset[idx] for idx in possibly_batched_index]
  File "/home/ara/m.2/Dev/sdxl/sd-scripts/venv/lib/python3.10/site-packages/torch/utils/data/_utils/fetch.py", line 51, in <listcomp>
    data = [self.dataset[idx] for idx in possibly_batched_index]
  File "/home/ara/m.2/Dev/sdxl/sd-scripts/venv/lib/python3.10/site-packages/torch/utils/data/dataset.py", line 335, in __getitem__
    return self.datasets[dataset_idx][sample_idx]
  File "/home/ara/m.2/Dev/sdxl/sd-scripts/library/train_util.py", line 1207, in __getitem__
    alpha_mask = None if image_info.alpha_mask is None else torch.flip(image_info.alpha_mask, [1])
TypeError: flip(): argument 'input' (position 1) must be Tensor, not numpy.ndarray

I am doing fp32 sdxl_train.py training with Huber loss and a batch size of 4, and I'm training te1 and the U-Net. I have a single directory of just PNG images, all with alpha channels, and I'm using multi-line captions.

Did I do something wrong? Do you need more information?

Command line (Linux):

accelerate launch --num_cpu_threads_per_process=2 "./sdxl_train.py" --pretrained_model_name_or_path="/home/ara/Documents/Dev/sdxl/training/earthscape/kohya/dreambooth/earthscape-step00002600.safetensors" --sdpa --enable_bucket --min_bucket_reso=64 --max_bucket_reso=1024 --train_data_dir="/home/ara/Documents/Dev/sdxl/training/earthscape/kohya/img" --resolution="1024,1024" --output_dir="/home/ara/Documents/Dev/sdxl/training/earthscape/kohya/dreambooth" --logging_dir="/home/ara/Documents/Dev/sdxl/training/earthscape/kohya/log" --save_model_as=safetensors --vae="/home/ara/Documents/Dev/sdxl/sdxl_vae.safetensors" --output_name="earthscape" --lr_scheduler_num_cycles="20000" --max_token_length=150 --max_data_loader_n_workers="0" --lr_scheduler="constant_with_warmup" --lr_warmup_steps="100" --max_train_steps="16000" --caption_extension=".txt" --optimizer_type="Adafactor" --optimizer_args scale_parameter=False relative_step=False warmup_init=False --max_data_loader_n_workers="0" --max_token_length=150 --bucket_reso_steps=32 --v_pred_like_loss="0.5" --save_every_n_steps="200" --save_last_n_steps="600" --min_snr_gamma=5 --gradient_checkpointing --xformers --bucket_no_upscale --noise_offset=0.0357 --adaptive_noise_scale=0.00357 --sample_sampler=k_dpm_2 --sample_prompts="/home/ara/Documents/Dev/sdxl/training/earthscape/kohya/dreambooth/sample/prompt.txt" --sample_every_n_steps="50" --fused_backward_pass --cache_latents --loss_type=huber --train_batch_size="4" --train_text_encoder --learning_rate_te1 1e-9 --learning_rate_te2 0 --learning_rate="4e-7" --flip_aug --enable_wildcard --shuffle_caption --alpha_mask


araleza commented May 28, 2024

I think I may have found the fix. The code path taken when latents are not cached to disk does not seem to be complete. If I find these lines (around line 1200 of train_util.py):

                original_size = image_info.latents_original_size
                crop_ltrb = image_info.latents_crop_ltrb  # calc values later if flipped

and add

                if image_info.alpha_mask is not None:
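                    # convert the cached numpy ndarray to a tensor so the later torch.flip() call accepts it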
                    image_info.alpha_mask = torch.FloatTensor(image_info.alpha_mask)

immediately after them, then it starts training again.

I haven't checked that alpha masking works for the flipped versions of the images, because I don't know if I need to "copy to avoid negative stride problem", as the comment says in the .npz code path below the non-npz one.
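
For context, the negative-stride issue mentioned in that comment: np.flip returns a view with negative strides, and torch.from_numpy rejects such arrays, so the array must be copied first. A standalone sketch (hypothetical values, not the train_util.py code):

import numpy as np
import torch

alpha_mask = np.random.rand(64, 64).astype(np.float32)
flipped_view = np.flip(alpha_mask, axis=1)    # a view with negative strides
# torch.from_numpy(flipped_view) would raise a ValueError here
flipped = flipped_view.copy()                 # copy to avoid negative stride problem
flipped_tensor = torch.from_numpy(flipped)    # works on the contiguous copy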

If anyone reading this doesn't want to edit the code before this is fixed, maybe try using --cache_latents_to_disk too, as that goes through a different code path, and I think that code is already correct.

kohya-ss (Owner, Author)

Thank you for reporting. I will look into it soon.

kohya-ss (Owner, Author) commented Jun 2, 2024

The ndarray/Tensor issue is fixed. Sorry for the wait.

KohakuBlueleaf added a commit to KohakuBlueleaf/sd-scripts that referenced this pull request Jul 31, 2024
* Final implementation

* Skip the final 1 step

* fix alpha mask without disk cache closes kohya-ss#1351, ref kohya-ss#1339

* update for corner cases

* Bump crate-ci/typos from 1.19.0 to 1.21.0, fix typos, and updated _typos.toml (Close kohya-ss#1307)

* set static graph flag when DDP ref kohya-ss#1363

* make forward/backward pathes same ref kohya-ss#1363

* update README

* add grad_hook after restore state closes kohya-ss#1344

* fix to work cache_latents/text_encoder_outputs

* show file name if error in load_image ref kohya-ss#1385

---------

Co-authored-by: Kohya S <[email protected]>
Co-authored-by: Kohya S <[email protected]>
Co-authored-by: Yuta Hayashibe <[email protected]>
feffy380 added a commit to feffy380/sd-scripts that referenced this pull request Oct 13, 2024
Squashed commit of the following:

commit 56bb81c
Author: Kohya S <[email protected]>
Date:   Wed Jun 12 21:39:35 2024 +0900

    add grad_hook after restore state closes kohya-ss#1344

commit 22413a5
Merge: 3259928 18d7597
Author: Kohya S <[email protected]>
Date:   Tue Jun 11 19:52:03 2024 +0900

    Merge pull request kohya-ss#1359 from kohya-ss/train_resume_step

    Train resume step

commit 18d7597
Author: Kohya S <[email protected]>
Date:   Tue Jun 11 19:51:30 2024 +0900

    update README

commit 4a44188
Merge: 4dbcef4 3259928
Author: Kohya S <[email protected]>
Date:   Tue Jun 11 19:27:37 2024 +0900

    Merge branch 'dev' into train_resume_step

commit 3259928
Merge: 1a104dc 5bfe5e4
Author: Kohya S <[email protected]>
Date:   Sun Jun 9 19:26:42 2024 +0900

    Merge branch 'dev' of https://github.com/kohya-ss/sd-scripts into dev

commit 1a104dc
Author: Kohya S <[email protected]>
Date:   Sun Jun 9 19:26:36 2024 +0900

    make forward/backward pathes same ref kohya-ss#1363

commit 58fb648
Author: Kohya S <[email protected]>
Date:   Sun Jun 9 19:26:09 2024 +0900

    set static graph flag when DDP ref kohya-ss#1363

commit 5bfe5e4
Merge: e5bab69 4ecbac1
Author: Kohya S <[email protected]>
Date:   Thu Jun 6 21:23:24 2024 +0900

    Merge pull request kohya-ss#1361 from shirayu/update/github_actions/crate-ci/typos-1.21.0

    Bump crate-ci/typos from 1.19.0 to 1.21.0, fix typos, and updated _typos.toml (Close kohya-ss#1307)

commit 4ecbac1
Author: Yuta Hayashibe <[email protected]>
Date:   Wed Jun 5 16:31:44 2024 +0900

    Bump crate-ci/typos from 1.19.0 to 1.21.0, fix typos, and updated _typos.toml (Close kohya-ss#1307)

commit 4dbcef4
Author: Kohya S <[email protected]>
Date:   Tue Jun 4 21:26:55 2024 +0900

    update for corner cases

commit 321e24d
Merge: e5bab69 3eb27ce
Author: Kohya S <[email protected]>
Date:   Tue Jun 4 19:30:11 2024 +0900

    Merge pull request kohya-ss#1353 from KohakuBlueleaf/train_resume_step

    Resume correct step for "resume from state" feature.

commit e5bab69
Author: Kohya S <[email protected]>
Date:   Sun Jun 2 21:11:40 2024 +0900

    fix alpha mask without disk cache closes kohya-ss#1351, ref kohya-ss#1339

commit 3eb27ce
Author: Kohaku-Blueleaf <[email protected]>
Date:   Fri May 31 12:24:15 2024 +0800

    Skip the final 1 step

commit b2363f1
Author: Kohaku-Blueleaf <[email protected]>
Date:   Fri May 31 12:20:20 2024 +0800

    Final implementation

commit 0d96e10
Merge: ffce3b5 fc85496
Author: Kohya S <[email protected]>
Date:   Mon May 27 21:41:16 2024 +0900

    Merge pull request kohya-ss#1339 from kohya-ss/alpha-masked-loss

    Alpha masked loss

commit fc85496
Author: Kohya S <[email protected]>
Date:   Mon May 27 21:25:06 2024 +0900

    update docs for masked loss

commit 2870be9
Merge: 71ad3c0 ffce3b5
Author: Kohya S <[email protected]>
Date:   Mon May 27 21:08:43 2024 +0900

    Merge branch 'dev' into alpha-masked-loss

commit 71ad3c0
Author: Kohya S <[email protected]>
Date:   Mon May 27 21:07:57 2024 +0900

    Update masked_loss_README-ja.md

    add sample images

commit ffce3b5
Merge: fb12b6d d50c1b3
Author: Kohya S <[email protected]>
Date:   Mon May 27 21:00:46 2024 +0900

    Merge pull request kohya-ss#1349 from rockerBOO/patch-4

    Update issue link

commit a4c3155
Author: Kohya S <[email protected]>
Date:   Mon May 27 20:59:40 2024 +0900

    add doc for mask loss

commit 58cadf4
Merge: e8cfd4b fb12b6d
Author: Kohya S <[email protected]>
Date:   Mon May 27 20:02:32 2024 +0900

    Merge branch 'dev' into alpha-masked-loss

commit d50c1b3
Author: Dave Lage <[email protected]>
Date:   Mon May 27 01:11:01 2024 -0400

    Update issue link

commit e8cfd4b
Author: Kohya S <[email protected]>
Date:   Sun May 26 22:01:37 2024 +0900

    fix to work cond mask and alpha mask

commit fb12b6d
Merge: febc5c5 00513b9
Author: Kohya S <[email protected]>
Date:   Sun May 26 19:45:03 2024 +0900

    Merge pull request kohya-ss#1347 from rockerBOO/lora-plus-log-info

    Add LoRA+ LR Ratio info message to logger

commit 00513b9
Author: rockerBOO <[email protected]>
Date:   Thu May 23 22:27:12 2024 -0400

    Add LoRA+ LR Ratio info message to logger

commit da6fea3
Author: Kohya S <[email protected]>
Date:   Sun May 19 21:26:18 2024 +0900

    simplify and update alpha mask to work with various cases

commit f2dd43e
Author: Kohya S <[email protected]>
Date:   Sun May 19 19:23:59 2024 +0900

    revert kwargs to explicit declaration

commit db67529
Author: u-haru <[email protected]>
Date:   Sun May 19 19:07:25 2024 +0900

    Add an option to use the image's alpha channel as the loss mask (kohya-ss#1223)

    * Add alpha_mask parameter and apply masked loss

    * Fix type hint in trim_and_resize_if_required function

    * Refactor code to use keyword arguments in train_util.py

    * Fix alpha mask flipping logic

    * Fix alpha mask initialization

    * Fix alpha_mask transformation

    * Cache alpha_mask

    * Update alpha_masks to be on CPU

    * Set flipped_alpha_masks to Null if option disabled

    * Check if alpha_mask is None

    * Set alpha_mask to None if option disabled

    * Add description of alpha_mask option to docs

commit febc5c5
Author: Kohya S <[email protected]>
Date:   Sun May 19 19:03:43 2024 +0900

    update README

commit 4c79812
Author: Kohya S <[email protected]>
Date:   Sun May 19 19:00:32 2024 +0900

    update README

commit 38e4c60
Merge: e4d9e3c fc37437
Author: Kohya S <[email protected]>
Date:   Sun May 19 18:55:50 2024 +0900

    Merge pull request kohya-ss#1277 from Cauldrath/negative_learning

    Allow negative learning rate

commit e4d9e3c
Author: Kohya S <[email protected]>
Date:   Sun May 19 17:46:07 2024 +0900

    remove dependency for omegaconf #ref 1284

commit de0e0b9
Merge: c68baae 5cb145d
Author: Kohya S <[email protected]>
Date:   Sun May 19 17:39:15 2024 +0900

    Merge pull request kohya-ss#1284 from sdbds/fix_traincontrolnet

    Fix train controlnet

commit c68baae
Author: Kohya S <[email protected]>
Date:   Sun May 19 17:21:04 2024 +0900

    add `--log_config` option to enable/disable output training config

commit 47187f7
Merge: e3ddd1f b886d0a
Author: Kohya S <[email protected]>
Date:   Sun May 19 16:31:33 2024 +0900

    Merge pull request kohya-ss#1285 from ccharest93/main

    Hyperparameter tracking

commit e3ddd1f
Author: Kohya S <[email protected]>
Date:   Sun May 19 16:26:10 2024 +0900

    update README and format code

commit 0640f01
Merge: 2f19175 793aeb9
Author: Kohya S <[email protected]>
Date:   Sun May 19 16:23:01 2024 +0900

    Merge pull request kohya-ss#1322 from aria1th/patch-1

    Accelerate: fix get_trainable_params in controlnet-llite training

commit 2f19175
Author: Kohya S <[email protected]>
Date:   Sun May 19 15:38:37 2024 +0900

    update README

commit 146edce
Author: Kohya S <[email protected]>
Date:   Sat May 18 11:05:04 2024 +0900

    support Diffusers' based SDXL LoRA key for inference

commit 153764a
Author: Kohya S <[email protected]>
Date:   Wed May 15 20:21:49 2024 +0900

    add prompt option '--f' for filename

commit 589c2aa
Author: Kohya S <[email protected]>
Date:   Mon May 13 21:20:37 2024 +0900

    update README

commit 16677da
Author: Kohya S <[email protected]>
Date:   Sun May 12 22:15:07 2024 +0900

    fix create_network_from_weights doesn't work

commit a384bf2
Merge: 1c296f7 8db0cad
Author: Kohya S <[email protected]>
Date:   Sun May 12 21:36:56 2024 +0900

    Merge pull request kohya-ss#1313 from rockerBOO/patch-3

    Add caption_separator to output for subset

commit 1c296f7
Merge: e96a521 dbb7bb2
Author: Kohya S <[email protected]>
Date:   Sun May 12 21:33:12 2024 +0900

    Merge pull request kohya-ss#1312 from rockerBOO/patch-2

    Fix caption_separator missing in subset schema

commit e96a521
Merge: 39b82f2 fdbb03c
Author: Kohya S <[email protected]>
Date:   Sun May 12 21:14:50 2024 +0900

    Merge pull request kohya-ss#1291 from frodo821/patch-1

    removed unnecessary `torch` import on line 115

commit 39b82f2
Author: Kohya S <[email protected]>
Date:   Sun May 12 20:58:45 2024 +0900

    update readme

commit 3701507
Author: Kohya S <[email protected]>
Date:   Sun May 12 20:56:56 2024 +0900

    raise original error if error is occured in checking latents

commit 7802093
Merge: 9ddb4d7 040e26f
Author: Kohya S <[email protected]>
Date:   Sun May 12 20:46:25 2024 +0900

    Merge pull request kohya-ss#1278 from Cauldrath/catch_latent_error_file

    Display name of error latent file

commit 9ddb4d7
Author: Kohya S <[email protected]>
Date:   Sun May 12 17:55:08 2024 +0900

    update readme and help message etc.

commit 8d1b1ac
Merge: 02298e3 64916a3
Author: Kohya S <[email protected]>
Date:   Sun May 12 17:43:44 2024 +0900

    Merge pull request kohya-ss#1266 from Zovjsra/feature/disable-mmap

    Add "--disable_mmap_load_safetensors" parameter

commit 02298e3
Merge: 1ffc0b3 4419041
Author: Kohya S <[email protected]>
Date:   Sun May 12 17:04:58 2024 +0900

    Merge pull request kohya-ss#1331 from kohya-ss/lora-plus

    Lora plus

commit 4419041
Author: Kohya S <[email protected]>
Date:   Sun May 12 17:01:20 2024 +0900

    update docs etc.

commit 3c8193f
Author: Kohya S <[email protected]>
Date:   Sun May 12 17:00:51 2024 +0900

    revert lora+ for lora_fa

commit c6a4370
Merge: e01e148 1ffc0b3
Author: Kohya S <[email protected]>
Date:   Sun May 12 16:18:57 2024 +0900

    Merge branch 'dev' into lora-plus

commit 1ffc0b3
Author: Kohya S <[email protected]>
Date:   Sun May 12 16:18:43 2024 +0900

    fix typo

commit e01e148
Merge: e9f3a62 7983d3d
Author: Kohya S <[email protected]>
Date:   Sun May 12 16:17:52 2024 +0900

    Merge branch 'dev' into lora-plus

commit e9f3a62
Merge: 3fd8cdc c1ba0b4
Author: Kohya S <[email protected]>
Date:   Sun May 12 16:17:27 2024 +0900

    Merge branch 'dev' into lora-plus

commit 7983d3d
Merge: c1ba0b4 bee8cee
Author: Kohya S <[email protected]>
Date:   Sun May 12 15:09:39 2024 +0900

    Merge pull request kohya-ss#1319 from kohya-ss/fused-backward-pass

    Fused backward pass

commit bee8cee
Author: Kohya S <[email protected]>
Date:   Sun May 12 15:08:52 2024 +0900

    update README for fused optimizer

commit f3d2cf2
Author: Kohya S <[email protected]>
Date:   Sun May 12 15:03:02 2024 +0900

    update README for fused optimizer

commit 6dbc23c
Merge: 607e041 c1ba0b4
Author: Kohya S <[email protected]>
Date:   Sun May 12 14:21:56 2024 +0900

    Merge branch 'dev' into fused-backward-pass

commit c1ba0b4
Author: Kohya S <[email protected]>
Date:   Sun May 12 14:21:10 2024 +0900

    update readme

commit 607e041
Author: Kohya S <[email protected]>
Date:   Sun May 12 14:16:41 2024 +0900

    chore: Refactor optimizer group

commit 793aeb9
Author: AngelBottomless <[email protected]>
Date:   Tue May 7 18:21:31 2024 +0900

    fix get_trainable_params in controlnet-llite training

commit b56d5f7
Author: Kohya S <[email protected]>
Date:   Mon May 6 21:35:39 2024 +0900

    add experimental option to fuse params to optimizer groups

commit 017b82e
Author: Kohya S <[email protected]>
Date:   Mon May 6 15:05:42 2024 +0900

    update help message for fused_backward_pass

commit 2a359e0
Merge: 0540c33 4f203ce
Author: Kohya S <[email protected]>
Date:   Mon May 6 15:01:56 2024 +0900

    Merge pull request kohya-ss#1259 from 2kpr/fused_backward_pass

    Adafactor fused backward pass and optimizer step, lowers SDXL (@ 1024 resolution) VRAM usage to BF16(10GB)/FP32(16.4GB)

commit 3fd8cdc
Author: Kohya S <[email protected]>
Date:   Mon May 6 14:03:19 2024 +0900

    fix dylora loraplus

commit 7fe8150
Author: Kohya S <[email protected]>
Date:   Mon May 6 11:09:32 2024 +0900

    update loraplus on dylora/lofa_fa

commit 52e64c6
Author: Kohya S <[email protected]>
Date:   Sat May 4 18:43:52 2024 +0900

    add debug log

commit 58c2d85
Author: Kohya S <[email protected]>
Date:   Fri May 3 22:18:20 2024 +0900

    support block dim/lr for sdxl

commit 8db0cad
Author: Dave Lage <[email protected]>
Date:   Thu May 2 18:08:28 2024 -0400

    Add caption_separator to output for subset

commit dbb7bb2
Author: Dave Lage <[email protected]>
Date:   Thu May 2 17:39:35 2024 -0400

    Fix caption_separator missing in subset schema

commit 969f82a
Author: Kohya S <[email protected]>
Date:   Mon Apr 29 20:04:25 2024 +0900

    move loraplus args from args to network_args, simplify log lr desc

commit 834445a
Merge: 0540c33 68467bd
Author: Kohya S <[email protected]>
Date:   Mon Apr 29 18:05:12 2024 +0900

    Merge pull request kohya-ss#1233 from rockerBOO/lora-plus

    Add LoRA+ support

commit fdbb03c
Author: frodo821 <[email protected]>
Date:   Tue Apr 23 14:29:05 2024 +0900

    removed unnecessary `torch` import on line 115

    as per kohya-ss#1290

commit 040e26f
Author: Cauldrath <[email protected]>
Date:   Sun Apr 21 13:46:31 2024 -0400

    Regenerate failed file
    If a latent file fails to load, print out the path and the error, then return false to regenerate it

commit 5cb145d
Author: 青龍聖者@bdsqlsz <[email protected]>
Date:   Sat Apr 20 21:56:24 2024 +0800

    Update train_util.py

commit b886d0a
Author: Maatra <[email protected]>
Date:   Sat Apr 20 14:36:47 2024 +0100

    Cleaned typing to be in line with accelerate hyperparameters type resctrictions

commit 4477116
Author: 青龍聖者@bdsqlsz <[email protected]>
Date:   Sat Apr 20 21:26:09 2024 +0800

    fix train controlnet

commit 2c9db5d
Author: Maatra <[email protected]>
Date:   Sat Apr 20 14:11:43 2024 +0100

    passing filtered hyperparameters to accelerate

commit fc37437
Author: Cauldrath <[email protected]>
Date:   Thu Apr 18 23:29:01 2024 -0400

    Allow negative learning rate
    This can be used to train away from a group of images you don't want
    As this moves the model away from a point instead of towards it, the change in the model is unbounded
    So, don't set it too low. -4e-7 seemed to work well.

commit feefcf2
Author: Cauldrath <[email protected]>
Date:   Thu Apr 18 23:15:36 2024 -0400

    Display name of error latent file
    When trying to load stored latents, if an error occurs, this change will tell you what file failed to load
    Currently it will just tell you that something failed without telling you which file

commit 64916a3
Author: Zovjsra <[email protected]>
Date:   Tue Apr 16 16:40:08 2024 +0800

    add disable_mmap to args

commit 4f203ce
Author: 2kpr <[email protected]>
Date:   Sun Apr 14 09:56:58 2024 -0500

    Fused backward pass

commit 68467bd
Author: rockerBOO <[email protected]>
Date:   Thu Apr 11 17:33:19 2024 -0400

    Fix unset or invalid LR from making a param_group

commit 75833e8
Author: rockerBOO <[email protected]>
Date:   Mon Apr 8 19:23:02 2024 -0400

    Fix default LR, Add overall LoRA+ ratio, Add log

    `--loraplus_ratio` added for both TE and UNet
    Add log for lora+

commit 1933ab4
Author: rockerBOO <[email protected]>
Date:   Wed Apr 3 12:46:34 2024 -0400

    Fix default_lr being applied

commit c769160
Author: rockerBOO <[email protected]>
Date:   Mon Apr 1 15:43:04 2024 -0400

    Add LoRA-FA for LoRA+

commit f99fe28
Author: rockerBOO <[email protected]>
Date:   Mon Apr 1 15:38:26 2024 -0400

    Add LoRA+ support