Fix bug in stable diffusion when mask_pad_tokens is false #147

coryMosaicML · 2024-05-31T05:57:14Z

Currently, when mask_pad_tokens=False in the stable diffusion model, the mask from the tokenizer is still passed to the text encoder at generation time. This introduces a train/test mismatch: the model is trained on text encoder outputs computed with the pad tokens unmasked, and those outputs differ from the ones the encoder produces when the pad tokens are masked. The result is a degradation in image quality. The fix is to use the mask from the tokenizer output only if mask_pad_tokens=True.
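For illustration, here is a minimal sketch of the selection logic described above. It is not the code from this PR: `select_encoder_mask` and `toy_encode` are hypothetical names, and the toy encoder simply averages the token ids it may attend to, standing in for a real text encoder so the mismatch is visible without any model weights.

```python
from typing import List, Optional, Sequence


def select_encoder_mask(tokenizer_mask: Sequence[int],
                        mask_pad_tokens: bool) -> Optional[List[int]]:
    """Return the mask to hand to the text encoder (hypothetical helper).

    The tokenizer's mask is forwarded only when mask_pad_tokens is True.
    Otherwise None is returned, meaning the encoder attends to every token,
    pad tokens included, which matches how the model was trained.
    """
    return list(tokenizer_mask) if mask_pad_tokens else None


def toy_encode(token_ids: Sequence[int],
               attention_mask: Optional[Sequence[int]] = None) -> float:
    """Toy stand-in for a text encoder: mean of the attended token ids."""
    if attention_mask is None:
        attention_mask = [1] * len(token_ids)
    attended = [t for t, m in zip(token_ids, attention_mask) if m]
    return sum(attended) / len(attended)


# Two real tokens followed by two pad tokens (id 0), as a tokenizer might emit.
token_ids = [5, 7, 0, 0]
tokenizer_mask = [1, 1, 0, 0]

# Training-time behaviour with mask_pad_tokens=False: no mask is applied.
train_output = toy_encode(token_ids)

# Behaviour described as the bug: the mask is applied at generation anyway.
buggy_output = toy_encode(token_ids, tokenizer_mask)

# Behaviour described as the fix: the mask is applied only if mask_pad_tokens=True.
fixed_output = toy_encode(token_ids, select_encoder_mask(tokenizer_mask, False))
```

In this toy setup, `fixed_output` equals `train_output` while `buggy_output` does not, which mirrors the train/test mismatch the description refers to.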

Landanjs

LGTM!

Fix bug in stable diffusion when mask_pad_tokens is false

c69f198

Landanjs approved these changes May 31, 2024

coryMosaicML merged commit 7d3a7cc into mosaicml:main May 31, 2024
5 checks passed

coryMosaicML added a commit to coryMosaicML/diffusion that referenced this pull request Jun 4, 2024

Fix bug in stable diffusion when mask_pad_tokens is false (mosaicml#147)

8962008
