Why are 4 different dataset options generated by default? #1175
-
I used I don't understand the differences between the variants with |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 4 replies
-
this is because models often train better to generalise across aspects / base resolutions when you include them in the training. DiT models like Flux or SD3 actually enter representation collapse rather easily when you go too far from their training sequence lengths (resolutions) crop=false just buckets things by aspect (image shape) and crop=true makes them all squares. training the same images in bucketed and square format helps the model avoid biasing any particular style or quality to a given resolution bucket, which is something that all Unet and DiT models do. |
Beta Was this translation helpful? Give feedback.
yes they are all treated as additional datasets