Can I train without caching VAE & text embeds? #1179

Answered by bghira
cpsxhao asked this question in Q&A

You cannot completely disable caching. Enabling --vae_cache_ondemand will greatly slow down training and increase VRAM requirements, possibly to an uncomfortable degree even on A100 devices at 1024px image area.

But there's also the --compress_disk_cache option, which applies Gzip to the image embeds and reduces their size.
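
To illustrate the general idea of gzip-compressing a cached embed, here is a minimal sketch using only the Python standard library. It is not SimpleTuner's actual implementation: the helper names, the tiny 16x16x16 stand-in "latent", and the 16-bit packing are all assumptions made for the example.

```python
import gzip
import random
import struct


def pack_embed(values):
    """Serialize a flat list of floats as little-endian 16-bit floats."""
    return struct.pack(f"<{len(values)}e", *values)


def compress_embed(raw: bytes) -> bytes:
    """Gzip-compress the serialized embed before writing it to disk."""
    return gzip.compress(raw)


def decompress_embed(blob: bytes) -> bytes:
    """Recover the exact serialized bytes from a gzip-compressed embed."""
    return gzip.decompress(blob)


random.seed(0)
# Hypothetical stand-in for a cached latent: 16 channels x 16 x 16 values.
values = [random.gauss(0.0, 1.0) for _ in range(16 * 16 * 16)]

raw = pack_embed(values)
blob = compress_embed(raw)
restored = decompress_embed(blob)

print(f"raw bytes: {len(raw)}, gzip bytes: {len(blob)}")
print("lossless round trip:", restored == raw)
```

Gzip is lossless, so the embeds are restored byte for byte. How much space is saved depends on the data, so it is worth measuring on a sample of your own cache.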

For 21,000 16-channel VAE embeds at 1024x1024, even without compression, they take up just 11GB. So for 100k you are looking at only 50GB or so of disk space.
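
The figures above can be sanity-checked with a rough back-of-the-envelope calculation. This sketch assumes the VAE downsamples each spatial dimension by 8x and that each latent value is stored in 16 bits (2 bytes). Both are assumptions for illustration, not details confirmed in this thread, and `latent_bytes` is a hypothetical helper.

```python
def latent_bytes(width_px, height_px, channels=16, downsample=8, bytes_per_value=2):
    """Estimate the uncompressed size of one cached VAE latent.

    downsample=8 and bytes_per_value=2 are assumptions (8x spatial
    downsampling, 16-bit storage), not values taken from the thread.
    """
    latent_w = width_px // downsample
    latent_h = height_px // downsample
    return channels * latent_w * latent_h * bytes_per_value


per_embed = latent_bytes(1024, 1024)
print(f"per embed: {per_embed} bytes")

for count in (21_000, 100_000):
    total_gb = per_embed * count / 1e9
    print(f"{count:>7} embeds: {total_gb:.1f} GB")
```

Under these assumptions each embed is 512 KiB, which works out to roughly 11 GB for 21,000 embeds and roughly 52 GB for 100,000, in line with the numbers quoted above.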

The real problem is the text embeds, which require disk cache compression to stay at a reasonable size. Those you can store in a Cloudflare R2 bucket. See DATALOADER.md for how to offload disk embeds.

Answer selected by cpsxhao