Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

training details #78

Open
wren93 opened this issue Nov 29, 2024 · 0 comments
Open

training details #78

wren93 opened this issue Nov 29, 2024 · 0 comments

Comments

@wren93
Copy link

wren93 commented Nov 29, 2024

Hi,

Thanks for the great work. I'm interested in knowing more details on how the model is trained, as there seems to be some inconsistency between the released checkpoint and the details described in the paper. In paper you mentioned that the model is trained on DIV2K, Flickr2K, OST, and the first 10, 000 face images from FFHQ for 500K steps. However, on the huggingface dataset page you released (https://huggingface.co/datasets/yangtao9009/PASD_dataset) there's also DIV8K and Unsplash2K. Are these two datasets also used in training? Also, the released model is "checkpoint-100000". Does that mean the model is trained for 100k steps instead of 500K steps?

Also, I'm wondering how long will it take to train the model on 8 v100 gpus, as mentioned in the paper. Thanks in advance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant