-
Notifications
You must be signed in to change notification settings - Fork 157
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question: How to construct wids-meta.json for SanaWebDataset? #121
Comments
Anyone here? |
We will update the wids-meta json related code soon. |
Thank you so much. My original plan was just to train on a simple text image format like ImgDataset (since that's how my data is stored locally), but I'm stuck with a Webdataset. I had to try various methods without documentation to convert this local data into Webdataset format while still meeting Sana's reading needs. It's very frustrating, Sana is such a good project but so dizzying on the minutiae. |
Is there any problem with ImgDataset which turned you to Webdataset? @Pevernow |
Thanks @Pevernow . If you want, we would appreciate it if you could pull a PR for your effort. Also, we will update the metadata.json construction script later, but these do not conflict, It would be great to have a conversion script to convert the image-text dataset into SanaWebDatasetMS, which we don't have it. |
Could you please tell me how to use non-square images for training in this project or the original project?
The official example only has the ImgDataset type, but according to an issue, this does not support non-square images.
The official also did not provide a dataset example or documentation for SanaWebDatasetMS.
I spent a whole day trying to construct wids_meta.json and the dataset structure by reading the code,
but I got stuck on
Error detail: ".json"
and I couldn't solve it no matter how hard I tried.Can you help me? Thank you very much.
The text was updated successfully, but these errors were encountered: