Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Regarding input image resolution #5

Open
ariharasudhanm opened this issue May 8, 2024 · 0 comments
Open

Regarding input image resolution #5

ariharasudhanm opened this issue May 8, 2024 · 0 comments

Comments

@ariharasudhanm
Copy link

ariharasudhanm commented May 8, 2024

I am trying to train using my own dataset and the input image resolution is 512x512. When I tried to feed the image of dimension( 3,512,512) that is just stacking the grayscaled image to make it 3 dimensions, it throws error like

RuntimeError: Given groups=1, weight of size [32, 1, 3, 3], expected input[1, 3, 256, 256] to have 1 channels, but got 3 channels instead

Which i understand that network expects single dimension like 1,512,512 but this throws transformation errors like random rotate and random flips during the data loader process.

Could you please provide more information about the input image dimension how this to be handled?

Thank you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant