Regarding input image resolution #5

ariharasudhanm · 2024-05-08T14:30:12Z

I am trying to train on my own dataset, where the input image resolution is 512x512. When I feed an image of shape (3, 512, 512), created by stacking the grayscale image three times to get 3 channels, it throws an error like:

RuntimeError: Given groups=1, weight of size [32, 1, 3, 3], expected input[1, 3, 256, 256] to have 1 channels, but got 3 channels instead

I understand that the network expects a single channel, i.e. an input of shape (1, 512, 512), but with that shape the transforms (random rotate and random flip) throw errors during data loading.
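To make the mismatch concrete, here is a minimal sketch, assuming PyTorch and a first layer that matches the weight shape [32, 1, 3, 3] from the error message. `first_conv` is a hypothetical stand-in, not the repository's actual model, and the flip and rotation calls are only illustrative substitutes for whatever transforms the data loader really applies.

```python
import torch
import torch.nn as nn

# Hypothetical first layer, inferred only from the weight shape [32, 1, 3, 3]
# in the error message: 1 input channel, 32 output channels, 3x3 kernel.
first_conv = nn.Conv2d(in_channels=1, out_channels=32, kernel_size=3, padding=1)

# A grayscale image stacked three times along the channel axis: (3, 512, 512).
gray = torch.rand(512, 512)
stacked = gray.unsqueeze(0).repeat(3, 1, 1)

# Feeding the 3-channel batch (1, 3, 512, 512) raises the channel-mismatch error.
try:
    first_conv(stacked.unsqueeze(0))
    mismatch_raised = False
except RuntimeError:
    mismatch_raised = True

# Keeping only one of the identical channels gives (1, 512, 512).
single = stacked[:1]

# Illustrative augmentations: a horizontal flip and a 90-degree rotation over
# the two spatial axes (dims 1 and 2), leaving the channel axis (dim 0) intact.
augmented = torch.rot90(torch.flip(single, dims=[2]), k=1, dims=[1, 2])

# Add the batch axis and run the single-channel input through the layer.
with torch.no_grad():
    out = first_conv(augmented.unsqueeze(0))

print(mismatch_raised, tuple(augmented.shape), tuple(out.shape))
```

In this sketch the single-channel tensor passes through both the augmentations and the convolution, so the open question is how the repository's own transforms expect the channel axis to be laid out.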

Could you please provide more information about the expected input image dimensions and how this should be handled?

Thank you.
