Issue running single gpu training script #229
Replies: 2 comments 4 replies
-
Solved : Issue was caused by python 3.7. Running it with python 3.10 fixed the issue. |
Beta Was this translation helpful? Give feedback.
-
@Shidhanta95 With batch =2, I reckon the problem could be coming from mask_decoder: with But @JunMa11 Sorry for tagging you out of the blue. I have spent hours on this matter with no success. Since you are the contributor of branch 0.1, perhaps you have encounted this problem before? |
Beta Was this translation helpful? Give feedback.
-
Hi, I am new to deep learning so apologies if the question may be very trivial. I am using a modified version of the train_one_gpu script to train the medsam model on a dataset. The first time I run the script I have no issues. But the second time I ran the script without making any changes I got the following error.
"RuntimeError: The size of tensor a (4) must match the size of tensor b (2) at non-singleton dimension 0"
Passing the code and tensor dimensions to chatgpt and asking it to output the tensor sizes shows that there should be no mismatch with the tensor dimensions.
I am clueless as to why it runs the first time and then it doesnt run again. I have attached the screenshots of the first run and the error. If required I can share my script as well.
data:image/s3,"s3://crabby-images/1e2dc/1e2dc3fa6de0a2ddb2e09f7141baa949a9b67a78" alt="medsam single gpu training"
data:image/s3,"s3://crabby-images/2b7f4/2b7f41093ccfddd0911c1d975d04602a007cc23f" alt="medsam single gpu training error (2)"
Beta Was this translation helpful? Give feedback.
All reactions