Error : Training the compressed model in my own local server. #31
Unanswered
kbjsniper
asked this question in
Feedbacks-Model Compressor
Replies: 2 comments 1 reply
-
Beta Was this translation helpful? Give feedback.
1 reply
-
We had a meeting regarding this, but I'm sorry for not reporting it here. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi, Thank you for your great works about Deep learning.
I've explored the NetsPresso through the web.
I've tested the module of compression, after I compressed the model, downloaded that model, and I got the error.
In detail, I compressed the model with ratio 0.5 which the original model was sample_yolov5s_voc that you provide in your web,
I downloaded that model, when I tried to train the compressed model, but I got the error as below.
Traceback (most recent call last):
File "train.py", line 642, in
main(opt)
File "train.py", line 531, in main
train(opt.hyp, opt, device, callbacks)
File "train.py", line 127, in train
csd = ckpt['model'].float().state_dict() # checkpoint state_dict as FP32
TypeError: 'GraphModule' object is not subscriptable
Traceback (most recent call last):
File "train.py", line 642, in
main(opt)
File "train.py", line 531, in main
train(opt.hyp, opt, device, callbacks)
File "train.py", line 127, in train
csd = ckpt['model'].float().state_dict() # checkpoint state_dict as FP32
TypeError: 'GraphModule' object is not subscriptable
YOLOv5s summary: 214 layers, 7105933 parameters, 7105933 gradients, 16.2 GFLOPs
Traceback (most recent call last):
File "train.py", line 642, in
main(opt)
File "train.py", line 531, in main
train(opt.hyp, opt, device, callbacks)
File "train.py", line 127, in train
csd = ckpt['model'].float().state_dict() # checkpoint state_dict as FP32
TypeError: 'GraphModule' object is not subscriptable
Traceback (most recent call last):
File "train.py", line 642, in
main(opt)
File "train.py", line 531, in main
train(opt.hyp, opt, device, callbacks)
File "train.py", line 127, in train
csd = ckpt['model'].float().state_dict() # checkpoint state_dict as FP32
TypeError: 'GraphModule' object is not subscriptable
ERROR:torch.distributed.elastic.multiprocessing.api:failed (exitcode: 1) local_rank: 0 (pid: 17424) of binary: /DATA/bjkang/tfl_py375/bin/python
Traceback (most recent call last):
File "/usr/lib/python3.7/runpy.py", line 193, in _run_module_as_main
"main", mod_spec)
File "/usr/lib/python3.7/runpy.py", line 85, in _run_code
exec(code, run_globals)
File "/DATA/bjkang/tfl_py375/lib/python3.7/site-packages/torch/distributed/launch.py", line 195, in
main()
File "/DATA/bjkang/tfl_py375/lib/python3.7/site-packages/torch/distributed/launch.py", line 191, in main
launch(args)
File "/DATA/bjkang/tfl_py375/lib/python3.7/site-packages/torch/distributed/launch.py", line 176, in launch
run(args)
File "/DATA/bjkang/tfl_py375/lib/python3.7/site-packages/torch/distributed/run.py", line 756, in run
)(*cmd_args)
File "/DATA/bjkang/tfl_py375/lib/python3.7/site-packages/torch/distributed/launcher/api.py", line 132, in call
return launch_agent(self._config, self._entrypoint, list(args))
File "/DATA/bjkang/tfl_py375/lib/python3.7/site-packages/torch/distributed/launcher/api.py", line 248, in launch_agent
failures=result.failures,
torch.distributed.elastic.multiprocessing.errors.ChildFailedError:
Please, help me about this problem.
Thank in advance.
Beta Was this translation helpful? Give feedback.
All reactions