I'm running fine-tuning according to the tutorial with the provided dataset. The fine-tuning seems to have finished successfully, because there is no error in train.txt. However, the checkpoints folder is empty and I can't find the trained model. Could you advise me on how to deal with this?
Following issue #990, I started from the fine-tuning config yml file at /fairchem/configs/omat24/finetune. The changes I made:
- added the dataset section according to the fine-tuning tutorial;
- set load_balancing_on_error: warn_and_no_balance, gpus: 1, and logger: tensorboard.
The command I used to fine-tune is:
! python {fairchem_main()} --mode train --config-yml {yml} --checkpoint {checkpoint_path} --run-dir fine-tuning --identifier ft-oxides --num-gpus 1 > train.txt 2>&1
There are no errors in train.txt, but its contents look quite different from the log shown in the tutorial.
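A minimal sketch for double-checking both points (whether any checkpoint file was written anywhere under the run directory, and whether the log contains an error that is easy to miss by eye). The helper names `find_checkpoints` and `scan_log` are hypothetical and not part of fairchem, and the `*.pt` file pattern and the keyword list are assumptions that may need adjusting:

```python
from pathlib import Path


def find_checkpoints(run_dir, pattern="*.pt"):
    """Return all files matching `pattern` anywhere under `run_dir`, newest first.

    Assumption: checkpoints are saved as *.pt files somewhere below the
    directory passed to --run-dir (possibly in a timestamped subfolder).
    """
    root = Path(run_dir)
    if not root.is_dir():
        return []
    files = [p for p in root.rglob(pattern) if p.is_file()]
    return sorted(files, key=lambda p: p.stat().st_mtime, reverse=True)


def scan_log(log_path, keywords=("Traceback", "Error", "Exception", "Killed")):
    """Return (line_number, text) pairs for log lines containing any keyword.

    Assumption: the keyword list is only a guess at what a failure would
    look like in the redirected output.
    """
    hits = []
    with open(log_path, errors="replace") as fh:
        for lineno, line in enumerate(fh, start=1):
            if any(k in line for k in keywords):
                hits.append((lineno, line.rstrip()))
    return hits


if __name__ == "__main__":
    # "fine-tuning" and "train.txt" are the names used in the command above.
    for ckpt in find_checkpoints("fine-tuning"):
        print(ckpt)
    if Path("train.txt").exists():
        for lineno, text in scan_log("train.txt"):
            print(f"train.txt:{lineno}: {text}")
```

If this prints no checkpoint paths and no suspicious log lines, the run most likely ended before the first checkpoint was written, so the config values that control how long training runs and how often it evaluates and saves would be the next thing to compare against the tutorial.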
Thank you so much for your attention!