Fix benchmarking scripts #1005

devernay · 2022-11-22T18:32:17Z

closes nerfacto overfitting on blender scenes? #1000
see also Poor performance on NeRF Blender Synthetic Data & potential bugs #806
launch_train_blender.sh:
- add -s option to launch a single job per GPU
- add -v option to use tensorboard instead of wandb
- set nerfacto options according to Poor performance on NeRF Blender Synthetic Data & potential bugs #806 (comment)
- use a single timestamp for all training jobs
- last GPU was ignored
- kill all subprocesses when script is terminated
- print the eval script command-line
launch_eval_blender.sh:
- add shebang
- add -s option to launch a single job per GPU
- last GPU was ignored
- kill all subprocesses when script is terminated
update benchmarking doc

- add shebang - add -s option to launch a single job per GPU - last GPU was ignored - kill all subprocesses when script is terminated

- add -s option to launch a single job per GPU - add -v option to use tensorboard instead of wandb - set nerfacto options according to #806 (comment) - use a single timestamp for all training jobs - last GPU was ignored - kill all subprocesses when script is terminated - print the eval script command-line

tancik · 2022-11-22T22:10:22Z

docs/developer_guides/debugging_tools/benchmarking.md

@@ -71,6 +73,7 @@ The flags used in the benchmarking script are defined as follows:
 - `-m`: config name (e.g. `instant-ngp`). This should be the same as what was passed in for -c in the train script.
 - `-o`: base output directory for where all of the benchmarks are stored (e.g. `outputs/`). Corresponds to the `--output-dir` in the base `Config` for training.
 - `-t`: timestamp of benchmark; also the identifier (e.g. `2022-08-10_172517`).
+- `-s`: Launch a single job per GPU.
 - `-g`: specifies the gpus to use and if not specified (no -g flag), will automaticaly search for available gpus.


Is this flag still used?

That's a flag I added. It basically launches one job per GPU, then waits on the first one to be finished before relaunching on that GPU.
In the previous version, all jobs were launched in parallel at script launch (and Ctrl-C didn't kill the jobs). That works fine if you have several GPUs and lots of GPU memory, but not if you have a single 16Gb GPU.

Ahh sorry, I was trying to highlight the -g flag.

well this flag was simply ignored in the previous version of the scripts. It just took the list of remaining arguments as the list of GPUs, so I kept it that way and removed the unused flag. I'll adjust this doc.

tancik

LGTM!

* launch_eval_blender.sh: fix script - add shebang - add -s option to launch a single job per GPU - last GPU was ignored - kill all subprocesses when script is terminated * launch_train_blender.sh: fix script - add -s option to launch a single job per GPU - add -v option to use tensorboard instead of wandb - set nerfacto options according to nerfstudio-project#806 (comment) - use a single timestamp for all training jobs - last GPU was ignored - kill all subprocesses when script is terminated - print the eval script command-line * launch_eval_blender.sh: fix script * Update benchmarking.md * launch_train_blender.sh: add -s to eval command-line * Update launch_train_blender.sh * Update benchmarking.md

devernay added 8 commits November 22, 2022 09:55

launch_eval_blender.sh: fix script

d9f39a3

- add shebang - add -s option to launch a single job per GPU - last GPU was ignored - kill all subprocesses when script is terminated

launch_eval_blender.sh: fix script

24077cd

Update benchmarking.md

d11d76d

launch_train_blender.sh: add -s to eval command-line

1cbd681

Merge branch 'nerfstudio-project:main' into fix-benchmarking-scripts

2e49bf0

Update launch_train_blender.sh

625b37f

Merge branch 'main' into fix-benchmarking-scripts

104068a

devernay marked this pull request as ready for review November 22, 2022 20:39

tancik reviewed Nov 22, 2022

View reviewed changes

devernay and others added 2 commits November 26, 2022 11:53

Update benchmarking.md

f20362a

Merge branch 'main' into fix-benchmarking-scripts

241da61

tancik approved these changes Nov 28, 2022

View reviewed changes

tancik merged commit 8fc7950 into nerfstudio-project:main Nov 28, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix benchmarking scripts #1005

Fix benchmarking scripts #1005

devernay commented Nov 22, 2022 •

edited

Loading

tancik Nov 22, 2022

devernay Nov 22, 2022

tancik Nov 23, 2022

devernay Nov 26, 2022

tancik left a comment

Fix benchmarking scripts #1005

Fix benchmarking scripts #1005

Conversation

devernay commented Nov 22, 2022 • edited Loading

tancik Nov 22, 2022

Choose a reason for hiding this comment

devernay Nov 22, 2022

Choose a reason for hiding this comment

tancik Nov 23, 2022

Choose a reason for hiding this comment

devernay Nov 26, 2022

Choose a reason for hiding this comment

tancik left a comment

Choose a reason for hiding this comment

devernay commented Nov 22, 2022 •

edited

Loading