upgrade nerfacc to 0.3.x with CUB accel #1187

Merged
merged 1 commit into from
Dec 30, 2022

liruilong940607
Contributor

Plot legend: white is nerfacc 0.3.x, blue is nerfacc 0.2.x.

nerfacc 0.3.x uses CUB to accelerate the accumulation (both forward and backward passes) when available (CUDA toolkit >= 11.5). Test rays/sec got a ~10% speedup. For some reason, training is slightly slower with nerfacc 0.3.x, which needs further investigation.

Note that the CUB acceleration is more noticeable when there are fewer samples per ray, so the gap in test rays/sec grows as training converges. The cause of the slowdown during training is still unclear.
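
For readers unfamiliar with the "accumulation" step, here is a minimal pure-Python sketch of it, assuming it is the usual volume-rendering weight computation over rays packed back to back in flat arrays. The function name `render_weights_packed` and the `ray_counts` layout are hypothetical and are not nerfacc's API. The running sum that resets at each ray boundary is the kind of per-ray scan that a GPU scan primitive can parallelize.

```python
import math


def render_weights_packed(sigmas, deltas, ray_counts):
    """Per-sample rendering weights for rays stored back to back in flat lists.

    sigmas[i]     : density of sample i
    deltas[i]     : length of the interval covered by sample i
    ray_counts[r] : number of samples on ray r (hypothetical packed layout)
    """
    assert len(sigmas) == len(deltas) == sum(ray_counts)
    weights = []
    start = 0
    for count in ray_counts:
        optical_depth = 0.0  # exclusive running sum, reset at each ray start
        for i in range(start, start + count):
            tau = sigmas[i] * deltas[i]
            transmittance = math.exp(-optical_depth)  # light surviving so far
            weights.append(transmittance * (1.0 - math.exp(-tau)))
            optical_depth += tau
        start += count
    return weights


# Two rays: the first has 2 samples, the second has 1.
weights = render_weights_packed([1.0, 2.0, 0.5], [0.1, 0.1, 0.2], [2, 1])
```

The weights on each ray sum to one minus the transmittance at the end of that ray, so they never exceed 1. This serial loop only illustrates the arithmetic; it says nothing about how the CUDA kernels are organized.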

Screen Shot 2022-12-30 at 11 34 16 PM

Screen Shot 2022-12-30 at 11 34 34 PM

@liruilong940607
Contributor Author

The quality is numerically the same as before.

Contributor

@tancik left a comment

LGTM, thanks for updating!

This was linked to issues Dec 30, 2022
@liruilong940607 merged commit 028df82 into main Dec 30, 2022
@liruilong940607 deleted the nerfacc_cub branch December 30, 2022 23:28
tancik pushed a commit to dozeri83/nerfstudio that referenced this pull request Jan 20, 2023
lucasthahn pushed a commit to tne-ai/nerfstudio that referenced this pull request Jan 26, 2023
chris838 pushed a commit to chris838/nerfstudio that referenced this pull request Apr 22, 2023
Development

Successfully merging this pull request may close these issues.

ns-export Consider newer nerfacc (supposedly faster)
2 participants