Add camera pose and projection gradient flow #123

jh-surh · 2024-02-08T00:29:36Z

Added gradient backward flow to project_gaussians_backward_kernel for viewmat and projmat
Added a test script examples/test_pose_grad.py to test pose gradient update. The script interface is the same as simple_trainer.py:

python examples/test_pose_grad.py
python examples/test_pose_grad.py --img_path path/to/image

Validated on nerfstudio Splatfacto in this PR: Add pose optimization to Splatfacto nerfstudio#2885

ichsan2895 · 2024-02-08T14:47:02Z

python examples/test_pose_grad.py --img_path path/to/image needs pytorch3D. Maybe we add the requirement to setup.py?

jh-surh · 2024-02-08T15:58:19Z

@ichsan2895,
Good idea, but I think it may be better to add it to examples/requirements.txt

kerrj · 2024-02-08T19:06:24Z

examples/requirements.txt

+Pillow
+pytorch3d @ git+https://github.com/facebookresearch/[email protected]


It seems like only the tests rely on p3d, it would be nice to remove as a dependency since it's a pretty big one and if we only need it for matrix transforms there's more common lightweight options like scipy.

Unfortunately I needed the matrix transforms to be differentiable so that the gradients can flow into the Euler angle and position parameters. I tried implementing my own transformations, but ran into some instability in training. I have been using se3_exp_map and se3_log_map, but if you have any replacements I am open to try them.

kerrj · 2024-02-08T19:20:19Z

It would be good to compare these camera grads to the camera grads from the torch implementation with autograd, this PR adds a torch implementation which should have correct (although slow) gradients since they're computed from torch.

oseiskar · 2024-02-09T14:23:14Z

Also this would be simpler without the redundant projmat (see #97). For camera calibration optimization, you would very likely want to optimize fx, fy, cx, cy (and possibly other distortion parameters if support to such are added) and not the redudant OpenGL/NDC-style projection matrix.

oseiskar · 2024-02-09T17:47:53Z

There's also a simpler way of achieving the same goal of optimizing the view matrix, shown here: #127

jh-surh · 2024-02-14T15:23:36Z

Also this would be simpler without the redundant projmat (see #97). For camera calibration optimization, you would very likely want to optimize fx, fy, cx, cy (and possibly other distortion parameters if support to such are added) and not the redudant OpenGL/NDC-style projection matrix.

I'm not certain if removing the projmat is a good idea. I think it may add some nice functionality in choosing the type of projection you want during the gaussian projection step.
What do you guys think? @ichsan2895 @kerrj

oseiskar · 2024-02-18T16:55:34Z

I'm not certain if removing the projmat is a good idea. I think it may add some nice functionality in choosing the type of projection you want during the gaussian projection step. What do you guys think? @ichsan2895 @kerrj

@jh-surh I elaborated this here #97 (comment). Everything that can be implemented with projection matrices can also be implemented by extending the "intrinsics" to support other camera models than the ideal pinhole (fx,fy,cx,cy). It's also relatively simple to convert any typical (non-ortho) OpenGL projection matrix to (fx,fy,cx,cy) format (and throw away the near & far clip terms which do not matter in this context)

jh-surh · 2024-02-19T14:17:08Z

I'm not certain if removing the projmat is a good idea. I think it may add some nice functionality in choosing the type of projection you want during the gaussian projection step. What do you guys think? @ichsan2895 @kerrj

@jh-surh I elaborated this here #97 (comment). Everything that can be implemented with projection matrices can also be implemented by extending the "intrinsics" to support other camera models than the ideal pinhole (fx,fy,cx,cy). It's also relatively simple to convert any typical (non-ortho) OpenGL projection matrix to (fx,fy,cx,cy) format (and throw away the near & far clip terms which do not matter in this context)

Would it require work "extending the intrinsics to support other camera models" in terms of calculating gradients? If so, it seems like something we can do in a separate PR since the current nerfstudio implementation uses projmat for its own projection matrix.

It would be good to compare these camera grads to the camera grads from the torch implementation with autograd, this PR adds a torch implementation which should have correct (although slow) gradients since they're computed from torch.

Thank you for your suggestion! I found a bug trying to compare the gradient w.r.t. the torch implementation. I have fixed it and updated the test script and have passed the project gaussian test. Seems like the current main branch is failing 3 of the test scripts though. Someone should look at that.

oseiskar · 2024-02-19T16:34:26Z

Would it require work "extending the intrinsics to support other camera models" in terms of calculating gradients? If so, it seems like something we can do in a separate PR since the current nerfstudio implementation uses projmat for its own projection matrix.

Yes, and I agree that such stuff should definitely be implemented in some other PR.

However, the projection matrix is fed to gsplat is not needed by Nersftudio for any other purpose, as demonstrated here SpectacularAI/nerfstudio@bd23489 (works fine with #97).

My argument is that the more stuff is built on top of the current API with separate and redundant "projmat" + fx,fy,cx,cy, the more difficult it becomes to simplify it. I now remembered/realized the situation with the redundant API is worse than I described earlier: "projmat" is actually not a "projection matrix" in any standard sense but a model-view-projection matrix, adding another layer of confusion).

This complications caused by this API are clearly visible in this PR: why you need to compute the "projection matrix" gradient for pose optimization in the first place is because the parameter known as projmat is set to projmat @ viewmat in Nerfstudio, and this term depends on viewmat, which is the thing you actually want to modify in pose optimization.

Without the @ viewmat part, projection matrix gradients would only be needed for camera instrinsics optimization, but for this purpose, you would also need to add gradients w.r.t. fx, fy, cx, cy to work with the current API. Without "projmat", you would only need the latter.

jh-surh · 2024-02-21T05:52:17Z

My argument is that the more stuff is built on top of the current API with separate and redundant "projmat" + fx,fy,cx,cy, the more difficult it becomes to simplify it. I now remembered/realized the situation with the redundant API is worse than I described earlier: "projmat" is actually not a "projection matrix" in any standard sense but a model-view-projection matrix, adding another layer of confusion).

This complications caused by this API are clearly visible in this PR: why you need to compute the "projection matrix" gradient for pose optimization in the first place is because the parameter known as projmat is set to projmat @ viewmat in Nerfstudio, and this term depends on viewmat, which is the thing you actually want to modify in pose optimization.

Without the @ viewmat part, projection matrix gradients would only be needed for camera instrinsics optimization, but for this purpose, you would also need to add gradients w.r.t. fx, fy, cx, cy to work with the current API. Without "projmat", you would only need the latter.

I gotta say this makes a lot of sense. Now that I think of it, this would cause the gradient for projmat to flow to viewmat, which I'm not sure is the right move.

kerrj · 2024-03-20T19:29:01Z

I'm thinking of merging #127 because of how much simpler it is and it seems to give good results, @jh-surh thoughts?

maturk · 2024-03-20T19:32:29Z

I'm thinking of merging #127 because of how much simpler it is and it seems to give good results, @jh-surh thoughts?

I agree. Both are equivalent to the best of my understanding.

jh-surh · 2024-03-21T04:40:27Z

@kerrj @maturk @oseiskar I left a comment on #127.
TL;DR large difference between pure pytorch gradient and approximated v_viewmat.

kerrj · 2024-06-27T04:51:23Z

Closing this since the 1.0 version includes more exact gradients I believe

jh-surh mentioned this pull request Feb 8, 2024

Add pose optimization to Splatfacto nerfstudio-project/nerfstudio#2885

Closed

kerrj requested a review from vye16 February 8, 2024 19:01

kerrj reviewed Feb 8, 2024

View reviewed changes

oseiskar mentioned this pull request Feb 9, 2024

Approximate view matrix gradient for pose optimization #127

Merged

oseiskar mentioned this pull request Feb 10, 2024

Camera pose optimization for Splatfacto nerfstudio-project/nerfstudio#2891

Merged

jh-surh added 4 commits February 19, 2024 00:06

Add gradient calculation for viewmat and projmat

a3e2a67

Add test script for pose gradient propagation

10163f1

Update v_viewmat and v_projmat to mirror their respective input shapes

60ce334

Update requirements.txt to add Pytorch3D for the test script

d4fd07a

jh-surh force-pushed the jhsurh/add-pose-grad branch from 7540e37 to d4fd07a Compare February 18, 2024 15:07

jh-surh added 2 commits February 19, 2024 00:13

Fix typo

4c8f835

Update to account for compensation term in _ProjectGaussians

fa33fe8

jh-surh added 2 commits February 19, 2024 22:23

Fix project gaussian backward wrt projmat

409ee72

Update test

aa950a7

jh-surh requested a review from kerrj February 21, 2024 05:48

oseiskar mentioned this pull request Feb 21, 2024

Remove redundant projection matrix #97

Merged

kerrj closed this Jun 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add camera pose and projection gradient flow #123

Add camera pose and projection gradient flow #123

jh-surh commented Feb 8, 2024 •

edited

Loading

ichsan2895 commented Feb 8, 2024 •

edited

Loading

jh-surh commented Feb 8, 2024

kerrj Feb 8, 2024

jh-surh Feb 14, 2024

kerrj commented Feb 8, 2024

oseiskar commented Feb 9, 2024

oseiskar commented Feb 9, 2024 •

edited

Loading

jh-surh commented Feb 14, 2024

oseiskar commented Feb 18, 2024

jh-surh commented Feb 19, 2024 •

edited

Loading

oseiskar commented Feb 19, 2024

jh-surh commented Feb 21, 2024

kerrj commented Mar 20, 2024

maturk commented Mar 20, 2024

jh-surh commented Mar 21, 2024

kerrj commented Jun 27, 2024

		Pillow
		pytorch3d @ git+https://github.com/facebookresearch/[email protected]

Add camera pose and projection gradient flow #123

Add camera pose and projection gradient flow #123

Conversation

jh-surh commented Feb 8, 2024 • edited Loading

ichsan2895 commented Feb 8, 2024 • edited Loading

jh-surh commented Feb 8, 2024

kerrj Feb 8, 2024

Choose a reason for hiding this comment

jh-surh Feb 14, 2024

Choose a reason for hiding this comment

kerrj commented Feb 8, 2024

oseiskar commented Feb 9, 2024

oseiskar commented Feb 9, 2024 • edited Loading

jh-surh commented Feb 14, 2024

oseiskar commented Feb 18, 2024

jh-surh commented Feb 19, 2024 • edited Loading

oseiskar commented Feb 19, 2024

jh-surh commented Feb 21, 2024

kerrj commented Mar 20, 2024

maturk commented Mar 20, 2024

jh-surh commented Mar 21, 2024

kerrj commented Jun 27, 2024

jh-surh commented Feb 8, 2024 •

edited

Loading

ichsan2895 commented Feb 8, 2024 •

edited

Loading

oseiskar commented Feb 9, 2024 •

edited

Loading

jh-surh commented Feb 19, 2024 •

edited

Loading