Support splat export in original dataset coordinates #2951

brentyi · 2024-02-23T07:59:43Z

The .ply export for Gaussians previously defaulted to saving in Nerfstudio's auto-oriented / auto-scaled coordinate frame.

I added support for exporting this in the original dataset coordinate frame, which the point cloud export supports via a "Save in world frame" checkbox.

I matched this in both the GUI and the export script CLI:

To me it makes the most sense to default this to True (I also flipped this for the point cloud export), but open to thoughts!

cc @jb-ye, #2909

kerrj

seems good

jb-ye · 2024-02-23T19:09:56Z

I don't think the option would work actually for spherical harmonic parameters. The spherical harmonics are saved in the transformed frame and cannot be "re-orient" back to the original coordinates. (We can apply translation and scale, but not rotations, otherwise SH becomes inconsistent.)

My recommendation is to save the transforms as meta data as part of exporter.

brentyi · 2024-02-24T07:14:50Z

I don't think the option would work actually for spherical harmonic parameters. The spherical harmonics are saved in the transformed frame and cannot be "re-orient" back to the original coordinates. (We can apply translation and scale, but not rotations, otherwise SH becomes inconsistent.)

My recommendation is to save the transforms as meta data as part of exporter.

Agree the current implementation is flawed, will revisit this after ECCV deadline!

See comments about SH parameters

…dio-project/nerfstudio into brent/splat_export_world_frame

pwais

@jb-ye Many viewers ignore spherical harmonics, and in my experience doing a rigid transform on splats looks plenty fine. I think ideally Nerfstudio just does a scale and re-center internally as part of the model rather than changing any of the camera poses, initial points etc that are part of the dataset. It should be up to the model to normalize as needed, not up to the user.

pwais · 2024-03-15T06:13:51Z

nerfstudio/data/datamanagers/full_images_datamanager.py

                )
            )

        CONSOLE.log("Caching / undistorting eval images")
-        with ThreadPoolExecutor() as executor:
+        with ThreadPoolExecutor(max_workers=2) as executor:


please make this configurable? or maybe this is just for debugging

I'm actually not sure why this shows up in this diff, the change is from #2969. it speeds up undistortion a lot for big datasets I've been toying with!

it's hardcoded to 2 because we can really only expect benefits from one thread doing IO while the other thread is doing undistortion; the implementation is still weird given this (ideally we'd just have 1 worker doing IO while the main one is sequentially undistorting) but I'm a fan of not letting perfect be the enemy of... better 🤷

FWIW each worker might need cv2.setNumThreads(1) or else it can choke the CPU. I seem to see way more than 200% util here hence why i brought it up so maybe it's just not tuned well for all users.

maybe in a future refactor this stuff will just get pushed to a torch dataloader... the pinned memory part breaks for me for large datasets anyways, literally I got a OOM and hard lock-up because too too too much much much pinned memory

pwais · 2024-03-15T06:17:28Z

nerfstudio/scripts/exporter.py

+
+                output_scale = 1 / dataparser_scale
+                output_transform = np.zeros((3, 4))
+                output_transform[:3, :3] = dataparser_transform[:3, :3].T


pretty please don't do transform math w/out at least comments, this sort of code is 110% likely to put a future reader in transform hell

also pretty please use pipeline.datamanager.train_dataparser_outputs.transform_poses_to_original_space() because
(1) that's what's used elsewhere in this file
(2) using that function ensures future refactors won't break things, and most past nerfstudio refactors have indeed broken lots of things

pwais · 2024-03-15T06:18:58Z

nerfstudio/scripts/exporter.py

+                np.einsum("ij,bj->bi", output_transform[:3, :3], model.means.cpu().numpy() * output_scale)
+                + output_transform[None, :3, 3]


please don't do this, ESPECIALLY w/out comments. I have lost a lot of time reading nerfstudio code that's like this. instead consider:

positions = model.means.cpu().numpy() poses = np.eye(4, dtype=np.float32)[None, ...].repeat(positions.shape[0], axis=0)[:, :3, :] poses[:, :3, 3] = positions poses = pipeline.datamanager.train_dataparser_outputs.transform_poses_to_original_space( torch.from_numpy(poses) )

pwais · 2024-03-15T06:24:58Z

nerfstudio/scripts/exporter.py

            for i in range(3):
                map_to_tensors[f"scale_{i}"] = scales[:, i, None]

-            quats = model.quats.data.cpu().numpy()
+            def quaternion_multiply(wxyz0: np.ndarray, wxyz1: np.ndarray) -> np.ndarray:


??? First of all, scipy.spatial.transform already has quaternion multiply... at least this code is clear about scalar-first versus scalar-last.

this could be made more concise, but consider instead:

from scipy.spatial.transform import Rotation as ScR # ns gplat says quaternions are [w,x,y,z] scalar-first format # scipy is [x, y, z, w] scalar-last format raw_quats = model.quats.data.cpu().numpy().squeeze() R_quats = ScR.from_quat(raw_quats[:, [1, 2, 3, 0]]) # apply the inverse dataparser transform to the splat rotations poses = np.eye(4, dtype=np.float32)[None, ...].repeat(raw_quats.shape[0], axis=0)[:, :3, :] poses[:, :3, :3] = R_quats.as_matrix() poses = pipeline.datamanager.train_dataparser_outputs.transform_poses_to_original_space( torch.from_numpy(poses) ) rots_in_input = poses[:, :3, :3].numpy() quat_in_input = ScR.from_matrix(rots_in_input) quats = quat_in_input.as_quat()[:, [3, 0, 1, 2], None]

Again, this uses transform_poses_to_original_space(), which might amalgamate several different transforms and scales, who knows? Instead of trying to re-derive the transform as the current PR does. And hopefully transform_poses_to_original_space() gets maintained. But it's really really important to be clear about frames etc, and transform_poses_to_original_space() helps with that a ton.

(note for when we revive this PR, which is planned) for the quaternion multiply if we don't want to deal with the xyzw/wxyz conversion of scipy we can also use (vtf.SO3(wxyz0) @ vtf.SO3(wxyz1)).wxyz with import viser.transforms as vtf where viser>=0.1.30

voicing a preference for the use of standard scipy / numpy / torch wherever possible

yes it's unfortunate that there are different quaternion encodings, different camera conventions, different euler angle conventions ....

brentyi · 2024-03-15T07:32:49Z

I think ideally Nerfstudio just does a scale and re-center internally as part of the model rather than changing any of the camera poses, initial points etc that are part of the dataset.

Tough to overcome the inertia here but I agree that this would solve a lot of problems!

pwais · 2024-03-15T08:14:18Z

Tough to overcome the inertia here but I agree that this would solve a lot of problems!

Yes inertia but at least the PR discussions leave breadcrumbs. Things that fall off the rolling katamari ball become seeds for the next re-write.

jkulhanek · 2024-03-15T09:43:03Z

Wait, spherical harmonics can easily be rotated by using Wigner matrices, right?

jb-ye · 2024-03-15T18:10:54Z

@jkulhanek Could you share a gist of sample code of rotating spherical harmonics? I am not aware of Wigner matrices.

jkulhanek · 2024-03-15T18:25:03Z

Just a note here. Since SH is a complete basis, the rotation is possible exactly. Here is the code:
https://github.com/jkulhanek/nerfbaselines/blob/develop/nerfbaselines/math_utils.py

The code is heavy, and I don't understand it fully, but I played with it in a notebook, and it seems to rotate the SH correctly. I can also share the notebook if you want.

It's the recursive impl (whatever it means) which is supposed to be more stable (for higher order). The good thing about it is that the wigner matrix can be computed once and then applied to all SHs at once.

Frenchman997 · 2024-05-06T15:25:17Z

Is there any update on this? I would also be interested in the ability to export splats in their original world frame. I haven't looked into it yet at all but would be willing to contribute.

hardikdava · 2024-05-08T10:20:35Z

@jkulhanek can you provide the notebook? Thanks

hongsukchoi · 2024-09-04T19:05:43Z

Spherical harmonics functions are equivariant to SO(3) and can be rotated.

I don't know the exact code to do it, but probably possible with e3nn api:

From ChatGPT

import torch
from e3nn.o3 import Irrep, Irreps
from e3nn.o3 import spherical_harmonics, Rotation

R = torch.tensor([ [0.36, 0.48, -0.8], [-0.8, 0.60, 0], [0.48, 0.64, 0.6] ])

l = 2  # Degree of spherical harmonics

directions = torch.tensor([[0.0, 0.0, 1.0]]) # z-axis unit vector 

Y_lm = spherical_harmonics(l, directions)

# Get the irreducible representation (Irrep) of the spherical harmonics of degree 
l irrep = Irrep(f"{l}e") # e denotes even parity 

# Apply the rotation to the spherical harmonics 
Y_lm_rotated = irrep.D_from_matrix(R) @ Y_lm

Explanation:

spherical_harmonics(l, directions): Computes the spherical harmonics of degree l for the given directions.
Irrep(f"{l}e"): Creates an irreducible representation for the spherical harmonics of degree l. The "e" stands for even parity, which is typical for standard spherical harmonics.
D_from_matrix(R): Computes the Wigner D-matrix for the rotation matrix R, which describes how the spherical harmonics transform under rotation.
@: Performs matrix multiplication between the Wigner D-matrix and the original spherical harmonics to obtain the rotated spherical harmonics.

…brent/splat_export_world_frame

…o-project/nerfstudio into brent/splat_export_world_frame

…brent/splat_export_world_frame

Ben-Mack · 2025-01-13T06:53:28Z

Is Playcanvas related code for SH rotation a good fit to unblock this PR? https://github.com/playcanvas/engine/blob/release-1.69/src/scene/gsplat/gsplat.js#L259

Also, how can we get the matrix that represent the output coordinate? (Or the diff between original and output coordinate)

pwais · 2025-01-17T02:54:00Z

The line you cite seems to just load splats? rather than rotate the SHs given an arbitrary transform.

Also, how can we get the matrix that represent the output coordinate? (Or the diff between original and output coordinate)

In nerfstudio there's a dataparser_transform that gets stored in the dataparser outputs / model metadata files. AFAIK there's no "splat file format" that normally contains this transform. The code in this change uses that transform to put the trained splats back into the training frame (right now the PR just has translations and rotations tho I believe, yeah).

jkulhanek · 2025-01-17T07:42:23Z

You can use this code https://gist.github.com/jkulhanek/aae3ec12d779ffc729c72157315df0da

Ben-Mack · 2025-01-17T12:21:56Z

Very helpful explanation, thank you!

The line you cite seems to just load splats? rather than rotate the SHs given an arbitrary transform.

Sorry, this is the correct link, which they mentioned Rotate spherical harmonics up to band 3 based on https://github.com/andrewwillmott/sh-lib:
https://github.com/playcanvas/supersplat/blob/b008eca2bd28d2aae7d2defd67363ae6c4f7174b/src/sh-utils.ts#L42

pwais · 2025-01-17T19:28:48Z

on first blush it's hard to compare the cited PlayCanvas impl vs @jkulhanek 's link (PlayCanvas looks possibly like an algebraic simplification somehow?).

But it would be nice for any final solution to have some unit tests. For example, if dataparser_transform breaks somehow, want to easily determine it's that instead of something in complex sh rotations.

…rame

Export splats in "world frame" by default

a24faf6

brentyi requested a review from kerrj February 23, 2024 07:59

Remove old comment

cc999fc

brentyi changed the title ~~Export splats in "world frame" by default~~ Support splat export in original dataset coordinates Feb 23, 2024

ruff

b86d7e1

kerrj previously approved these changes Feb 23, 2024

View reviewed changes

Fix progress bar for full images datamanager

f9ae770

brentyi marked this pull request as draft February 28, 2024 13:09

Merge branch 'brent/fix_undistort_progress_bar' of github.com:nerfstu…

55639be

…dio-project/nerfstudio into brent/splat_export_world_frame

jb-ye mentioned this pull request Mar 15, 2024

Why is the world coord system rotated in colmap_utils.py? #1504

Open

pwais reviewed Mar 15, 2024

View reviewed changes

brentyi mentioned this pull request Aug 28, 2024

Update exporter.py to export sh_degree 0 case #3371 #3374

Merged

brentyi added 4 commits September 10, 2024 17:37

Merge branch 'main' of github.com:nerfstudio-project/nerfstudio into …

8cdd470

…brent/splat_export_world_frame

Merge branch 'brent/splat_export_world_frame' of github.com:nerfstudi…

1da15f9

…o-project/nerfstudio into brent/splat_export_world_frame

revert datamanager

696a2a6

Merge branch 'main' of github.com:nerfstudio-project/nerfstudio into …

3921dc2

…brent/splat_export_world_frame

aayushg55 and others added 6 commits January 18, 2025 20:15

SH rotation, cleanup

cd0ff1d

Merge remote-tracking branch 'origin' into brent/splat_export_world_f…

2786a57

…rame

Complete merge

0c3a4cd

Fixed e3nn change of basis

7777fd7

changed output rotation to R from RT

43abb0f

Account for nerfacto vs splatfacto spherical harmonics differences

69d65b5

brentyi force-pushed the brent/splat_export_world_frame branch from a739636 to 69d65b5 Compare January 21, 2025 02:33

Remove unused code

00b942d

brentyi mentioned this pull request Jan 21, 2025

Fix SH #3010

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support splat export in original dataset coordinates #2951

Support splat export in original dataset coordinates #2951

brentyi commented Feb 23, 2024

kerrj left a comment

jb-ye commented Feb 23, 2024 •

edited

Loading

brentyi commented Feb 24, 2024

pwais left a comment

pwais Mar 15, 2024

brentyi Mar 15, 2024

pwais Mar 15, 2024

pwais Mar 15, 2024

pwais Mar 15, 2024 •

edited

Loading

pwais Mar 15, 2024 •

edited

Loading

brentyi May 28, 2024

pwais May 28, 2024

brentyi commented Mar 15, 2024

pwais commented Mar 15, 2024

jkulhanek commented Mar 15, 2024

jb-ye commented Mar 15, 2024

jkulhanek commented Mar 15, 2024 •

edited

Loading

Frenchman997 commented May 6, 2024

hardikdava commented May 8, 2024

hongsukchoi commented Sep 4, 2024

Ben-Mack commented Jan 13, 2025 •

edited

Loading

pwais commented Jan 17, 2025

jkulhanek commented Jan 17, 2025

Ben-Mack commented Jan 17, 2025 •

edited

Loading

pwais commented Jan 17, 2025

		np.einsum("ij,bj->bi", output_transform[:3, :3], model.means.cpu().numpy() * output_scale)
		+ output_transform[None, :3, 3]

Support splat export in original dataset coordinates #2951

Are you sure you want to change the base?

Support splat export in original dataset coordinates #2951

Conversation

brentyi commented Feb 23, 2024

kerrj left a comment

Choose a reason for hiding this comment

jb-ye commented Feb 23, 2024 • edited Loading

brentyi commented Feb 24, 2024

pwais left a comment

Choose a reason for hiding this comment

pwais Mar 15, 2024

Choose a reason for hiding this comment

brentyi Mar 15, 2024

Choose a reason for hiding this comment

pwais Mar 15, 2024

Choose a reason for hiding this comment

pwais Mar 15, 2024

Choose a reason for hiding this comment

pwais Mar 15, 2024 • edited Loading

Choose a reason for hiding this comment

pwais Mar 15, 2024 • edited Loading

Choose a reason for hiding this comment

brentyi May 28, 2024

Choose a reason for hiding this comment

pwais May 28, 2024

Choose a reason for hiding this comment

brentyi commented Mar 15, 2024

pwais commented Mar 15, 2024

jkulhanek commented Mar 15, 2024

jb-ye commented Mar 15, 2024

jkulhanek commented Mar 15, 2024 • edited Loading

Frenchman997 commented May 6, 2024

hardikdava commented May 8, 2024

hongsukchoi commented Sep 4, 2024

Ben-Mack commented Jan 13, 2025 • edited Loading

pwais commented Jan 17, 2025

jkulhanek commented Jan 17, 2025

Ben-Mack commented Jan 17, 2025 • edited Loading

pwais commented Jan 17, 2025

jb-ye commented Feb 23, 2024 •

edited

Loading

pwais Mar 15, 2024 •

edited

Loading

pwais Mar 15, 2024 •

edited

Loading

jkulhanek commented Mar 15, 2024 •

edited

Loading

Ben-Mack commented Jan 13, 2025 •

edited

Loading

Ben-Mack commented Jan 17, 2025 •

edited

Loading