
Conversation


@zy1git zy1git commented Nov 19, 2025

Summary:
Implemented _horizontal_flip_image_cvcuda and _vertical_flip_image_cvcuda kernels using cvcuda.flip operator. The kernels are automatically registered when CVCUDA is available and route cvcuda.Tensor inputs appropriately.

Test Plan:

  • Added test_functional_cvcuda and test_image_correctness_cvcuda tests
  • Verified parity between PyTorch and CVCUDA implementations
  • All tests pass with CVCUDA backend


pytorch-bot bot commented Nov 19, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/vision/9277

Note: Links to docs will display an error until the docs builds have been completed.

❗ 2 Active SEVs

There are 2 currently active SEVs. If your PR is affected, please view them below:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the cla signed label Nov 19, 2025
@zy1git zy1git force-pushed the cvcuda-flip-transforms branch from 9cb272b to 02c320a Compare November 19, 2025 23:09
@justincdavis

@zy1git What is the strategy for creating the tests for the transforms with CV-CUDA backends? Do we want to have all the tests live entirely inside the existing classes or make a new class?

The PRs I made for gaussian_blur, normalize, and to_dtype all use new classes, but I can switch them to be more centralized.

@zy1git zy1git force-pushed the cvcuda-flip-transforms branch from 02c320a to 330db00 Compare November 20, 2025 00:29
@zy1git zy1git closed this Nov 20, 2025
@zy1git zy1git reopened this Nov 20, 2025
Member

@AntoineSimoulin AntoineSimoulin left a comment


Thanks a lot for submitting this PR! This is looking good. I added some comments to make sure we have extensive test coverage :)

@NicolasHug
Member

@justincdavis replying to your question in #9277 (comment): we prefer centralizing the tests in the existing test class. The idea is that, as much as possible, we'd just add CV-CUDA as a parametrization entry with pytest.mark.parametrize to the existing tests. Antoine and I left a few comments related to that above. Does that make sense?
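The parametrization pattern described there might look roughly like the sketch below. The input makers and the flip function are illustrative stand-ins, not the real test-suite helpers (`make_image`, `make_image_cvcuda`, etc.):

```python
import pytest

# Illustrative stand-ins for the suite's input makers.
def make_image_nested_list():
    return [[1, 2], [3, 4]]

def make_image_tuple_rows():
    return [(1, 2), (3, 4)]

def horizontal_flip(img):
    # Stand-in kernel: reverse each row.
    return [list(reversed(row)) for row in img]

# Supporting a new backend then means adding one more entry to the
# parametrize list, rather than writing a separate test class.
@pytest.mark.parametrize("make_input", [make_image_nested_list, make_image_tuple_rows])
def test_horizontal_flip_is_involution(make_input):
    img = [list(row) for row in make_input()]
    assert horizontal_flip(horizontal_flip(img)) == img
```

Each parametrize entry runs the same assertions, so backend parity falls out of the shared test body.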

@justincdavis

@NicolasHug Makes sense! I will follow the comments you and Antoine left on this PR

@zy1git zy1git force-pushed the cvcuda-flip-transforms branch from 330db00 to 98616f4 Compare November 24, 2025 23:25
Member

@AntoineSimoulin AntoineSimoulin left a comment


Thanks a lot for addressing the initial comments! I left some final adjustments to make. Let's also make sure linting and tests are passing!


justincdavis commented Nov 27, 2025

@AntoineSimoulin @NicolasHug @zy1git

I would propose that we take an approach like this to simplify the comparison of CV-CUDA tensors with PIL references.

We would add the following function to common_utils.py

def cvcuda_to_pil_compatible_tensor(tensor):
    tensor = cvcuda_to_tensor(tensor)
    if tensor.ndim != 4:
        raise ValueError(f"CV-CUDA Tensor should be 4 dimensional. Got {tensor.ndim} dimensions.")
    if tensor.shape[0] != 1:
        raise ValueError(f"CV-CUDA Tensor should have batch dimension 1 for comparison with PIL.Image.Image. Got {tensor.shape[0]}.")
    return tensor.squeeze(0).cpu()

Then we would modify ImagePair as follows

class ImagePair(TensorLikePair):
    def __init__(
        self,
        actual,
        expected,
        *,
        mae=False,
        **other_parameters,
    ):
        if all(isinstance(input, PIL.Image.Image) for input in [actual, expected]):
            actual, expected = (to_image(input) for input in [actual, expected])
        elif CVCUDA_AVAILABLE and all(isinstance(input, _import_cvcuda().Tensor) for input in [actual, expected]):
            actual, expected = (cvcuda_to_tensor(input) for input in [actual, expected])
        elif CVCUDA_AVAILABLE and isinstance(actual, _import_cvcuda().Tensor) and isinstance(expected, PIL.Image.Image):
            actual = cvcuda_to_pil_compatible_tensor(actual)
            expected = to_image(expected)
        elif CVCUDA_AVAILABLE and isinstance(actual, _import_cvcuda().Tensor):
            actual = cvcuda_to_pil_compatible_tensor(actual)

        super().__init__(actual, expected, **other_parameters)
        self.mae = mae

Then when we compare the actual tensor (cvcuda.Tensor) to the expected (PIL.Image.Image), the conversion will be handled automatically for us.

Additionally, with the helper function cvcuda_to_pil_compatible_tensor, we can simplify the logic for handling CV-CUDA specific comparisons. For example, in TestRgbToGrayscale::test_image_correctness, the logic for handling CV-CUDA goes from:

if make_input is make_image_cvcuda:
    actual = F.cvcuda_to_tensor(actual).to(device="cpu")
    actual = actual.squeeze(0)  # drop the batch dimension
    image = F.cvcuda_to_tensor(image).to(device="cpu")
    image = image.squeeze(0)  # drop the batch dimension

to

# here the conversion of actual is handled in either assert_close or assert_equal itself
if make_input is make_image_cvcuda:
    image = cvcuda_to_pil_compatible_tensor(image)

These changes are integrated in this PR

Member

@NicolasHug NicolasHug left a comment


Thanks for the great work @zy1git ! I made another pass.

tol=1e-5,
msg=None,
agg_method="mean",
allowed_percentage_diff=None,

Above: this looks like a cosmetic change? Try to avoid that and revert the previous state, it's distracting when reviewing and it also affects git blame unnecessarily :)

Contributor Author


I think this cosmetic change was introduced by the pre-commit hooks, which run both ufmt and flake8, as described here

"ignore",
message=re.escape("operator() profile_node %"),
category=UserWarning,
)

Same here, this looks like a cosmetic change?


from .functional._utils import _get_kernel

CVCUDA_AVAILABLE = _is_cvcuda_available()

This doesn't seem to be used, we can remove it.

Comment on lines +305 to +307
# Remove batch dimension if it's 1 for easier comparison
if actual.shape[0] == 1:
    actual = actual[0]

This seems unnecessary, we should be able to compare tensors where the batch dim is 1. Try to remove it, if it doesn't work for any reason let me know.


EDIT: ah, OK, it's for when we compare a 3D PIL image to a 4D cvcuda tensor. That's... fine. Let's explain why then (addition in bold):

Remove batch dimension if it's 1 for easier comparison against 3D PIL images

Comment on lines +299 to +301
try:
    import cvcuda
    from torchvision.transforms.v2.functional import cvcuda_to_tensor

Let's use the _is_cvcuda_available() instead of a try/except. You can then call cvcuda = _import_cvcuda() which is safer and less surprising than a raw import cvcuda.

if isinstance(input, cvcuda.Tensor):
    assert_equal(F.cvcuda_to_tensor(output), F.cvcuda_to_tensor(input))
else:
    assert_equal(output, input)

there should be no need to manually convert anymore, assert_equal should be able to handle it. Also don't make raw cvcuda accesses! It would force a hard dependency :)


assert_equal(valid, torch.tensor(expected_valid_mask))
assert type(valid) == torch.Tensor
assert type(valid) is torch.Tensor

revert both above, they seem unrelated and not needed.

Comment on lines +292 to +296
# Convert PIL images to tv_tensors.Image (regardless of what the other is)
if isinstance(actual, PIL.Image.Image):
    actual = to_image(actual)
if isinstance(expected, PIL.Image.Image):
    expected = to_image(expected)

Noting that the above is a change of behavior: we used to convert both inputs, or none of them, which means we'd error when comparing a tensor to a PIL image. We now accept it. I think that's fine.

@justincdavis thanks for sharing your implementation in https://github.com/pytorch/vision/pull/9284/files#diff-f833e6bb21df531837a51e84306266c2f1f6d1565340a498095868e24b3f27de. I think you were being slightly more conservative in your added logic. Was there any particular edge-case you had in mind that you wanted to guard against?

Comment on lines +52 to +53
if CVCUDA_AVAILABLE:
_transformed_types = (torch.Tensor, PIL.Image.Image, cvcuda.Tensor)

I think this should be

Suggested change
- if CVCUDA_AVAILABLE:
-     _transformed_types = (torch.Tensor, PIL.Image.Image, cvcuda.Tensor)
+ _transformed_types = _RandomApplyTransform._transformed_types + (is_cvcuda_tensor,)

where is_cvcuda_tensor is @justincdavis 's implementation from:
https://github.com/pytorch/vision/pull/9283/files#diff-6e892912803ab29861746f7118e9e462a5c9eaab50ad286903e87f09f6c44ff3R175

The function is great and it allows us to avoid the big if CVCUDA_AVAILABLE.
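A predicate along those lines could be sketched as follows. This is an educated guess at the helper's shape, not the actual code from the linked PR:

```python
def is_cvcuda_tensor(obj):
    # Import lazily so the predicate is safe to call even when CV-CUDA is
    # not installed: in that case no object can be a cvcuda.Tensor, so we
    # simply answer False instead of raising ImportError.
    try:
        import cvcuda
    except ImportError:
        return False
    return isinstance(obj, cvcuda.Tensor)
```

Because `_transformed_types` accepts callables as well as types, the predicate can sit in the tuple without ever forcing the import at class-definition time.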

To follow-up from the discussion in https://github.com/pytorch/vision/pull/9277/files#r2566354008 about whether this _transformed_types should be in each child transform class or in the base Transform class: I think it should be in each child transform for now (as done here):

  • we may have to push a release before cvcuda support is implemented for all transforms, so if this were in the base class we'd be claiming cvcuda support for some transforms that actually do not support it
  • we may publish new transforms in the future which won't immediately come with cvcuda support, causing the same kind of problem.

We could revisit this eventually; it might turn out simpler to have it once and for all in the base Transform class. But having it in each child class isn't much work, and it makes it easier to track which transforms support cvcuda and which don't.
