[Issue still exists in TRT 10.4] Fail to build the engine if input tensor number > 2^31 #4123

dzzhang96 · 2024-09-12T17:27:29Z

Description

The release notes for 10.0.1 state that the issue “UNets with tensors containing >2^31 elements may fail during the engine building step” was fixed. However, my model (nnUNet) has a 1x32x512x512x512 input, and it still fails to build the engine in TRT 10.4.

Environment

TensorRT Version: 10.4

Relevant Files

Model link: I uploaded a dummy model for testing here.

Similar case #3815 #4004

moraxu · 2024-09-16T19:22:34Z

@dzzhang96 can you provide your OS and the exact command/snippet you use to build the engine? I'll file an internal bug at that point, thanks.

dzzhang96 · 2024-09-16T20:28:43Z

@moraxu Hi, thanks for the reply. I am using Ubuntu 22.04, NVIDIA-SMI: 525.147.05 Driver Version: 525.147.05 CUDA Version: 12.0, RTX A4500 with 16G RAM. Sorry, I cannot share my code here, but you can use trtexec to reproduce the error. :) Thanks again

moraxu · 2024-09-18T16:27:49Z

@dzzhang96 I was told that Conv doesn't support a volume size larger than INT32_MAX (see https://docs.nvidia.com/deeplearning/tensorrt/operators/docs/Convolution.html#volume-limits). The "fix" in the 10.0.1 release notes may refer to other operators/cases.
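To make the numbers concrete, here is a minimal sketch that checks the reported input shape against INT32_MAX. It assumes the limit applies to the element count of a single tensor; the helper names `tensor_volume` and `exceeds_int32_volume` are hypothetical and not part of the TensorRT API.

```python
import math

# Largest value of a signed 32-bit integer: 2,147,483,647.
INT32_MAX = 2**31 - 1


def tensor_volume(shape):
    """Return the number of elements in a tensor of the given shape."""
    return math.prod(shape)


def exceeds_int32_volume(shape):
    """Return True if the element count is larger than INT32_MAX."""
    return tensor_volume(shape) > INT32_MAX


# Input shape reported in this issue.
shape = (1, 32, 512, 512, 512)
print(tensor_volume(shape))         # 4294967296, i.e. 2^32
print(exceeds_int32_volume(shape))  # True
```

The reported input holds 2^32 elements, which is about twice the limit, so a Conv layer operating on it would fall under the documented restriction.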

dzzhang96 · 2024-09-18T17:08:19Z

@moraxu Hi, thanks for the useful information! I assume that TensorRT uses this Conv operator (nvinfer1::IConvolutionLayer Class) when running inference, because I did not use it directly in my code. In the release note, it is hard to be convinced that a unet structure model does not use a Conv operator.

moraxu · 2024-09-18T17:17:15Z

Yes, correct.

> In the release note, it is hard to be convinced that a unet structure model does not use a Conv operator.

Makes sense, we can improve the release note moving forward.

dzzhang96 · 2024-09-18T17:57:14Z

@moraxu Thanks! It would be very helpful if the number of elements in the input tensor could exceed 2^31, so that we could feed the original 512x512x512 image for inference without losing any resolution.

moraxu · 2024-09-20T21:20:21Z

Noted, although the 2^31 limitation comes from an internal library that we depend on.

pieris98 · 2024-09-21T17:45:39Z

@moraxu This is crippling for any model that uses high-resolution inputs. What is the alternative? Is the only option to modify the architecture and downsample everything?

moraxu · 2024-09-28T00:22:34Z

Unfortunately, there might be no other solution in this case for now.
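For anyone sizing a downsampled input, the following is a rough sketch of how large a cubic volume could be while staying under the limit. It assumes the limit applies per tensor to the total element count and considers only the input tensor; intermediate activations in a real network may be larger, so treat the result as an upper bound rather than a guarantee. The function name `max_cubic_edge` is hypothetical.

```python
# Largest value of a signed 32-bit integer.
INT32_MAX = 2**31 - 1


def max_cubic_edge(channels, batch=1, limit=INT32_MAX):
    """Largest edge length D such that batch * channels * D^3 <= limit.

    Uses an integer search to avoid floating-point rounding in cube roots.
    """
    edge = 0
    while batch * channels * (edge + 1) ** 3 <= limit:
        edge += 1
    return edge


# With 32 channels, as in the shape reported in this issue.
print(max_cubic_edge(32))  # 406
```

Under these assumptions, a 32-channel cubic input would need an edge of at most 406, compared with the 512 used in the original model.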

moraxu added the triaged (Issue has been triaged by maintainers) and Build Time labels Sep 16, 2024

moraxu mentioned this issue Sep 16, 2024

[Issue still exists in TRT 10.1] Fail to build the engine if input tensor number > 2^31 #4004

Closed

moraxu added the internal-bug-tracked label Sep 16, 2024
