
Question about padFilterWeights op. #3740

Closed
theNefelibata opened this issue Mar 26, 2024 · 12 comments

Labels
triaged Issue has been triaged by maintainers

Comments
@theNefelibata

There are many nvinfer1::rt::cuda::padFilterWeights calls in my model, and I found that this runs before the conv2d op. I want to know what this function does and whether there is any way to avoid it. Thank you.

@zerollzeng
Collaborator

Where did you observe the call: build phase or inference phase? From the name, it looks like it just pads the weights so that they fit the format required by a performant kernel, which should be necessary.

@zerollzeng zerollzeng self-assigned this Mar 28, 2024
@zerollzeng zerollzeng added the triaged Issue has been triaged by maintainers label Mar 28, 2024
@theNefelibata
Author

I observed this call during the inference phase. I think it is because I use the weights and bias of conv2d as inputs to the model, so this call appears before each conv2d layer. Is there a way to do this operation in advance?
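
(For reference, a minimal sketch of this kind of setup using the TensorRT Python API; the tensor names and shapes are made up for illustration. Because the kernel and bias are network inputs, TensorRT only sees their values at inference time.)

```python
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))

# Activations AND conv weights/bias all arrive as network inputs,
# so the weights cannot be reformatted/padded at build time.
data = network.add_input("data", trt.float32, (1, 3, 224, 224))
kernel = network.add_input("conv_weight", trt.float32, (64, 3, 3, 3))
bias = network.add_input("conv_bias", trt.float32, (64,))

# Create the conv with empty weights, then wire the weight tensors in.
conv = network.add_convolution_nd(
    data, num_output_maps=64, kernel_shape=(3, 3),
    kernel=trt.Weights(), bias=trt.Weights())
conv.set_input(1, kernel)  # input index 1: kernel weights
conv.set_input(2, bias)    # input index 2: bias
```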

@zerollzeng
Collaborator

@nvpohanh Is this expected?

@nvpohanh
Collaborator

nvpohanh commented Apr 7, 2024

@theNefelibata Could you make the weights/bias constants? Or do they have to be network inputs?

@theNefelibata
Author

> @theNefelibata Could you make the weights/bias constants? Or do they have to be network inputs?

they have to be inputs.

@nvpohanh
Collaborator

nvpohanh commented Apr 8, 2024

Then the padFilterWeights kernels are expected because we need to pad the weights for the Conv kernels to run. If the weights were constants, that could have been done offline.
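
(As a sketch of the contrast, again assuming the TensorRT Python API with made-up shapes: when the same weights are supplied as constants, the builder can fold the padding into the engine.)

```python
import numpy as np
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))

data = network.add_input("data", trt.float32, (1, 3, 224, 224))

# Constant weights: TensorRT can pad/reformat them once at build time,
# so no padFilterWeights kernel runs before the conv at inference.
w = np.ones((64, 3, 3, 3), dtype=np.float32)
b = np.zeros((64,), dtype=np.float32)
conv = network.add_convolution_nd(
    data, num_output_maps=64, kernel_shape=(3, 3),
    kernel=trt.Weights(w), bias=trt.Weights(b))
```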

@theNefelibata
Author

> Then the padFilterWeights kernels are expected because we need to pad the weights for the Conv kernels to run. If the weights were constants, that could have been done offline.

Can I do this operation manually?

@nvpohanh
Collaborator

nvpohanh commented Apr 8, 2024

> they have to be inputs.

If the weights have to be network inputs, is it because you need to change the weights for each inference? Or do you only need to change the weights once and then run multiple inferences with the same set of weights?

If the use case is the latter, then I would recommend using the Refit feature instead of marking weights as network inputs: https://docs.nvidia.com/deeplearning/tensorrt/developer-guide/index.html#refitting-engine-c

This would allow you to refit the weights once at runtime and then run multiple inferences with the refitted weights, without needing padFilterWeights for every inference.
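
(A rough sketch of that flow in the Python API, assuming an engine built with trt.BuilderFlag.REFIT; the weights name "conv1.weight" is a placeholder for whatever your network actually uses.)

```python
import numpy as np
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
# `engine` is a deserialized ICudaEngine built with:
#   config.set_flag(trt.BuilderFlag.REFIT)
refitter = trt.Refitter(engine, logger)

# Update the weights once; "conv1.weight" is a placeholder name.
new_w = np.ones((64, 3, 3, 3), dtype=np.float32)
assert refitter.set_named_weights("conv1.weight", trt.Weights(new_w))

# Applies all pending updates (including any reformatting/padding) once;
# subsequent inferences then reuse the refitted weights.
assert refitter.refit_cuda_engine()
```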

@theNefelibata
Author

I have tried Refit, but it can affect the inference speed.

@zerollzeng
Collaborator

I'm interested in what use case requires the weights to be changed for each inference. @theNefelibata, could you please share your use case? Thanks!

@theNefelibata
Author

> I'm interested in what use case requires the weights to be changed for each inference. @theNefelibata, could you please share your use case? Thanks!

I am trying alternatives to Refit.

@zerollzeng
Collaborator

Got it, thanks! Can we close this issue?
