-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
update trt workspace size param #44469
Conversation
你的PR提交成功,感谢你对开源项目的贡献! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Agree operator definition modification
API 文档更新见 PaddlePaddle/Paddle-Inference-Demo#333 |
@@ -523,7 +523,7 @@ struct PD_INFER_DECL AnalysisConfig { | |||
/// quantization). | |||
/// | |||
/// | |||
void EnableTensorRtEngine(int workspace_size = 1 << 20, | |||
void EnableTensorRtEngine(int64_t workspace_size = 1 << 30, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
文档里面是20.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
文档已修改
PR types
Others
PR changes
Others
Describe
use
int64_t
instead ofint
to set GPU workspace size for TensorRT engine. Paddle's AttrType do not supportunsigned long
data type, so useint64_t
instead.refer to TensorRT
This API will be deprecated in TensorRT 8.3