I'm trying to improve inference latency for ViT-H-14 and ViT-L-14 using `torch.compile`, following @rwightman's suggestion.
With the default `torch.compile` settings on torch 2.2, inference latency is actually worse than an eager-mode forward pass on the torch.save'd model. Are there any compile flags or settings I can use to improve it?
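For context, here's a minimal sketch of what I'm measuring (assuming the model comes from `open_clip`; the pretrained tag is just illustrative). The `mode="reduce-overhead"` and `mode="max-autotune"` options are the standard `torch.compile` alternatives to the default mode:

```python
import torch
import open_clip

# Illustrative model/pretrained names; substitute whatever checkpoint is in use.
model, _, preprocess = open_clip.create_model_and_transforms(
    "ViT-H-14", pretrained="laion2b_s32b_b79k"
)
model = model.eval().cuda().half()

# "reduce-overhead" uses CUDA graphs, which tends to help small-batch
# inference latency; "max-autotune" compiles longer but may pick faster kernels.
compiled_encode = torch.compile(
    model.encode_image, mode="reduce-overhead", fullgraph=True
)

x = torch.randn(1, 3, 224, 224, device="cuda", dtype=torch.half)
with torch.inference_mode():
    for _ in range(3):  # warm-up runs so one-time compilation cost is excluded
        compiled_encode(x)
    torch.cuda.synchronize()
    # ...time subsequent calls here
```

Note that the first few calls include compilation, so timing only after warm-up.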