
Why does the whisper model need 17GB of video memory? #805

Closed
paulxin001 opened this issue Jan 4, 2024 · 4 comments

@paulxin001

Why does the whisper model need 17GB of video memory? faster-whisper only needs about 4GB. I also haven't found a way to quantize whisper to int8 — is that not supported yet? The memory usage is far too high; is there any way to optimize it?

(attached screenshot: 微信图片_20240104110750)
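For context, a rough back-of-the-envelope estimate suggests the weights alone are nowhere near 17GB. Assuming Whisper large-v2 at roughly 1.55B parameters (the parameter count is an assumption, not stated in this thread), the weight memory at different precisions works out to:

```python
# Rough weight-memory estimate for Whisper at different precisions.
# PARAMS is an assumption: Whisper large-v2 has ~1.55e9 parameters.
PARAMS = 1.55e9

def weight_gib(bytes_per_param: float) -> float:
    """Weight storage in GiB for a given element size in bytes."""
    return PARAMS * bytes_per_param / 2**30

fp32 = weight_gib(4)
fp16 = weight_gib(2)
int8 = weight_gib(1)
print(f"fp32: {fp32:.1f} GiB, fp16: {fp16:.1f} GiB, int8: {int8:.1f} GiB")
# → fp32: 5.8 GiB, fp16: 2.9 GiB, int8: 1.4 GiB
```

So most of the 17GB footprint would come from activation buffers, beam-search state, and the runtime's preallocated memory pool rather than from the weights themselves.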
@kristiankielhofner

It's getting worked on.

@yuekaizhang

yuekaizhang commented Jan 10, 2024

> It's getting worked on.

Yeah, you could try the int8 weight-only quantization branch, which greatly reduces memory usage.
That said, memory usage shouldn't be a big concern here: GPU utilization is already high, so memory freed up wouldn't be usable for other tasks anyway. @paulxin001
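As a concept sketch of what int8 weight-only quantization does (this is a generic NumPy illustration of the technique, not TensorRT-LLM's actual kernels): weights are stored as int8 with one floating-point scale per output channel, and dequantized back to floating point at matmul time. Only the weights shrink; activations stay in floating point.

```python
import numpy as np

def quantize_int8(w: np.ndarray):
    """Per-output-channel symmetric int8 quantization of a [out, in] weight matrix."""
    scale = np.abs(w).max(axis=1, keepdims=True) / 127.0  # one scale per row
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: np.ndarray) -> np.ndarray:
    """Recover an approximate floating-point weight matrix."""
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.standard_normal((256, 512)).astype(np.float32)  # toy weight matrix
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)

print("storage:", w.nbytes, "->", q.nbytes, "bytes")  # 4x smaller than fp32
print("max abs error:", float(np.abs(w - w_hat).max()))
```

This gives a 4x reduction versus fp32 (2x versus fp16) in weight storage at a small, bounded reconstruction error, which is why the branch mentioned above cuts memory usage substantially.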

@yuekaizhang

yuekaizhang commented Jan 31, 2024

@paulxin001 Would you mind removing the layernorm plugin and trying again? Thank you.

See #992

@nv-guomingz
Collaborator

Hi @paulxin001, would you please try our latest code base and see if the issue still exists?

Do you still have any further issues or questions? If not, we'll close this soon.
