
Get CUDA available memory at runtime and skip loading models in order of priority #31

Open
kristiankielhofner opened this issue Apr 6, 2023 · 0 comments

Comments

@kristiankielhofner (Contributor)

For cards with insufficient memory we should probably have some basic heuristics to skip loading models that don't fit, possibly ordered by user-configured priority. Just skipping TTS would allow all Whisper models to fit on 3GB cards, which should really be the minimum supported card (GTX 1060 3GB).
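A minimal sketch of what this could look like, assuming PyTorch is already a dependency: query free VRAM with `torch.cuda.mem_get_info()` and walk a priority list, skipping anything that would not fit. The model names and size estimates below are illustrative placeholders, not the project's actual figures.

```python
# Illustrative sketch: skip loading models that won't fit in free VRAM.
import torch

# Estimated VRAM footprint per model in MiB (hypothetical numbers).
MODEL_SIZES_MIB = {
    "whisper-large": 1800,
    "whisper-medium": 1200,
    "whisper-base": 300,
    "tts": 1500,
}

# User-configured priority: earlier entries are loaded first.
LOAD_PRIORITY = ["whisper-large", "whisper-medium", "whisper-base", "tts"]


def free_vram_mib(device: int = 0) -> int:
    """Return currently free CUDA memory on `device`, in MiB."""
    free_bytes, _total_bytes = torch.cuda.mem_get_info(device)
    return free_bytes // (1024 * 1024)


def select_models(margin_mib: int = 256) -> list[str]:
    """Pick models to load, in priority order, keeping a safety margin."""
    budget = free_vram_mib() - margin_mib
    selected = []
    for name in LOAD_PRIORITY:
        size = MODEL_SIZES_MIB[name]
        if size <= budget:
            selected.append(name)
            budget -= size
        else:
            print(f"Skipping {name}: needs ~{size} MiB, only {budget} MiB left")
    return selected


if __name__ == "__main__":
    print("Loading:", select_models())
```

With a 3GB card and the placeholder sizes above, TTS would be the first thing dropped while the Whisper models still load, which is the behavior described.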
