
Asking for advice: what GPU-memory recommendations and optimizations do you have? #25

Open
datalee opened this issue Jan 18, 2024 · 2 comments

Comments

@datalee commented Jan 18, 2024

Even a slightly longer document blows up GPU memory on a 32 GB V100:
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 34.13 GiB (GPU 0; 31.75 GiB total capacity; 18.88 GiB already allocated; 7.01 GiB free; 20.61 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
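The error message itself suggests trying `max_split_size_mb` via `PYTORCH_CUDA_ALLOC_CONF`, which can help when reserved memory far exceeds allocated memory (fragmentation). A minimal sketch of how that is usually applied; the 128 MiB value is an arbitrary illustration, not a recommendation from this project:

```python
import os

# PYTORCH_CUDA_ALLOC_CONF must be set before torch initializes CUDA.
# max_split_size_mb caps the block size the caching allocator will split,
# which can reduce fragmentation when reserved >> allocated memory.
os.environ["PYTORCH_CUDA_ALLOC_CONF"] = "max_split_size_mb:128"

# import torch  # import torch only AFTER setting the variable,
#               # then load the model and run inference as usual
```

Alternatively, the same setting can be exported in the shell (`export PYTORCH_CUDA_ALLOC_CONF=max_split_size_mb:128`) before launching the script.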

@miange91 (Contributor) commented:

> Even a slightly longer document blows up GPU memory on a 32 GB V100: torch.cuda.OutOfMemoryError: CUDA out of memory. [...]

You can run inference with TensorRT-LLM.
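Independent of TensorRT-LLM, a common way to keep peak memory bounded on long documents is to split the input into fixed-size windows and process them one at a time. A framework-agnostic sketch; `window_size` and `overlap` are illustrative values, not parameters of this project:

```python
def chunk_tokens(tokens, window_size=512, overlap=64):
    """Split a long token sequence into overlapping windows so each
    forward pass sees at most window_size tokens."""
    if window_size <= overlap:
        raise ValueError("window_size must exceed overlap")
    step = window_size - overlap
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + window_size])
        if start + window_size >= len(tokens):
            break
    return chunks

# Example: a 1200-token document becomes three windows of <= 512 tokens.
windows = chunk_tokens(list(range(1200)))
```

The overlap preserves some left context at each window boundary; per-window results then need to be merged downstream, which is task-specific.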

@datalee (Author) commented Jan 22, 2024:

> You can run inference with TensorRT-LLM.

Is there a document we can refer to? Does it use the LLaMA 2 architecture?
