 Steps to reproduce 1. open a sharded gguf model in the model inspector: https://huggingface.co/ggml-org/models/tree/main/grok-1?show_tensors=grok-1%2Fgrok-1-q4_0-00001-of-00009.gguf 2. wait about 1min 3. observe oom error in browser. specs: Google chrome Version 123.0.6312.86 (Official Build) (64-bit) Windows 11 Ram 32Gb