[BUG] Host memory leak in SerializedBatchIterator #8043
Labels
bug
Something isn't working
reliability
Features to improve reliability or bugs that severly impact the reliability of the plugin
Describe the bug
While testing #7581 with NDS 3TB with GPU memory restricted to 6GB, I am seeing some leaked host memory buffers.
I see these with and without the fix in #8040
Steps/Code to reproduce bug
Run NDS at 3TB on an 8 node A100 cluster with GPU memory restricted to 6GB
The text was updated successfully, but these errors were encountered: