[mem] Flatten memory checkers into composable per-pool invariant checks#22562
[mem] Flatten memory checkers into composable per-pool invariant checks#22562
Conversation
|
Warning You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again! |
b88e691 to
e7b1297
Compare
e7b1297 to
5cd6d17
Compare
…in/checker-refactor
|
/rerun-test test_scheduler_pause_generation.py test_mamba_unittest.py test_metrics.py |
|
✅ ✅ |
|
/rerun-test test_disaggregation_basic.py test_pp_llama.py |
|
❌
Please provide the full path, e.g. ❌ |
|
/rerun-test test/registered/disaggregation/test_disaggregation_basic.py test/registered/distributed/test_pp_single_node.py |
|
✅ ✅ |
|
/rerun-test test/registered/distributed/test_disaggregation_pp.py test/registered/distributed/test_epd_disaggregation.py |
|
✅ ✅ |
|
/rerun-test test_openai_function_calling.py |
|
✅ |
Summary
_check_hybrid_memory,_check_mamba_memory,_check_radix_cache_memory) with composable per-pool checks:_check_full_pool,_check_swa_pool,_check_mamba_pool_check_pool_invariantstatic method:available + evictable + protected + session_held + uncached == totalcheck_memory(idle) uses flatif/ifinstead ofif/elif/else— naturally supports SWA + mamba coexistenceself_check_during_busydelegates to_check_full_pool(ps, uncached=N)instead of reimplementing the invariant_get_total_uncached_sizehelper from busy check boilerplatedump_infoalso uses the flat structureFollows up on #22554 and #22559.
Test plan
test_scheduler_pause_generation.py— CPU, mock PoolStats pathtest_metrics.py— exercises idle metrics path