[UnifiedTree] Support deepseek v4 host pool layout #25282
Code Review
This pull request introduces support for multiple memory layouts (layer_first, page_first, and page_first_direct) and transfer backends (kernel, direct) within the DeepSeek V4 host memory pools. The changes include updating the pool initialization to allocate buffers based on the selected layout, implementing layout-aware data transfer logic for both device-to-host and host-to-device operations, and refining metadata retrieval for page buffers. Additionally, the test suite has been expanded to include smoke tests for various layout and backend combinations. I have no feedback to provide as there were no review comments to evaluate.
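To make the difference between the three layouts concrete, here is a minimal sketch of how the same logical entry (layer, page, slot) could map to different offsets in a flat host buffer. The shapes, the helper names `host_buffer_shape` and `flat_offset`, and the `entry_dim` parameter are assumptions made for illustration only; they are not taken from this PR's diff, and the actual DeepSeek V4 pool code may order or split its dimensions differently.

```python
from math import prod

LAYOUTS = ("layer_first", "page_first", "page_first_direct")


def host_buffer_shape(layout, num_layers, num_pages, page_size, entry_dim):
    """Return an assumed (hypothetical) host buffer shape for a layout."""
    if layout == "layer_first":
        # All tokens belonging to one layer are contiguous.
        return (num_layers, num_pages * page_size, entry_dim)
    if layout == "page_first":
        # Token-major: one token's entries for all layers are contiguous.
        return (num_pages * page_size, num_layers, entry_dim)
    if layout == "page_first_direct":
        # Page-major: each page is one contiguous (layer, slot) block.
        return (num_pages, num_layers, page_size, entry_dim)
    raise ValueError(f"unknown layout: {layout!r}")


def flat_offset(layout, layer, page, slot, *, num_layers, num_pages,
                page_size, entry_dim):
    """Row-major element offset of entry (layer, page, slot) in the buffer."""
    shape = host_buffer_shape(layout, num_layers, num_pages, page_size,
                              entry_dim)
    token = page * page_size + slot
    if layout == "layer_first":
        index = (layer, token, 0)
    elif layout == "page_first":
        index = (token, layer, 0)
    else:  # page_first_direct
        index = (page, layer, slot, 0)
    offset = 0
    for i, dim in zip(index, shape):
        offset = offset * dim + i  # Horner-style row-major flattening
    return offset


if __name__ == "__main__":
    dims = dict(num_layers=2, num_pages=3, page_size=4, entry_dim=8)
    for layout in LAYOUTS:
        shape = host_buffer_shape(layout, **dims)
        off = flat_offset(layout, layer=1, page=1, slot=0, **dims)
        print(layout, shape, prod(shape), off)
```

Under these assumed shapes every layout holds the same number of elements; only the placement of a given entry changes, which is why the device-to-host and host-to-device copy paths need to be layout-aware.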
Co-authored-by: hzh0425 <hzh0425@apache.org>
82ba104 to 27944ca
/rerun-test test/registered/radix_cache/test_unified_radix_hicache_kl.py 🚀
/rerun-test test/registered/radix_cache/test_unified_radix_hicache_kl.py ⛔ Please ask a maintainer to run this command, or use the normal CI flow.
/rerun-test test/registered/radix_cache/test_unified_radix_hicache_kl.py 🚀
/rerun-failed-ci 🚀
/rerun-test test/registered/radix_cache/test_unified_radix_hicache_kl.py 🚀
/rerun-failed-ci
/rerun-test test/registered/radix_cache/test_unified_radix_cache_kl_hicache.py 🚀
/rerun-test test/registered/radix_cache/test_unified_radix_cache_kl_hicache.py 🚀
/rerun-test test/registered/radix_cache/test_unified_radix_cache_kl_hicache.py ⛔ Please ask a maintainer to run this command, or use the normal CI flow.
/rerun-test test/registered/radix_cache/test_unified_radix_cache_kl_hicache.py 🚀
/rerun-test test/registered/radix_cache/test_unified_radix_cache_kl_hicache.py 🚀
/rerun-test test/registered/radix_cache/test_unified_radix_cache_kl_hicache.py 🚀
/rerun-failed-ci
…gl-project#25282) Co-authored-by: hzh0425 <hzh0425@apache.org>
Motivation
Modifications
Accuracy Tests
Speed Tests and Profiling
Checklist
Review and Merge Process
/tag-and-rerun-ci, /tag-run-ci-label, /rerun-failed-ci
CI States
Latest PR Test (Base): ⏳ Run #26016810718
Latest PR Test (Extra): ⚠️ Not enabled -- add run-ci-extra label to opt in.