-
Notifications
You must be signed in to change notification settings - Fork 1.9k
[TRTLLM-5830][feat] Improve LoRA cache memory control #6220
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
shaharmor98
merged 21 commits into
NVIDIA:main
from
amitz-nv:dev-improve-pytorch-lora-cache-memory-control
Jul 31, 2025
Merged
Changes from all commits
Commits
Show all changes
21 commits
Select commit
Hold shift + click to select a range
9171f88
Remove deprecated lora args from BaseLlmArgs, using peft_cache_config…
amitz-nv 07cde29
Enabled use of LoraConfig in TRT_python flow, added tests of expected…
amitz-nv eabe716
Improve comments in tests
amitz-nv d1a896f
Correct mistake in PeftCacheConfig.num_device_module_layer description
amitz-nv e90872a
Add validation of unsupported field in peft cache manager
amitz-nv 7e4e37c
Fix docstring line length
amitz-nv 004eaf9
Fix validate_peft_cache_config
amitz-nv 1afafa7
Fix validate_peft_cache_config formatting
amitz-nv c486af2
Fix lora_prefetch_dir description and 'unsupported warning' message, …
amitz-nv 138c4b1
Fix tests to configure lora cache size by number of adapters for test…
amitz-nv e26ca0a
Fix tests to API update - use LoraConfig instead of base LLM args for…
amitz-nv ef99dd2
Fix tests to explicitly configure lora_config's max_loras and max_cpu…
amitz-nv 797715e
Define default values in PeftCacheConfig model class for device_cache…
amitz-nv 53b4233
Add default value to description
amitz-nv 0d51a80
Fix PeftCacheConfig.create_from_pybind after changing python fields t…
amitz-nv e0fcbeb
Fix examples/llm-api/llm_multilora.py - use one LoraConfig
amitz-nv 61a994b
Fix examples/llm-api/llm_multilora.py to not use BuildConfig that's i…
amitz-nv 191a0ed
Changed create_from_pybind method to be a more generic classmethod in…
amitz-nv 8cca194
Minor docstring fix
amitz-nv 391d0f9
Fix rename
amitz-nv bce06ad
Fix test_ptp_quickstart_multimodal_phi4mm - for stability set lora ca…
amitz-nv File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.