-
Notifications
You must be signed in to change notification settings - Fork 766
feat: add RuntimeConfig to ModelEntry #2311
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
PeaBrane
merged 38 commits into
ai-dynamo:main
from
jorgeantonio21:feat/ja/runtime-configs-mdc
Aug 14, 2025
Merged
Changes from 3 commits
Commits
Show all changes
38 commits
Select commit
Hold shift + click to select a range
b1e6eb4
first commit
jorgeantonio21 8ffe717
register runtime config after engine initialization
jorgeantonio21 58d73d2
add sglang runtime config values retrieval
jorgeantonio21 dfc9154
merge main and resolve conflicts
jorgeantonio21 87865fc
Merge branch 'main' into feat/ja/runtime-configs-mdc
jorgeantonio21 5707890
address comments in the PR
jorgeantonio21 9770e75
Merge branch 'main' into feat/ja/runtime-configs-mdc
jorgeantonio21 61f6424
refactor logic to pass in engine initialization runtime args directly…
jorgeantonio21 6fbe951
merge main and resolve conflicts
jorgeantonio21 b376cfb
resolve _core.py import issues
jorgeantonio21 9d3cbb1
resolve runtime issues
jorgeantonio21 d1b87f5
resolve import issues
jorgeantonio21 d18881b
resolve import issues
jorgeantonio21 24712cb
resolve vllm cache config issues
jorgeantonio21 c20f0e1
resolve non-int gpu_mem_integer issue
jorgeantonio21 af94e4b
resolve non-int gpu_mem_integer issue
jorgeantonio21 57e12c2
remove uneeded async in python code
jorgeantonio21 3304c8d
Merge branch 'main' into feat/ja/runtime-configs-mdc
jorgeantonio21 acddc6b
Merge branch 'main' into feat/ja/runtime-configs-mdc
PeaBrane d4b1edf
revert llama-cpp version in Cargo.lock
PeaBrane b7ca2f5
move runtime config into local_model
PeaBrane becb754
put runtime config in ModelEntry so it gets registered to etcd
PeaBrane 5adaeb1
fmt
PeaBrane 950e6a4
if mocker, override runtime configs
PeaBrane cbbd03b
router listens to runtime configs (kv total blocks)
PeaBrane e697253
clippy
PeaBrane b0dc6f3
mv runtime config bindings to new file local_model.rs
PeaBrane 8004bbd
tensorrtllm support (vibe coded)
PeaBrane ef3d419
max_num_batched_tokens instead
PeaBrane 6842494
fix sglang server_info args
PeaBrane 3b175cf
direct access to server_Args
PeaBrane 10773c7
sglang: access total num tokens via scheduler info
PeaBrane 69d5d80
isort
PeaBrane e6de5a2
trtllm: extract directly from config
PeaBrane 09f1cb0
trtllm: get total_kv_blocks from get_stats_async
PeaBrane 36a6fbb
Merge branch 'feat/ja/runtime-configs-mdc' of https://github.com/jorg…
jorgeantonio21 280e98a
ceil division for sglang total_kv_blocks calculation
jorgeantonio21 3f8bcdd
hooks
jorgeantonio21 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.