Add multi-lora support for Triton vLLM backend#23

Merged

oandreeva-nv merged 28 commits intotriton-inference-server:mainfrom

l1cacheDell:main

Apr 18, 2024

Commits on Nov 28, 2023

add lora support for backend
SamuraiBUPT
committed

Commits on Nov 29, 2023

finish vllm triton lora support
SamuraiBUPT
committed
add docs for deploying multi-lora on triton
SamuraiBUPT
committed
update docs
SamuraiBUPT
committed

Commits on Nov 30, 2023

bug fix
SamuraiBUPT
committed

Commits on Dec 11, 2023

Merge branch 'triton-inference-server:main' into main
l1cacheDell
authored

Commits on Dec 28, 2023

CodeReview: remove comment and update docs
SamuraiBUPT
committed

Commits on Dec 29, 2023

bug fix: non-graceful terminate
SamuraiBUPT
committed

Commits on Dec 30, 2023

update docs to specify container version
SamuraiBUPT
committed

Commits on Jan 30, 2024

Commits on Jan 31, 2024

CodeReview: remove multi_lora.json, update docs and model.py logic
SamuraiBUPT
committed
update docs: create docker container first
SamuraiBUPT
committed
add test stage 1: modify test.sh
SamuraiBUPT
committed

Commits on Mar 2, 2024

resolve merge conflict
SamuraiBUPT
committed
remove redundant lines and fix for ci test
SamuraiBUPT
committed

Commits on Mar 3, 2024

update client_lora.py for main branch recent commits
SamuraiBUPT
committed

Commits on Mar 5, 2024

modify ci test.sh and docs
SamuraiBUPT
committed
remove redundant line
SamuraiBUPT
committed

Commits on Mar 13, 2024

fix client_lora process_stream
SamuraiBUPT
committed

Commits on Mar 15, 2024

add ci test for multi-lora
SamuraiBUPT
committed

Commits on Apr 9, 2024

Update src/model.py

l1cacheDell
and
oandreeva-nv
authored
modify to model.py and ci
SamuraiBUPT
committed

Commits on Apr 10, 2024

spell check & helper func & copyright & version modify
SamuraiBUPT
committed
shebang: file permissions
SamuraiBUPT
committed

Commits on Apr 11, 2024

Merge branch 'triton-inference-server:main' into main
l1cacheDell
authored

Commits on Apr 13, 2024

code review: changes to docs, client, ci test
SamuraiBUPT
committed
modify docs
SamuraiBUPT
committed