Skip to content

Add multi-lora support for Triton vLLM backend#23

Merged
oandreeva-nv merged 28 commits intotriton-inference-server:mainfrom
l1cacheDell:main
Apr 18, 2024
Merged

Add multi-lora support for Triton vLLM backend#23
oandreeva-nv merged 28 commits intotriton-inference-server:mainfrom
l1cacheDell:main

Commits

Commits on Nov 28, 2023

Commits on Nov 29, 2023

Commits on Nov 30, 2023

Commits on Dec 11, 2023

Commits on Dec 28, 2023

Commits on Dec 29, 2023

Commits on Dec 30, 2023

Commits on Jan 30, 2024

Commits on Jan 31, 2024

Commits on Mar 2, 2024

Commits on Mar 3, 2024

Commits on Mar 5, 2024

Commits on Mar 13, 2024

Commits on Mar 15, 2024

Commits on Apr 9, 2024

Commits on Apr 10, 2024

Commits on Apr 11, 2024

Commits on Apr 13, 2024