Repositories
    Showing 10 of 26 repositories
- llm-compressor (Public): Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
- speculators (Public): A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM
- vllm-project.github.io (Public)
- production-stack (Public): vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization