DefTruth

Follow

🎯

#pragma unroll

DefTruth DefTruth

🎯

#pragma unroll

Follow

AI Infra Engineer @vipshop, Owner @xlite-dev, Prev @PaddlePaddle🤖

1.9k followers · 173 following

@xlite-dev, @vipshop
Guangzhou, China
04:44 (UTC +08:00)
https://github.com/xlite-dev

Achievements

Achievements

Organizations

Pinned Loading

xlite-dev/LeetCUDA xlite-dev/LeetCUDA Public

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 8.5k 837
xlite-dev/lite.ai.toolkit xlite-dev/lite.ai.toolkit Public

🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉

C++ 4.3k 765
xlite-dev/Awesome-LLM-Inference xlite-dev/Awesome-LLM-Inference Public

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4.7k 322
vllm-project/vllm vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 63.4k 11.4k
vipshop/cache-dit vipshop/cache-dit Public

A Unified and Flexible Inference Engine with Hybrid Cache Acceleration and Parallelism for 🤗DiTs.

Python 559 24
xlite-dev/ffpa-attn xlite-dev/ffpa-attn Public

🤖FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.

Cuda 231 11