Over 10 years of experience in server architecture design and optimization, proficient in network, cache, and memory. Now focusing on LLM inference
-
mi.com
- Beijing
-
03:05
- 8h ahead - https://www.zhihu.com/people/csioza
- https://mp.weixin.qq.com/s/XhoaZYNBepX8VhRU1nlrag
Pinned Loading
-
vllm-project/vllm
vllm-project/vllm PublicA high-throughput and memory-efficient inference and serving engine for LLMs
-
flashinfer-ai/flashinfer
flashinfer-ai/flashinfer PublicFlashInfer: Kernel Library for LLM Serving
-
pytorch/pytorch
pytorch/pytorch PublicTensors and Dynamic neural networks in Python with strong GPU acceleration
-
sgl-project/sglang
sgl-project/sglang PublicSGLang is a fast serving framework for large language models and vision language models.
69 contributions in the last year
Day of Week | May May | June Jun | July Jul | August Aug | September Sep | October Oct | November Nov | December Dec | January Jan | February Feb | March Mar | April Apr | |||||||||||||||||||||||||||||||||||||||||
Sunday Sun | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Monday Mon | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Tuesday Tue | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Wednesday Wed | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Thursday Thu | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Friday Fri | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Saturday Sat |
Less
No contributions.
Low contributions.
Medium-low contributions.
Medium-high contributions.
High contributions.
More
Activity overview
Loading
Contribution activity
April 2025
Reviewed 2 pull requests in 1 repository
vllm-project/vllm
2 pull requests
-
[P/D][V1] KV Connector API V1
This contribution was made on Apr 12
-
[WIP][V1/0][P/D] XpYd based on p2p communication without cache store
This contribution was made on Apr 6
8
contributions
in private repositories
Apr 26 – Apr 27