[WIP] [Speculative Decoding] Use MQA kernel for target model verification#5691
Closed
LiuXiaoxuanPKU wants to merge 23 commits into
Closed
[WIP] [Speculative Decoding] Use MQA kernel for target model verification#5691LiuXiaoxuanPKU wants to merge 23 commits into
LiuXiaoxuanPKU wants to merge 23 commits into
Commits
Commits on Jun 19, 2024
- committed
- committed
- committed
- committed
Commits on Jun 20, 2024
- committed
- committed
Commits on Jun 25, 2024
Commits on Jul 2, 2024
Commits on Jul 9, 2024
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed