[Feature] plan to support medusa? #859

CSEEduanyu · 2024-08-01T02:41:21Z

Motivation

plan to support medusa?

Related resources

No response

zhyncs · 2024-08-01T06:10:08Z

Speculative decoding support is on our roadmap. Currently, FlashInfer has implemented the corresponding kernel and made targeted optimizations, please stay tuned.

chuangzhidan · 2024-09-30T07:22:53Z

Speculative decoding support is on our roadmap. Currently, FlashInfer has implemented the corresponding kernel and made targeted optimizations, please stay tuned.

really looking for fast decoding methods like medusa,Speculative decoding, LOOKAHEAD DECODING and such

vkc1vk · 2024-10-20T21:57:18Z

Hi I was wondering Medusa will be supported with full tree attention or the Top-1 version currently available in vLLM?

Thanks.
cc: @zhyncs @merrymercy

github-actions · 2024-12-20T00:16:51Z

This issue has been automatically closed due to inactivity. Please feel free to reopen it if needed.

zhyncs self-assigned this Aug 1, 2024

zhyncs mentioned this issue Aug 1, 2024

Development Roadmap (2024 Q3) #634

Closed

29 tasks

zhyncs added the feature label Aug 1, 2024

Ying1123 mentioned this issue Sep 21, 2024

Development Roadmap (2024 Q4) #1487

Open

37 tasks

github-actions bot closed this as completed Dec 20, 2024

github-actions bot added the inactive label Dec 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] plan to support medusa? #859

[Feature] plan to support medusa? #859

CSEEduanyu commented Aug 1, 2024

zhyncs commented Aug 1, 2024

chuangzhidan commented Sep 30, 2024 •

edited

Loading

vkc1vk commented Oct 20, 2024

github-actions bot commented Dec 20, 2024

[Feature] plan to support medusa? #859

[Feature] plan to support medusa? #859

Comments

CSEEduanyu commented Aug 1, 2024

Motivation

Related resources

zhyncs commented Aug 1, 2024

chuangzhidan commented Sep 30, 2024 • edited Loading

vkc1vk commented Oct 20, 2024

github-actions bot commented Dec 20, 2024

chuangzhidan commented Sep 30, 2024 •

edited

Loading