Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support Medusa speculative decoding #2859

Open
wants to merge 19 commits into
base: main
Choose a base branch
from
Open

Conversation

AllentDan
Copy link
Collaborator

No description provided.

@AllentDan AllentDan added the WIP label Dec 5, 2024
@AllentDan AllentDan removed the WIP label Dec 11, 2024
@Tushar-ml
Copy link

can we have docs section for using this feature? @AllentDan

@lvhan028 lvhan028 added the enhancement New feature or request label Dec 12, 2024
@snippetzero
Copy link

Is there any plan for the Turbomind engine to support speculative sampling?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants