@zhaochenyang20
Collaborator

Motivation

Modifications

Checklist

@zhaochenyang20 zhaochenyang20 merged commit ebddb65 into sgl-project:main Mar 6, 2025
10 of 12 checks passed
aoshen524 pushed a commit to aoshen524/sglang that referenced this pull request Mar 10, 2025
@diggle001

Hi, can anyone tell me why torch.compile doesn't support FP8 models?
When I deployed an FP8 DeepSeek model, I added the --enable-torch-compile flag to the startup command. It worked fine, but startup still took a long time.

@ispobock
Collaborator

> Hi, can anyone tell me why torch.compile doesn't support FP8 models? When I deployed an FP8 DeepSeek model, I added the --enable-torch-compile flag to the startup command. It worked fine, but startup still took a long time.

torch.compile has supported FP8 since torch 2.5. The docs should be updated. @zhaochenyang20
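
For anyone who wants to check their environment before passing --enable-torch-compile with an FP8 model, here is a minimal sketch of a version guard. It takes the 2.5 threshold from the comment above as an assumption and has not been checked against the PyTorch release notes. The helper names `parse_major_minor` and `torch_compile_fp8_expected` are hypothetical and are not part of sglang or PyTorch.

```python
import re

# Assumption: the (2, 5) threshold comes from the comment above,
# not from an official PyTorch compatibility table.
MIN_TORCH_FOR_FP8_COMPILE = (2, 5)


def parse_major_minor(version: str) -> tuple[int, int]:
    """Extract (major, minor) from a version string such as '2.5.1+cu124'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def torch_compile_fp8_expected(version: str) -> bool:
    """Return True if the version meets the assumed minimum for FP8 with torch.compile."""
    # Tuples compare element by element, so (2, 10) correctly ranks above (2, 5).
    return parse_major_minor(version) >= MIN_TORCH_FOR_FP8_COMPILE
```

In practice you would pass `torch.__version__` to `torch_compile_fp8_expected`. Comparing integer tuples rather than raw strings avoids ranking "2.10" below "2.5".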

4 participants