-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Deepseek v3 mla fp8 #9897
base: develop
Are you sure you want to change the base?
Deepseek v3 mla fp8 #9897
Conversation
support append_attn c16 for deep-seek-v3
Fix rope&fix precision
…fp8 dual gemm api on cutlass3.x
…nto support-deepseek-v3
Thanks for your contribution! |
…eNLP into deepseek-v3-mla-fp8
Codecov ReportAttention: Patch coverage is
❌ Your patch check has failed because the patch coverage (3.91%) is below the target coverage (80.00%). You can increase the patch coverage or adjust the target coverage. Additional details and impacted files@@ Coverage Diff @@
## develop #9897 +/- ##
===========================================
- Coverage 51.66% 50.97% -0.70%
===========================================
Files 739 749 +10
Lines 117426 119529 +2103
===========================================
+ Hits 60668 60925 +257
- Misses 56758 58604 +1846 ☔ View full report in Codecov by Sentry. |
Before submitting
tests
folder. If there are codecov issues, please add tests cases first.PR types
PR changes
Description