[Model] Add Tencent HunYuanMoEV1 Model Support #20114
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| | | `@@ -259,6 +259,7 @@ def check_available_online(` |
| | | `"Zamba2ForCausalLM": _HfExamplesInfo("Zyphra/Zamba2-7B-instruct"),` |
| | | `"MiMoForCausalLM": _HfExamplesInfo("XiaomiMiMo/MiMo-7B-RL",` |
| | | `trust_remote_code=True),` |
| | | `"HunYuanMoEV1ForCausalLM": _HfExamplesInfo("tencent/Hunyuan-A13B-Instruct"),` |
| | | |
| | | `# [Encoder-decoder]` |
| | | `"BartModel": _HfExamplesInfo("facebook/bart-base"),` |
| | | `"BartForConditionalGeneration": _HfExamplesInfo("facebook/bart-large-cnn"),` |
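
For readers unfamiliar with this registry, the hunk adds one entry to a mapping from architecture name to an example Hugging Face checkpoint. The following is only a minimal sketch of that pattern: `HfExamplesInfo`, `EXAMPLE_MODELS`, and `find_example` are simplified, hypothetical stand-ins written for illustration, not the project's actual `_HfExamplesInfo` class or test helpers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HfExamplesInfo:
    """Hypothetical, simplified stand-in for the test suite's _HfExamplesInfo."""

    default: str                      # example checkpoint on the Hugging Face Hub
    trust_remote_code: bool = False   # whether the checkpoint needs remote code


# Illustrative subset of the registry; the entries mirror the lines in the diff.
EXAMPLE_MODELS = {
    "Zamba2ForCausalLM": HfExamplesInfo("Zyphra/Zamba2-7B-instruct"),
    "MiMoForCausalLM": HfExamplesInfo("XiaomiMiMo/MiMo-7B-RL",
                                      trust_remote_code=True),
    "HunYuanMoEV1ForCausalLM": HfExamplesInfo("tencent/Hunyuan-A13B-Instruct"),
}


def find_example(architecture: str) -> HfExamplesInfo:
    """Return the example checkpoint registered for an architecture name."""
    try:
        return EXAMPLE_MODELS[architecture]
    except KeyError:
        raise ValueError(
            f"no example checkpoint registered for {architecture!r}") from None
```

Under these assumptions, `find_example("HunYuanMoEV1ForCausalLM").default` yields the checkpoint name added by this PR, and an unregistered architecture raises `ValueError`.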
I don't think we support cross attention in v1 yet. Does this model work with v1?
Because it uses self-attention, it currently supports v1, and this has been verified.
Can you also update the documentation for the dense model? It also seems that PP (pipeline parallelism) should be supported.