Commit 337f67a
Merged in jurassic-2.5 (pull request vllm-project#1)
BA-78554: Jurassic 2.5
* worked on jurasic2.5 configuration file, updated jurassic2_5 modeling file to support alternating experts/attn layers
* finished working the forward pass of jurassic3.py
* finished working the forward pass of jurassic3.py
* finished working the forward pass of jurassic3.py
* jurassic_3 modeling file works, uses dummy weights initialized by "dummy" flag. Tokenizer raises issues, for now copying the mixtral tokenizer
* changed default tokenizer vocab values, loading of custom .pt weight files works.
* removed notebook
* merging master to jurassic-2.5 to reset head
* Merge branch 'master' into jurassic-2.5
* align to master
Approved-by: Tomer Asida
Approved-by: Mor Zusman1 parent 0543476 commit 337f67a
File tree
5 files changed
+614
-0
lines changed- vllm
- model_executor/models
- transformers_utils
- configs
5 files changed
+614
-0
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
54 | 54 | | |
55 | 55 | | |
56 | 56 | | |
| 57 | + | |
57 | 58 | | |
58 | 59 | | |
59 | 60 | | |
| |||
0 commit comments