Commit b130f85
Character ai (#587)
* integrate aiter
Signed-off-by: fsx950223 <[email protected]>
* add env variable
Signed-off-by: fsx950223 <[email protected]>
* rename function
Signed-off-by: fsx950223 <[email protected]>
* optimize kernels with small query lens
Signed-off-by: fsx950223 <[email protected]>
* change condition
Signed-off-by: fsx950223 <[email protected]>
* add rocm aiter backend
Signed-off-by: fsx950223 <[email protected]>
* new fa impl
Signed-off-by: fsx950223 <[email protected]>
* update api
Signed-off-by: fsx950223 <[email protected]>
* optimize performance
Signed-off-by: fsx950223 <[email protected]>
* remove try catch
Signed-off-by: fsx950223 <[email protected]>
* clean code
Signed-off-by: fsx950223 <[email protected]>
* remove type cast
Signed-off-by: fsx950223 <[email protected]>
* use on_gfx9 instead of on_mi250_mi300
Signed-off-by: charlifu <[email protected]>
* add fp8 support
Signed-off-by: fsx950223 <[email protected]>
* revert layernorm
Signed-off-by: fsx950223 <[email protected]>
* enable aiter pa
Signed-off-by: fsx950223 <[email protected]>
* fix bug
Signed-off-by: fsx950223 <[email protected]>
* fix bug
Signed-off-by: fsx950223 <[email protected]>
* fix upstream issue
Signed-off-by: fsx950223 <[email protected]>
* change condition
Signed-off-by: fsx950223 <[email protected]>
* support head size 256
Signed-off-by: fsx950223 <[email protected]>
* enable fp8 aiter pa in vllm v1
Signed-off-by: fsx950223 <[email protected]>
* fix workspace buffer
Signed-off-by: fsx950223 <[email protected]>
* fix fa crash issue
Signed-off-by: fsx950223 <[email protected]>
* add namespace aiter
Signed-off-by: fsx950223 <[email protected]>
---------
Signed-off-by: fsx950223 <[email protected]>
Signed-off-by: charlifu <[email protected]>
Co-authored-by: charlifu <[email protected]>
Signed-off-by: fsx950223 <[email protected]>1 parent e28533a commit b130f85
File tree
4 files changed
+64
-50
lines changed- vllm
- attention
- backends
- ops
- platforms
- v1/attention/backends
4 files changed
+64
-50
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
913 | 913 | | |
914 | 914 | | |
915 | 915 | | |
916 | | - | |
917 | | - | |
| 916 | + | |
918 | 917 | | |
919 | 918 | | |
920 | 919 | | |
| |||
930 | 929 | | |
931 | 930 | | |
932 | 931 | | |
933 | | - | |
934 | 932 | | |
935 | 933 | | |
936 | 934 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
9 | 9 | | |
10 | 10 | | |
11 | 11 | | |
12 | | - | |
13 | 12 | | |
14 | 13 | | |
15 | 14 | | |
| |||
305 | 304 | | |
306 | 305 | | |
307 | 306 | | |
308 | | - | |
| 307 | + | |
309 | 308 | | |
310 | 309 | | |
311 | 310 | | |
| |||
316 | 315 | | |
317 | 316 | | |
318 | 317 | | |
319 | | - | |
320 | | - | |
| 318 | + | |
321 | 319 | | |
322 | | - | |
| 320 | + | |
323 | 321 | | |
324 | 322 | | |
325 | 323 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
138 | 138 | | |
139 | 139 | | |
140 | 140 | | |
141 | | - | |
| 141 | + | |
142 | 142 | | |
143 | | - | |
| 143 | + | |
144 | 144 | | |
145 | 145 | | |
146 | 146 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
5 | 5 | | |
6 | 6 | | |
7 | 7 | | |
8 | | - | |
9 | 8 | | |
10 | | - | |
11 | | - | |
| 9 | + | |
12 | 10 | | |
13 | 11 | | |
14 | 12 | | |
| |||
17 | 15 | | |
18 | 16 | | |
19 | 17 | | |
| 18 | + | |
| 19 | + | |
20 | 20 | | |
21 | 21 | | |
22 | 22 | | |
| |||
38 | 38 | | |
39 | 39 | | |
40 | 40 | | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
41 | 44 | | |
42 | 45 | | |
43 | 46 | | |
| |||
59 | 62 | | |
60 | 63 | | |
61 | 64 | | |
62 | | - | |
| 65 | + | |
63 | 66 | | |
64 | 67 | | |
65 | 68 | | |
66 | 69 | | |
67 | 70 | | |
68 | 71 | | |
| 72 | + | |
| 73 | + | |
| 74 | + | |
| 75 | + | |
| 76 | + | |
| 77 | + | |
69 | 78 | | |
70 | 79 | | |
71 | 80 | | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
72 | 86 | | |
73 | 87 | | |
74 | 88 | | |
| |||
78 | 92 | | |
79 | 93 | | |
80 | 94 | | |
81 | | - | |
| 95 | + | |
| 96 | + | |
82 | 97 | | |
83 | 98 | | |
84 | 99 | | |
85 | | - | |
86 | 100 | | |
87 | | - | |
| 101 | + | |
88 | 102 | | |
89 | 103 | | |
90 | | - | |
| 104 | + | |
91 | 105 | | |
92 | 106 | | |
93 | 107 | | |
94 | 108 | | |
95 | 109 | | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
| 116 | + | |
96 | 117 | | |
97 | 118 | | |
98 | 119 | | |
| |||
101 | 122 | | |
102 | 123 | | |
103 | 124 | | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
104 | 128 | | |
105 | 129 | | |
106 | 130 | | |
| |||
120 | 144 | | |
121 | 145 | | |
122 | 146 | | |
| 147 | + | |
| 148 | + | |
123 | 149 | | |
124 | 150 | | |
125 | | - | |
| 151 | + | |
| 152 | + | |
126 | 153 | | |
127 | 154 | | |
128 | 155 | | |
| |||
154 | 181 | | |
155 | 182 | | |
156 | 183 | | |
| 184 | + | |
| 185 | + | |
157 | 186 | | |
158 | 187 | | |
159 | 188 | | |
| |||
184 | 213 | | |
185 | 214 | | |
186 | 215 | | |
187 | | - | |
188 | 216 | | |
189 | 217 | | |
190 | 218 | | |
| |||
281 | 309 | | |
282 | 310 | | |
283 | 311 | | |
| 312 | + | |
| 313 | + | |
| 314 | + | |
| 315 | + | |
| 316 | + | |
| 317 | + | |
| 318 | + | |
| 319 | + | |
| 320 | + | |
| 321 | + | |
| 322 | + | |
| 323 | + | |
284 | 324 | | |
285 | 325 | | |
286 | 326 | | |
| |||
292 | 332 | | |
293 | 333 | | |
294 | 334 | | |
| 335 | + | |
295 | 336 | | |
296 | 337 | | |
297 | 338 | | |
| |||
315 | 356 | | |
316 | 357 | | |
317 | 358 | | |
318 | | - | |
| 359 | + | |
319 | 360 | | |
320 | 361 | | |
321 | 362 | | |
| |||
364 | 405 | | |
365 | 406 | | |
366 | 407 | | |
| 408 | + | |
367 | 409 | | |
368 | 410 | | |
369 | 411 | | |
| |||
442 | 484 | | |
443 | 485 | | |
444 | 486 | | |
445 | | - | |
446 | | - | |
447 | | - | |
448 | | - | |
449 | 487 | | |
450 | 488 | | |
451 | 489 | | |
| |||
516 | 554 | | |
517 | 555 | | |
518 | 556 | | |
519 | | - | |
520 | | - | |
521 | | - | |
522 | | - | |
523 | | - | |
524 | | - | |
525 | 557 | | |
526 | 558 | | |
527 | 559 | | |
| |||
559 | 591 | | |
560 | 592 | | |
561 | 593 | | |
562 | | - | |
563 | | - | |
| 594 | + | |
| 595 | + | |
| 596 | + | |
564 | 597 | | |
565 | 598 | | |
566 | | - | |
567 | | - | |
568 | | - | |
569 | | - | |
570 | | - | |
571 | | - | |
572 | | - | |
573 | | - | |
574 | | - | |
575 | | - | |
576 | | - | |
577 | | - | |
578 | | - | |
579 | | - | |
580 | | - | |
581 | | - | |
| 599 | + | |
582 | 600 | | |
583 | | - | |
| 601 | + | |
584 | 602 | | |
585 | 603 | | |
586 | 604 | | |
| |||
0 commit comments