adding fused moe kernel config for A100 TP2 by sahilsuneja1 · Pull Request #3240 · vllm-project/vllm

sahilsuneja1 · 2024-03-06T19:43:58Z

Using benchmark_mixtral_moe.py from #2979 to tune a fused moe kernel config for TP2 A100

Latency measurements using: python benchmarks/benchmark_latency.py --model=mistralai/Mixtral-8x7B-Instruct-v0.1 --input-len 1000 --output-len 50 -tp 2 --num-iters 100 --batch-size <bs>:

This PR:

BS: 1, Avg latency: 0.7621450612053741 seconds
BS: 2, Avg latency: 1.0328856136795366 seconds
BS: 4, Avg latency: 1.4489717756788014 seconds
BS: 8, Avg latency: 2.0341408042260447 seconds
BS: 16, Avg latency: 2.893355064672651 seconds
BS: 32, Avg latency: 4.530912061399431 seconds
BS: 64, Avg latency: 7.537396909691161 seconds

Compared to master:

BS: 1, Avg latency: 0.8453641083685216 seconds
BS: 2, Avg latency: 1.1280082573764958 seconds
BS: 4, Avg latency: 1.6140852882619947 seconds
BS: 8, Avg latency: 2.348028304380132 seconds
BS: 16, Avg latency: 3.5489811494306194 seconds
BS: 32, Avg latency: 5.627054951939499 seconds
BS: 64, Avg latency: 9.691197272467543 seconds

@njhill @pcmoritz

pcmoritz

Nice, thanks for adding this!

simon-mo · 2024-10-22T22:39:00Z

Closing as stale. Looks like there's already a JSON in place.

njhill · 2024-10-23T00:02:25Z

@simon-mo actually the JSON there is for TP4 ... I didn't realize that this never got merged 😅 .. I'll re-open and maybe we can add it...

njhill · 2024-10-23T00:04:36Z

Oh my bad I was looking at wrong fork 🤦

adding fused moe kernel config for A100 TP2

0bab2e1

Yard1 requested a review from pcmoritz March 7, 2024 18:03

pcmoritz approved these changes Mar 7, 2024

View reviewed changes

simon-mo closed this Oct 22, 2024

njhill reopened this Oct 23, 2024

njhill closed this Oct 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

adding fused moe kernel config for A100 TP2#3240

adding fused moe kernel config for A100 TP2#3240
sahilsuneja1 wants to merge 1 commit intovllm-project:mainfrom
sahilsuneja1:moe_tuning

sahilsuneja1 commented Mar 6, 2024

Uh oh!

pcmoritz left a comment

Uh oh!

simon-mo commented Oct 22, 2024

Uh oh!

njhill commented Oct 23, 2024

Uh oh!

njhill commented Oct 23, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

sahilsuneja1 commented Mar 6, 2024

Uh oh!

pcmoritz left a comment

Choose a reason for hiding this comment

Uh oh!

simon-mo commented Oct 22, 2024

Uh oh!

njhill commented Oct 23, 2024

Uh oh!

njhill commented Oct 23, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants