
Save request outputs and add eval accuracy support #8

Merged: 6 commits into AI-Hypercomputer:main on Mar 8, 2024

Conversation

FanhaiLu1 (Collaborator)

This PR adds two features:

  • Save request outputs to a local file path in JSON format. This is an optional feature of benchmark_serving; pass --save-request-outputs to enable it.
  • Measure the accuracy of the inference-generated text.

Below are command-line examples that use the features above:

python JetStream/benchmarks/benchmark_serving.py \
--tokenizer /home/{username}/maxtext/assets/tokenizer \
--num-prompts 10  \
--dataset ~/data/ShareGPT_V3_unfiltered_cleaned_split.json \
--save-request-outputs 
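
A minimal sketch, purely for illustration, of reading the saved outputs back in Python. The file name and the "prompt" / "generated_text" field names are assumptions, not the schema benchmark_serving is guaranteed to emit:

import json

# Hypothetical output path; point this at wherever benchmark_serving saved the file.
with open("request_outputs.json") as f:
    outputs = json.load(f)

for record in outputs:
    # Field names are illustrative assumptions; inspect the file for the real keys.
    print(record.get("prompt"), "->", record.get("generated_text"))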

python JetStream/benchmarks/eval_accuracy.py
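
The PR does not state which metric eval_accuracy.py uses. As a hedged sketch only, one plausible way to score generated text against the dataset's reference outputs is an averaged ROUGE F-measure; the file path and the "original_output" / "generated_text" keys below are hypothetical:

import json

from rouge_score import rouge_scorer  # pip install rouge-score

# Hypothetical file and schema; adapt to what benchmark_serving actually saves.
with open("request_outputs.json") as f:
    records = json.load(f)

scorer = rouge_scorer.RougeScorer(["rouge1", "rougeL"], use_stemmer=True)
totals = {"rouge1": 0.0, "rougeL": 0.0}
for rec in records:
    # score(target, prediction) returns per-metric precision/recall/fmeasure tuples.
    scores = scorer.score(rec["original_output"], rec["generated_text"])
    for name in totals:
        totals[name] += scores[name].fmeasure

print({name: total / len(records) for name, total in totals.items()})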


JoeZijunZhou (Collaborator) left a comment:


LGTM! Do you mind adding a README under /benchmarks to introduce the eval script? Thanks!

Resolved review threads: one on requirements.in and two on benchmarks/README.md (all marked outdated).
JoeZijunZhou merged commit 41ad033 into AI-Hypercomputer:main on Mar 8, 2024
3 checks passed