questions in vicuna_bench with reference_answer cannot run #2508

toslunar · 2023-10-03T03:09:26Z

I'd like to use vicuna_bench. After python gen_api_answer.py --bench-name vicuna_bench --model gpt-3.5-turbo succeeds, python gen_judgment.py --bench-name vicuna_bench --model-list gpt-3.5-turbo failed with

88%|█████████████████████████████████████████████████████████████████████████████████████████▎            | 70/80 [17:27<02:29, 14.97s/it]
Traceback (most recent call last):
  File ".../FastChat/fastchat/llm_judge/gen_judgment.py", line 309, in <module>
    play_a_match_func(match, output_file=output_file)
  File ".../FastChat/fastchat/llm_judge/common.py", line 203, in play_a_match_single
    score, user_prompt, judgment = run_judge_single(
  File ".../FastChat/fastchat/llm_judge/common.py", line 141, in run_judge_single
    kwargs["ref_answer_2"] = ref_answer["choices"][0]["turns"][1]
IndexError: list index out of range

The text was updated successfully, but these errors were encountered:

toslunar mentioned this issue Oct 3, 2023

Fix for single turn dataset #2509

Merged

3 tasks

merrymercy closed this as completed in #2509 Oct 3, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

questions in vicuna_bench with reference_answer cannot run #2508

questions in vicuna_bench with reference_answer cannot run #2508

toslunar commented Oct 3, 2023

questions in vicuna_bench with reference_answer cannot run #2508

questions in vicuna_bench with reference_answer cannot run #2508

Comments

toslunar commented Oct 3, 2023