ADD GPT-4 as Judge #206

philschmid · 2024-06-20T13:33:02Z

No description provided.

src/lighteval/metrics/metrics_sample.py

clefourrier

LGTM, ideally you'd also want to add it in metrics

philschmid · 2024-07-03T15:31:57Z

ideally you'd also want to add it in metrics

Where?

clefourrier · 2024-07-03T15:35:16Z

src/lighteval/metrics/metrics.py, like these 2 atm


    llm_judge_multi_turn_openai = SampleLevelMetricGrouping(
        metric=["single_turn", "multi_turn"],
        higher_is_better=True,
        category=MetricCategory.LLM_AS_JUDGE_MULTI_TURN,
        use_case=MetricUseCase.SUMMARIZATION,
        sample_level_fn=JudgeLLM(
            judge_model_name="gpt-3.5-turbo",
            template_path=os.path.join(os.path.dirname(__file__), "judge_prompts.jsonl"),
            multi_turn=True,
        ).compute,
        corpus_level_fn={
            "single_turn": np.mean,
            "multi_turn": np.mean,
        },
    )
    llm_judge_openai = SampleLevelMetricGrouping(
        metric=["judge_score"],
        higher_is_better=True,
        category=MetricCategory.LLM_AS_JUDGE,
        use_case=MetricUseCase.SUMMARIZATION,
        sample_level_fn=JudgeLLM(
            judge_model_name="gpt-3.5-turbo",
            template_path=os.path.join(os.path.dirname(__file__), "", "judge_prompts.jsonl"),
            multi_turn=False,
        ).compute,
        corpus_level_fn={
            "judge_score": np.mean,
        },
    )

(name might be slightly different)

clefourrier · 2024-07-03T15:35:45Z

Basically it allows the judges you defined to be in the general metrics available to users

* ADD GPT-4 as Judge * Fix style --------- Co-authored-by: Clémentine Fourrier <[email protected]>

ADD GPT-4 as Judge

7587843

philschmid requested a review from NathanHB June 20, 2024 13:33

clefourrier reviewed Jul 3, 2024

View reviewed changes

src/lighteval/metrics/metrics_sample.py Outdated Show resolved Hide resolved

Fix style

795f212

clefourrier approved these changes Jul 3, 2024

View reviewed changes

clefourrier added 2 commits July 3, 2024 17:42

Merge branch 'main' into add-gpt-4-judge

934cadd

Merge branch 'main' into add-gpt-4-judge

8cf0a64

clefourrier merged commit 0bceaee into main Jul 4, 2024

hynky1999 pushed a commit to hynky1999/lighteval that referenced this pull request Jul 12, 2024

ADD GPT-4 as Judge (huggingface#206)

7a4bf2a

* ADD GPT-4 as Judge * Fix style --------- Co-authored-by: Clémentine Fourrier <[email protected]>

hynky1999 pushed a commit that referenced this pull request May 22, 2025

ADD GPT-4 as Judge (#206)

d5d5d54

* ADD GPT-4 as Judge * Fix style --------- Co-authored-by: Clémentine Fourrier <[email protected]>

NathanHB pushed a commit that referenced this pull request Sep 19, 2025

ADD GPT-4 as Judge (#206)

1f1a5df

* ADD GPT-4 as Judge * Fix style --------- Co-authored-by: Clémentine Fourrier <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ADD GPT-4 as Judge #206

ADD GPT-4 as Judge #206

Uh oh!

philschmid commented Jun 20, 2024

Uh oh!

Uh oh!

clefourrier left a comment

Uh oh!

philschmid commented Jul 3, 2024

Uh oh!

clefourrier commented Jul 3, 2024

Uh oh!

clefourrier commented Jul 3, 2024

Uh oh!

Uh oh!

ADD GPT-4 as Judge #206

ADD GPT-4 as Judge #206

Uh oh!

Conversation

philschmid commented Jun 20, 2024

Uh oh!

Uh oh!

clefourrier left a comment

Choose a reason for hiding this comment

Uh oh!

philschmid commented Jul 3, 2024

Uh oh!

clefourrier commented Jul 3, 2024

Uh oh!

clefourrier commented Jul 3, 2024

Uh oh!

Uh oh!