Skip to content

[EVAL] Aligning with oz-eval benchmark #2

@djmilena

Description

@djmilena

Aligning with oz-eval benchmark

oz-eval is evaluated through the pattern:

Pitanje: {question}

Ponuđeni odgovori:
A. {option_a}
B. {option_b}
C. {option_c}
D. {option_d}
E. {option_e}

Krajnji odgovor:

Please note that in your serbian_eval.py there is

 instruction = "Na osnovu sledećeg pitanja, izaberite tačanu opciju iz ponuđenih odgovora.\n"

which we are not using and should be avoided when evaluating this dataset.
In addition, this enumeration is not in line as well:

   for i, choice in enumerate(choices):
        query += f"{i}. {choice}\n"

since in our case array of letters A-E as shown in the example above.

Evaluation metadata

Hope this helps make the changes! We are available for any open question and further info 🤗

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

Status

✅ Done

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions