Adds config templates #295

hynky1999 · 2024-09-06T12:39:46Z

This PR brings an easier way to write task prompts that are language-aware.

The following templates are created:

multichoice
nli
copa
continuation

clefourrier

The templates folder mixes up a lot of items at different logical levels; we'll have to discuss the structure IRL.
Do we really need "NLITaskConfig, CopaTaskConfig, BoolQATaskConfig", etc.? I think it would be better to aim for a general structure instead of having lots of small ones.

pyproject.toml

src/lighteval/metrics/metrics_sample.py

src/lighteval/utils/language.py

src/lighteval/metrics/normalizations.py

src/lighteval/tasks/templates/formulation.py

src/lighteval/tasks/templates/multichoice_config.py

src/lighteval/tasks/templates/tasks.py

src/lighteval/utils/tokenizers.py

src/lighteval/tasks/templates/formatting_utils.py

src/lighteval/tasks/templates/formulation.py

hynky1999 · 2024-09-23T23:23:38Z

The templates folder mixes up a lot of items at different logical levels; we'll have to discuss the structure IRL. Do we really need "NLITaskConfig, CopaTaskConfig, BoolQATaskConfig", etc.? I think it would be better to aim for a general structure instead of having lots of small ones.

I removed the BoolQATask; as for the rest, I think they have their place because they contain some logic.
If you were super strict, only continuation and multichoice would make the cut. The thing is, I will need them for the multilang tasks, so I will create them one way or another. If you insist on greater separation, we can have general/task templates.

Secondly, I decided not to create a question-answer template, as one can just use multichoice with the CF formulation to achieve that.
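A minimal sketch of how question answering could reduce to multichoice in a CF-style formulation, where the answer options are not listed in the query and each choice is scored as a continuation. The `PromptDoc` class, the helper names, and the `Question:`/`Answer:` wording are all hypothetical and are not taken from this PR:

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PromptDoc:
    """Hypothetical stand-in for a prompt document; not lighteval's own class."""

    query: str
    choices: List[str]
    gold_index: int


def multichoice_cf_prompt(question: str, choices: List[str], gold_index: int) -> PromptDoc:
    # CF-style: the query holds only the question; the options are not enumerated.
    query = f"Question: {question}\nAnswer:"
    # Each choice is kept as a candidate continuation of the query.
    return PromptDoc(query=query, choices=[f" {c}" for c in choices], gold_index=gold_index)


def qa_prompt(question: str, answer: str) -> PromptDoc:
    # Question answering as multichoice with a single (gold) choice.
    return multichoice_cf_prompt(question, [answer], gold_index=0)
```

Under these assumptions, a dedicated question-answer template would only wrap the multichoice one, which is the argument made above for not adding it.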

src/lighteval/tasks/templates/continuation.py

clefourrier

Thanks for the PR!
Some comments:

Avoid too many layers of nested functions: in continuation, you nest 3 functions inside another function over about 100 lines of code, which is not very clear.
In general, you also need to be careful not to change the naming conventions of the lib (for example, we use choices and you used options); it would be best if you could rename to fit our naming convention.
Other comments are mostly nits, requests for docs, comment suggestions, etc.
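One possible way to reduce the nesting mentioned above is to move the inner closures to module level and bind their configuration with `functools.partial`. This is only a sketch: the function names and the dictionary layout are hypothetical, and only the `"{instruction}{context}"` query format and the `choices` naming come from this thread:

```python
from functools import partial
from typing import Callable


def _format_query(instruction: str, context: str) -> str:
    # Mirrors the "{instruction}{context}" query format quoted in this review.
    return f"{instruction}{context}"


def _continuation_prompt(line: dict, task_name: str, *, instruction: str) -> dict:
    # Module-level function instead of a closure nested inside a factory.
    return {
        "task_name": task_name,
        "query": _format_query(instruction, line["context"]),
        "choices": list(line["choices"]),  # `choices`, following the lib's naming
        "gold_index": line["gold_index"],
    }


def get_continuation_prompt(instruction: str = "") -> Callable[[dict, str], dict]:
    # The factory only binds configuration; no function is defined inside it.
    return partial(_continuation_prompt, instruction=instruction)
```

Each helper can then be tested and documented on its own, while the factory stays a one-liner.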

src/lighteval/tasks/templates/continuation.py

src/lighteval/tasks/templates/nli.py

src/lighteval/tasks/templates/utils/formatting_utils.py

src/lighteval/tasks/templates/continuation.py

src/lighteval/tasks/templates/copa.py

clefourrier · 2024-09-27T08:41:46Z

src/lighteval/tasks/templates/multichoice.py

+    instruction: NotRequired[str]
+
+
+# Python too dumb to do fancy inference :(


Remove or convert to actionable todo

clefourrier · 2024-09-27T08:44:42Z

src/lighteval/tasks/templates/nli.py

+    translation_literals = TRANSLATION_LITERALS[language]
+    adapter_fn = create_adapter_from_dict(adapter) if isinstance(adapter, dict) else adapter  # type: ignore
+
+    def nli_natural_prompt(line: dict, task_name: str):


add docstring explaining the diff between the 2 prompt functions

It should be clear from the format. The second function uses the natural function for the CF formulation.

clefourrier · 2024-09-27T08:44:54Z

src/lighteval/tasks/templates/nli.py

+        elif label == "contradiction":
+            return translation_literals.no
+        elif label == "neutral":
+            return translation_literals.also


else raise error?

That can't happen as long as you respect the typing; if you don't, that's on you.

I would not underestimate the creativity of some users ^^ and would add an error message, but the program will fail later anyway, so it's up to you.

Well, it's like calling load_dataset(False): it's undefined behaviour, so it's the user's problem.
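For reference, the explicit-error variant discussed in this thread could look roughly like the sketch below. The `Literals` container and the `label_to_word` helper are hypothetical stand-ins, and the `entailment` branch is an assumption, since that part of the function is not shown in the diff:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Literals:
    """Hypothetical stand-in for one language's translation literals."""

    yes: str
    no: str
    also: str


def label_to_word(label: str, literals: Literals) -> str:
    if label == "entailment":  # assumed branch, not visible in the diff
        return literals.yes
    elif label == "contradiction":
        return literals.no
    elif label == "neutral":
        return literals.also
    # Fail at the source instead of returning None and failing later.
    raise ValueError(
        f"Unknown NLI label {label!r}; expected 'entailment', 'contradiction' or 'neutral'"
    )
```

Without the final `raise`, an unexpected label would make the function return `None` implicitly, so the failure would only surface further downstream.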

src/lighteval/tasks/templates/continuation.py

src/lighteval/tasks/templates/copa.py

src/lighteval/tasks/templates/continuation.py

src/lighteval/tasks/templates/multichoice.py

clefourrier · 2024-09-27T10:05:51Z

src/lighteval/tasks/templates/continuation.py

+from lighteval.utils.utils import as_list
+
+
+CONTINUATION_QUERY_CF = "{instruction}{context}"


Would be good to explain a bit what the diff is between continuation and multichoice

I did so in the function docstring

Co-authored-by: Clémentine Fourrier <[email protected]>

…nto config_templates

clefourrier

LGTM, super nice, thanks for the changes

This reverts commit b014b50.

hynky1999 added 2 commits September 5, 2024 18:18

Merge branch 'geneartive_dynamic_metrics' into config_templates

2a5cdca

draft

2df9a08

hynky1999 marked this pull request as draft September 6, 2024 12:39

finish multichoice config

95729ee

clefourrier reviewed Sep 14, 2024

NathanHB reviewed Sep 16, 2024

src/lighteval/tasks/templates/formatting_utils.py (outdated)

NathanHB reviewed Sep 16, 2024

src/lighteval/tasks/templates/formulation.py (outdated)

finish implementation of templates + move stuff around

91d9d4f

hynky1999 changed the base branch from main to geneartive_dynamic_metrics September 23, 2024 18:05

hynky1999 added 5 commits September 24, 2024 00:35

Merge branch 'geneartive_dynamic_metrics' into config_templates

db36e16

nicers tests + fix them

44aeecf

nicer todo

2bff963

add nice doscrings 📃

3c9eb21

add even more docstring

4216ae2

hynky1999 marked this pull request as ready for review September 23, 2024 23:23

nit

d8f56b8

hynky1999 requested a review from clefourrier September 23, 2024 23:24

hynky1999 added 2 commits September 25, 2024 12:14

merge nli, add languagees to literals

7ca4239

translation literals

22eeddb

hynky1999 force-pushed the config_templates branch from 78f3518 to 22eeddb September 25, 2024 11:05

Merge branch 'geneartive_dynamic_metrics' into config_templates

2d09256