Eval hackathon #752

fyvo · 2022-04-27T15:13:46Z

Added prompts for machine translation from WMT 14 en-fr (both directions, so these are two separate tasks, the prompt name tells the direction). Templates are close to ones used in GPT-3 and T5 papers. Most prompts come in two flavours: the source language is either explicit or must be inferred. One prompt is also translated in French. WMT14 datasets are in Huggingface Datasets but are unavailable from the promptsource interface. The filter_english_datasets method in utils.py had to be locally modified to include these (huge) datasets.

thinkzink · 2022-04-28T13:38:41Z

I would have changed "If the original version says" to "If the English version says", and I likewise would be hesitant about alternating between using "passage" vs. "sentence" in the prompt, especially considering e.g. the use of questions in the dataset. But maybe that's too pedantic?

jzf2101 · 2022-06-27T04:15:16Z

@fyvo there are build errors. Could you merge with main and repush?

VictorSanh · 2022-06-27T21:55:31Z

@fyvo could you run the test show_new_templates locally since you have already downloaded the datasets?

Merging back the main eval-hackathon branch into this PR won't solve the tests, since it's a GH action limitation (disk space limitation), even though you should do it cause a few things have been updated!

If they pass locally, then let's merge this!

fyvo · 2022-06-28T21:04:50Z

@VictorSanh I just ran the test for wmt14/fr-en, the result is in your mailbox

…s from X into Y

VictorSanh

Thank you @fyvo !
I fixed a small bug in the prompt names
I ran all the tests locally, and they all pass except for wmt17/zh-en, wmt18/zh-en, and wmt19/zh-en, which is not a problem in the prompts but in HF datasets. The prompts lgtm though so still merging

see huggingface/datasets#4575

fyvo added 3 commits April 27, 2022 14:55

test prompts for newsco

4d4e3d4

sample prompts for zero-shot mt

70823d0

one more pattern for wmt14/en-fr

4c5d800

awebson assigned thinkzink Apr 27, 2022

fyvo and others added 2 commits April 28, 2022 13:23

Fix an inverted fr-en prompt for wmt14

e642b56

Merge branch 'bigscience-workshop:eval-hackathon' into eval-hackathon

5a9c93f

Prompts for WMT14 de-en, adapted directly from fr-en

7272271

thinkzink approved these changes May 4, 2022

View reviewed changes

fyvo and others added 16 commits May 7, 2022 11:29

Merge branch 'bigscience-workshop:eval-hackathon' into eval-hackathon

2e0f4fa

Merge branch 'bigscience-workshop:eval-hackathon' into eval-hackathon

27d6a30

Merge branch 'bigscience-workshop:eval-hackathon' into eval-hackathon

a47afc1

Merge branch 'bigscience-workshop:eval-hackathon' into eval-hackathon

28cb0e8

fixed duplicate uuids, add a new set of prompts

f086b81

restore the original template

fc14336

Merge branch 'bigscience-workshop:eval-hackathon' into eval-hackathon

815c09b

Merge branch 'bigscience-workshop:eval-hackathon' into eval-hackathon

bbe3981

Merge branch 'bigscience-workshop:eval-hackathon' into eval-hackathon

7ebc5ba

remove prompts in the French language to simplify copies

541cc2c

remove faulty german prompts

ba82480

Add prompts for WMT14 hi-En

b6a4fa9

Prompts for wmt14 cs-en

62bcefe

prompts for wmt14-wmt19 all news tasks

dd1f445

Merge branch 'bigscience-workshop:eval-hackathon' into eval-hackathon

9711123

added 4 new glm like prompts for wmt* tasks

fb9d6ef

Merge branch 'bigscience-workshop:eval-hackathon' into eval-hackathon

14a8ff4

Merge branch 'bigscience-workshop:eval-hackathon' into eval-hackathon

5e3b223

fyvo and others added 2 commits June 29, 2022 15:29

Small change to the translate-* family of prompts - now Translate thi…

443a755

…s from X into Y

fix names

2608a38

VictorSanh approved these changes Jun 29, 2022

View reviewed changes

VictorSanh merged commit 29afd13 into bigscience-workshop:eval-hackathon Jun 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Eval hackathon #752

Eval hackathon #752

Uh oh!

fyvo commented Apr 27, 2022

Uh oh!

thinkzink commented Apr 28, 2022

Uh oh!

jzf2101 commented Jun 27, 2022

Uh oh!

VictorSanh commented Jun 27, 2022 •

edited

Loading

Uh oh!

fyvo commented Jun 28, 2022

Uh oh!

VictorSanh left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Eval hackathon #752

Eval hackathon #752

Uh oh!

Conversation

fyvo commented Apr 27, 2022

Uh oh!

thinkzink commented Apr 28, 2022

Uh oh!

jzf2101 commented Jun 27, 2022

Uh oh!

VictorSanh commented Jun 27, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fyvo commented Jun 28, 2022

Uh oh!

VictorSanh left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

VictorSanh commented Jun 27, 2022 •

edited

Loading