Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Switch bestbleu to chrF #908

Merged
merged 15 commits into from
Nov 4, 2024
Merged

Switch bestbleu to chrF #908

merged 15 commits into from
Nov 4, 2024

Conversation

eu9ene
Copy link
Collaborator

@eu9ene eu9ene commented Oct 30, 2024

chrF is now considered more reliable than BLEU, and should work better for CJK
Based on advice from #748

+unify sacrebleu and mtdata versions everywhere
+update config generator and skip some datasets there (it's not final though)

closes #748
closes #911

@eu9ene eu9ene requested review from a team as code owners October 30, 2024 00:53
@eu9ene eu9ene requested a review from jcristau October 30, 2024 00:53
@eu9ene eu9ene requested review from gregtatum and removed request for a team and jcristau October 30, 2024 23:24
Copy link
Member

@gregtatum gregtatum left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm marking as approved, but I'm a bit concerned on knowing if this code is correct without tests. I looked at the extract-best task, and there is no logging and at the end there is an outfile artifact which content. Clearly it's writing out something, but it's hard to know what's going on here. I'm fine if you want to go ahead and merge, since it's existing code, but we are using new code paths that haven't been proven yet.

https://github.com/mozilla/firefox-translations-training/runs/32308849855

utils/config_generator.py Outdated Show resolved Hide resolved
taskcluster/kinds/extract-best/kind.yml Outdated Show resolved Hide resolved
taskcluster/kinds/extract-best/kind.yml Outdated Show resolved Hide resolved
pipeline/translate/bestbleu.py Show resolved Hide resolved
@eu9ene eu9ene merged commit 8170966 into main Nov 4, 2024
37 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Do not use WMTNews as training! Check decoding for CJK
3 participants