Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[meta] Make the pipeline reliable enough to train many languages #311

Open
23 of 34 tasks
gregtatum opened this issue Dec 16, 2023 · 3 comments
Open
23 of 34 tasks

[meta] Make the pipeline reliable enough to train many languages #311

gregtatum opened this issue Dec 16, 2023 · 3 comments
Assignees
Labels
meta A collection of sub-issues that uses a tasklist

Comments

@gregtatum
Copy link
Member

gregtatum commented Dec 16, 2023

This is the lists of tasks that we need to handle in order to ramp up our ability to train many languages. These are things that break training runs, make things difficult to use the pipeline, or make it difficult for multiple people to train at once.

Pipeline Usability

  1. taskcluster
  2. taskcluster
    bhearsum
  3. taskcluster
    bhearsum
  4. taskcluster
    gabrielBusta
  5. taskcluster tc-p1
    bhearsum
  6. blocker bug taskcluster tc-p1
    bhearsum
  7. taskcluster tc-p1
    bhearsum
  8. taskcluster
    bhearsum
  9. enhancement taskcluster
    gabrielBusta
  10. cost & perf taskcluster tc-p1
    bhearsum
  11. taskcluster tc-p1
    bhearsum
  12. taskcluster
  13. bug taskcluster
    bhearsum
  14. taskcluster
    bhearsum
  15. taskcluster
  16. taskcluster
  17. bug taskcluster
    bhearsum
  18. bug taskcluster
    bhearsum
  19. taskcluster
  20. taskcluster
    bhearsum
  21. taskcluster
  22. taskcluster
  23. cost & perf taskcluster
  24. bug taskcluster
  25. bug cost & perf taskcluster
@gregtatum gregtatum added the meta A collection of sub-issues that uses a tasklist label Dec 16, 2023
@gregtatum gregtatum changed the title [meta] Train many languages with a reliable pipeline [meta] Train 30 languages with a reliable pipeline Jan 16, 2024
@gregtatum gregtatum changed the title [meta] Train 30 languages with a reliable pipeline [meta] Make the pipeline reliable enough to train many languages Jan 16, 2024
@bhearsum
Copy link
Collaborator

Seeing as we're training many languages at once, should we call this done? #250 is the only remaining issue open in the list, and it is mostly fixed (aside from some UI jank in Taskcluster).

@eu9ene
Copy link
Collaborator

eu9ene commented Jun 25, 2024

I think we're still struggling with a bunch of issues, so I'd add them here and keep this one open until we can confidently train new languages without breakage.

@bhearsum
Copy link
Collaborator

Between this and #490 there are a fairly hefty number of issues related to Taskcluster with various dependencies between them. I created a simple diagram with dependencies and current status to hopefully help with this, which I will try to keep up to date.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
meta A collection of sub-issues that uses a tasklist
Projects
None yet
Development

No branches or pull requests

3 participants