Skip to content

Adding Japanese translation to Mondo #7928

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 19 commits into from
Mar 28, 2025
Merged

Adding Japanese translation to Mondo #7928

merged 19 commits into from
Mar 28, 2025

Conversation

matentzn
Copy link
Member

@matentzn matentzn commented Jul 6, 2024

Some initial experiments adding an international version of Mondo with translations, but mostly for later this year, not now.

@matentzn matentzn self-assigned this Jul 29, 2024
@matentzn matentzn changed the title WIP: Adding Japanese translation to Mondo Adding Japanese translation to Mondo Mar 25, 2025
@matentzn matentzn marked this pull request as ready for review March 25, 2025 08:19
@matentzn matentzn requested a review from twhetzel as a code owner March 25, 2025 08:19
@twhetzel
Copy link
Collaborator

@matentzn is this assigned to me or was my addition only through CODEOWNERS? Also, @sabrinatoro is not assigned to this even though you tagged her about this PR in Slack.

@matentzn
Copy link
Member Author

Yes correct, only you can review this (changes are all highly technical), but I am not sure if it's worth your time until we get a decision we want to go ahead with this..

@twhetzel
Copy link
Collaborator

Ok, let me know when this should be reviewed.

@twhetzel
Copy link
Collaborator

We decided in Slack this should be reviewed and added for the upcoming April release.

Copy link
Collaborator

@twhetzel twhetzel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I ran sh run.sh make translations -B to test this, but ran into this error:

make[1]: *** [Makefile:735: mirror-ncbigene] Error 1
rm imports/foodon_terms.txt imports/omo_terms.txt imports/ncbigene_terms.txt imports/envo_terms.txt imports/ncit_terms.txt
make[1]: Leaving directory '/work/src/ontology'
make: *** [mondo.Makefile:1322: translations] Error 2

@matentzn
Copy link
Member Author

This goal should be deleted, I only used it during testing. Make mondo-international.owl (with IMP=false etc)

@twhetzel
Copy link
Collaborator

twhetzel commented Mar 26, 2025

Ok, I will delete the goal translations and test with make mondo-international.owl. I was wondering why each of those existed.

@twhetzel twhetzel assigned twhetzel and unassigned matentzn Mar 26, 2025
@twhetzel twhetzel self-requested a review March 26, 2025 21:48
Copy link
Collaborator

@twhetzel twhetzel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I re-ran as sh run.sh make mondo-international.owl IMP=false MIR=false PAT=false -B

Error:

UnicodeDecodeError: 'utf-8' codec can't decode byte 0xff in position 0: invalid start byte
make: *** [mondo.Makefile:1334: ../translations/mondo-jp-preprocessed.babelon.tsv] Error 1

@twhetzel twhetzel assigned matentzn and unassigned twhetzel Mar 27, 2025
@twhetzel
Copy link
Collaborator

@matentzn there is an error running the make goal.

@twhetzel
Copy link
Collaborator

@matentzn following your email, I re-ran the goal as: sh run.sh make mondo-international.owl IMP=false MIR=false and that did work.

However, when I run the goal as: sh run.sh make mondo-international.owl IMP=false MIR=false -B that does not work with the same error I posted earlier.

UnicodeDecodeError: 'utf-8' codec can't decode byte 0xff in position 0: invalid start byte
make: *** [mondo.Makefile:1330: ../translations/mondo-jp-preprocessed.babelon.tsv] Error 1

The full logs are attached.

Are you able to get the goal to run when using -B?
LOGS_japanese-translation.txt

@twhetzel sorry about this you were right: the babelon file provided by DBCLS is UTF16 LE encoded which our tools cant handle. I asked them to change this to UTF8, but in the meantime I hack the encoding for the pipeline to work
@twhetzel twhetzel self-requested a review March 28, 2025 17:38
Copy link
Collaborator

@twhetzel twhetzel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

With updates to the file from DBCLS (dbcls/mondo-japanese#2) I was able to now run the goal successfully as:
sh run.sh make mondo-international.owl IMP=false MIR=false -B

@twhetzel twhetzel assigned twhetzel and unassigned matentzn Mar 28, 2025
@twhetzel twhetzel merged commit a4c9dba into master Mar 28, 2025
1 check passed
@twhetzel twhetzel deleted the japanese-translation branch March 28, 2025 17:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants