LTeX doesn't recognize macron #269
Comments
Thanks for the report. LTeX needs to manually map each letter to the correct Unicode code point. As Do you just need support for
I am typesetting a text with a romanization of Japanese pronunciation. Acute, grave, and circumflex accents are used in that romanization to indicate pitch variation and can appear above any letter, even a consonant. For example: What would be ideal is if I could add Macron is used only above n̄ and ḡ to indicate nasal pronunciation. But n̄ can also carry an acute above the macron: In this case it would be great to have ben̄kyoo in the dictionary.
Ah, okay. Then we'll have to use Unicode's combining diacritical marks as there is a Unicode character I think we can't just ignore the diacritics, otherwise we'd have many false negatives for all the languages which use them (languages in which omitting them would be a spelling error). So in your last example, it would add
That doesn't make any difference, ḡ is still ignored:
It's absolutely fine. If it just doesn't ignore letters with diacritics at all (like ḡ above), it's all good. Thank you!
Edit: I noticed that Unicode places the combining diacritical marks after the letter, not before like in this comment. The point is still valid, though. That doesn't make any sense, there's no logic in LTeX that would remove this, in contrast to Hex dump of the file:
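To check which form a file actually contains, its UTF-8 bytes can be inspected. The following is only an illustrative sketch, not the dump from this report; the class and method names (`Utf8HexDumpSketch`, `hex`) are made up for the example. It shows that a combining macron (U+0304) is encoded after its base letter, while the precomposed ḡ (U+1E21) is a single code point.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of LTeX: prints the UTF-8 bytes of a string in hex.
class Utf8HexDumpSketch {
    static String hex(String text) {
        StringBuilder out = new StringBuilder();
        for (byte b : text.getBytes(StandardCharsets.UTF_8)) {
            if (out.length() > 0) {
                out.append(' ');
            }
            // Mask to 0..255 so negative bytes are not sign-extended.
            out.append(String.format("%02X", b & 0xFF));
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Base letter followed by COMBINING MACRON (U+0304).
        System.out.println(hex("n\u0304"));
        System.out.println(hex("g\u0304"));
        // Precomposed LATIN SMALL LETTER G WITH MACRON (U+1E21).
        System.out.println(hex("\u1E21"));
    }
}
```

In the decomposed form the bytes of the base letter come first and the two bytes of the combining mark follow, which matches the ordering noted in the edit above.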
Sorry, mea culpa. You are right. My LTeX is set to check on save, and I forgot to save when I experimented. Thanks for the workaround!
See also valentjn/ltex-ls#56 (comment). Having \H{o} and \H{u} would be useful in Hungarian. I made a PR valentjn/ltex-ls#57
I replaced the whole accent table with Java's Unicode normalization algorithm. This means that combined characters will be used if they exist, otherwise combining diacritical marks will be used (Normalization Form C in Unicode Standard Annex #15). Combining multiple accents on a single letter (e.g.,
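As a rough illustration of the approach described in this comment (not the actual LTeX code), the sketch below appends a combining mark to the base letter and lets `java.text.Normalizer` compose the result under NFC. The class `AccentNormalizationSketch`, the method `accent`, and the small command table are assumptions made up for this example; only `Normalizer.normalize` and `Normalizer.Form.NFC` are real JDK API.

```java
import java.text.Normalizer;
import java.util.Map;

// Hypothetical sketch, not LTeX's implementation.
class AccentNormalizationSketch {
    // Assumed mapping from a few LaTeX accent commands to combining marks.
    static final Map<String, String> COMBINING_MARKS = Map.of(
        "\\'", "\u0301",  // combining acute accent
        "\\`", "\u0300",  // combining grave accent
        "\\^", "\u0302",  // combining circumflex accent
        "\\=", "\u0304",  // combining macron
        "\\H", "\u030B"   // combining double acute accent
    );

    static String accent(String command, String base) {
        String mark = COMBINING_MARKS.get(command);
        if (mark == null) {
            throw new IllegalArgumentException("unknown accent command: " + command);
        }
        // The mark goes after the base letter; NFC then composes the pair
        // into a single code point when Unicode defines one.
        return Normalizer.normalize(base + mark, Normalizer.Form.NFC);
    }

    public static void main(String[] args) {
        for (String base : new String[] {"g", "n", "o"}) {
            String result = accent("\\=", base);
            System.out.println(base + " -> " + result.length() + " UTF-16 unit(s)");
        }
    }
}
```

With this scheme, `\={g}` becomes the precomposed ḡ (U+1E21), whereas `\={n}` stays as `n` followed by U+0304 because Unicode has no precomposed n with macron. Likewise `\H{o}` composes to ő (U+0151), the Hungarian case mentioned earlier in the thread.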
Fix released in 10.0.0.
Thanks!
Usually, if I have a word unfamiliar to LTeX, it underlines it, and in the tooltip I can choose to add the word to the dictionary. If the word has letters with a grave or acute accent, or with a circumflex, LTeX properly adds all diacritics to the dictionary. However, it skips letters with macrons completely.
Take a look at the following MWE:
It's a single word, "Nihóŋḡo". Here is what the tooltip looks like:
You can see it shows "Nihóŋo" instead of "Nihóŋḡo", and "Nihóŋo" is also what gets added to the dictionary.