is_valid_char does not correctly follow the Unicode standard #11171
Comments
I agree, I understand that this means that functions should handle noncharacters gracefully and not bail out when encountering one.
Here is my proposed replacement, I'll submit a PR very shortly...
What is really meant here is that See Unicode 7.0.0, p119 (pdf):
The documentation of
perhaps even including a reference to definition in the Unicode standard.
The relevant function
Ref #11033
@jiahao Good point about specifying Unicode scalar values, and that would be good to fix
@ScottPJones Now, contrary to what was asked in other PRs, it might be better to fix the problem in utf8proc if that's indeed a bug there. :-) As long as we depend on utf8proc at all, better make it work right.
@nalimilan I didn't say that I wouldn't get around to fixing it in
Thanks, @ScottPJones! Very much appreciated.
@StefanKarpinski I'm positively 😊ing from the kind words today! 😉 I do owe all of you a beer (or cider) or two (or three) at the Muddy Charles during JuliaCon, for putting up with me being such a long-winded PITA!
No worries, @ScottPJones. Glad you've persevered.
This is per JuliaLang#11171
Add reference to issue JuliaLang#11171
is_valid_char returns false for values which are valid Unicode codepoints.
This is due to a misunderstanding of the way the 66 Unicode "noncharacter" code points are supposed to be handled. See: "FAQ - Private-Use Characters, Noncharacters, and Sentinels"
Here are the relevant sections:
[@jiahao - edited formatting of hyperlink]
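To make the distinction concrete, here is a minimal sketch in Python (not the Julia `is_valid_char` code or the utf8proc implementation). The function names `is_surrogate`, `is_noncharacter`, and `is_valid_scalar` are hypothetical, chosen only for illustration. It assumes that "valid" should mean "is a Unicode scalar value", i.e. any code point in U+0000..U+10FFFF except the surrogates, so the 66 noncharacters are accepted.

```python
MAX_CODE_POINT = 0x10FFFF  # highest Unicode code point


def is_surrogate(cp: int) -> bool:
    """True for the UTF-16 surrogate range U+D800..U+DFFF."""
    return 0xD800 <= cp <= 0xDFFF


def is_noncharacter(cp: int) -> bool:
    """True for the 66 noncharacters: U+FDD0..U+FDEF plus the last
    two code points (xxFFFE, xxFFFF) of each of the 17 planes."""
    if not 0 <= cp <= MAX_CODE_POINT:
        return False
    return 0xFDD0 <= cp <= 0xFDEF or (cp & 0xFFFE) == 0xFFFE


def is_valid_scalar(cp: int) -> bool:
    """Hypothetical validity check (an assumption, not Julia's code):
    accept every Unicode scalar value, noncharacters included."""
    return 0 <= cp <= MAX_CODE_POINT and not is_surrogate(cp)
```

Under these assumptions, `is_valid_scalar(0xFFFE)` and `is_valid_scalar(0xFDD0)` are both true, while `is_valid_scalar(0xD800)` and `is_valid_scalar(0x110000)` are false. Code that wants to flag noncharacters can call `is_noncharacter` separately instead of treating them as invalid.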