You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is not a bug. URIs, and particularly IRIs, use Unicode normalization form KC to eliminate visual ambiguities which may result in phishing attacks. NFKC splits that codepoint up to the characters that Addressable is giving you. If this behavior is undesirable for your use-case, you can normalize instead on a component-by-component basis.
Specifically, this produces an incorrect result:
Note that the normalized URL no longer matches.
I think this is related to Addressable::IDNA.unicode_normalize_kc
Specifiaclly:
The output is now two UTF-8 characters, when previously it was one.
The text was updated successfully, but these errors were encountered: