Unicode escape sequences are not preserved #196

linuxdaemon · 2024-05-06T16:02:09Z

This seems related to #55 and #104, but both of those are closed as completed and the issue is still present in v1.0.1.

Example

flynt -tj -s "'\u2122'.join(('a', 'b'))"

Results in:
"a™b"
Instead of:
"a\u2122b"

This seems to also occur with octal values:

flynt -tj -s "'\40'.join(('a', 'b'))"

returns:
"a b"


ikamensh · 2024-08-21T15:07:57Z

This is a known and unfortunate limitation. Once Python parses your code, which I need to do to get the abstract syntax tree, it's no longer possible to determine whether a character was written as an escape sequence or as a literal special character. It might be possible to read the file as bytes, find the location of each expression in the file, and so see whether it's a literal Unicode character (I think), so there could be two fixes:

1. Just detect the usage of escape sequences and raise ConversionRefused, to make flynt skip this expression (easier).
2. Actually preserve Unicode escape sequences where present.
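
To illustrate the limitation and the first (easier) option, here is a minimal sketch using only the standard library. It is not flynt's actual code: `same_ast` and `has_char_escapes` are hypothetical helper names, and the check is deliberately conservative (for example, it also flags `\u` inside bytes literals, where it is not an escape). It assumes the input contains ordinary string literals; on Python 3.12+ f-strings are tokenized differently and would need separate handling.

```python
import ast
import io
import re
import tokenize

# Characters that, right after a backslash, start an escape spelling out a
# character by number or name: octal digits, \x, \u, \U and \N{...}.
_ESCAPE_STARTS = set("01234567xuUN")


def same_ast(a: str, b: str) -> bool:
    """Return True if two source snippets parse to the same syntax tree."""
    return ast.dump(ast.parse(a)) == ast.dump(ast.parse(b))


def has_char_escapes(source: str) -> bool:
    """Hypothetical check: does any non-raw string literal in `source`
    contain an octal, hex, Unicode or named escape sequence?"""
    tokens = tokenize.generate_tokens(io.StringIO(source).readline)
    for tok in tokens:
        if tok.type != tokenize.STRING:
            continue
        # The string prefix (r, b, u, f, ...) is the leading run of letters.
        prefix = re.match(r"[A-Za-z]*", tok.string).group().lower()
        if "r" in prefix:
            continue  # raw strings have no escape processing
        # Scan backslash pairs left to right so that an escaped backslash
        # followed by "u2122" is not mistaken for a Unicode escape.
        for match in re.finditer(r"\\(.)", tok.string, re.DOTALL):
            if match.group(1) in _ESCAPE_STARTS:
                return True
    return False
```

After parsing, `same_ast(r"'\u2122'", "'™'")` is true, which is why the escape cannot be recovered from the tree alone. A caller could run `has_char_escapes` on the original source text of an expression and, if it returns true, raise ConversionRefused so the expression is skipped.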

ikamensh added the bug and enhancement labels · Aug 21, 2024
