[`pyupgrade`] Fix parsing named Unicode escape sequences (`UP032`) by phongddo · Pull Request #21901 · astral-sh/ruff

phongddo · 2025-12-10T18:15:35Z

Summary

Fixes #19771

Fixes incorrect parsing of Unicode named escape sequences like Hey \N{snowman} in FormatString, which were being incorrectly split into separate literal and field parts instead of being treated as a single literal unit.

Problem

The FormatString parser incorrectly handles Unicode named escape sequences:

Current: Hey \N{snowman} is parsed into 2 parts Literal("Hey \N") & Field("snowman")
Expected: Hey \N{snowman} should be parsed into 1 part Literal("Hey \N{snowman}")

This affects f-string conversion rules when fixing UP032 that rely on proper format string parsing.

Solution

I modified parse_literal to detect and handle Unicode named escape sequences before parsing single characters:

Introduced a flag to track when a backslash is "available" to escape something.
When the flag is true, and the text starts with N{, try to parse the complete Unicode escape sequence as one unit, and set the flag to false after parsing successfully.
Set the flag to false when the backslash is already consumed.

Manual Verification

"\N{angle}AOB = {angle}°".format(angle=180)

Result

 def foo():
-    "\N{angle}AOB = {angle}°".format(angle=180)
+    f"\N{angle}AOB = {180}°"

Would fix 1 error.

"\N{snowman} {snowman}".format(snowman=1)

Result

 def foo():
-    "\N{snowman} {snowman}".format(snowman=1)
+    f"\N{snowman} {1}"

Would fix 1 error.

"\\N{snowman} {snowman}".format(snowman=1)

Result

 def foo():
-    "\\N{snowman} {snowman}".format(snowman=1)
+    f"\\N{1} {1}"

Would fix 1 error.

Test Plan

Added test cases (happy case, invalid case, edge case) for FormatString when parsing Unicode escape sequence.
Updated snapshots.

astral-sh-bot · 2025-12-10T18:24:28Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

ntBre

Thanks! This makes sense to me, I just had a couple of small suggestions about the tests.

I took a closer look at this code today, and I'm feeling much less wary than in #19774 (review). Our parser will have already flagged weird invalid cases like these that I brought up last time:

"\N{{angle}}".format(angle="angle")
"\N{LATIN {SMALL} LETTER A}"

So we only need to handle valid cases, which this PR seems to do in a nice way. I also ran the fuzzer for a little while, just in case.

crates/ruff_python_literal/src/format.rs

...f_linter/src/rules/pyupgrade/snapshots/ruff_linter__rules__pyupgrade__tests__UP032_0.py.snap

astral-sh-bot · 2025-12-11T00:49:26Z

Diagnostic diff on typing conformance tests

No changes detected when running ty on typing conformance tests ✅

astral-sh-bot · 2025-12-11T00:54:24Z

`mypy_primer` results

Changes were detected when running on open source projects

scikit-build-core (https://github.com/scikit-build/scikit-build-core)
+ src/scikit_build_core/build/wheel.py:98:20: error[no-matching-overload] No overload of bound method `__init__` matches arguments
- Found 41 diagnostics
+ Found 42 diagnostics

Memory usage changes were detected when running on open source projects

sphinx (https://github.com/sphinx-doc/sphinx)
- WARN expected `heap_size` to be provided by Salsa query `class_based_items`
- WARN expected `heap_size` to be provided by Salsa query `class_based_items`
- WARN expected `heap_size` to be provided by Salsa query `class_based_items`
- WARN expected `heap_size` to be provided by Salsa query `class_based_items`

prefect (https://github.com/PrefectHQ/prefect)
+ WARN expected `heap_size` to be provided by Salsa query `class_based_items`
+ WARN expected `heap_size` to be provided by Salsa query `class_based_items`
+ WARN expected `heap_size` to be provided by Salsa query `class_based_items`
+ WARN expected `heap_size` to be provided by Salsa query `class_based_items`

phongddo · 2025-12-15T18:40:49Z

Hey @ntBre 👋 just in case you missed the updates on this PR. I've addressed your comments about the tests. Please take a look when you have a moment and let me know if there's anything else I should adjust 🙏

ntBre

Thank you!

This looks good to go, just one small test comment update, and I think I preferred your old tests (to avoid adding a dev-dependency to this crate). Sorry for the flip-flop and the delay.

...f_linter/src/rules/pyupgrade/snapshots/ruff_linter__rules__pyupgrade__tests__UP032_0.py.snap

crates/ruff_python_literal/src/format.rs

This reverts commit 10aeb6d.

This reverts commit 99a52eb.

ntBre

Thank you!

phongddo added 4 commits December 10, 2025 17:35

fix unicode escape parsing

ca9f3c7

update snapshot test

82ed9ba

fix when toggling pending escape flag

2cb77c6

fix clippy

7efede9

phongddo marked this pull request as ready for review December 10, 2025 18:41

phongddo mentioned this pull request Dec 10, 2025

UP032 fix misinterprets \N escape sequence as interpolation #19771

Closed

ntBre self-requested a review December 10, 2025 20:20

ntBre reviewed Dec 11, 2025

View reviewed changes

crates/ruff_python_literal/src/format.rs Show resolved Hide resolved

...f_linter/src/rules/pyupgrade/snapshots/ruff_linter__rules__pyupgrade__tests__UP032_0.py.snap Show resolved Hide resolved

ntBre added bug Something isn't working fixes Related to suggested fixes for violations labels Dec 11, 2025

update snapshot tests

99a52eb

fix clippy

10aeb6d

phongddo requested a review from ntBre December 11, 2025 00:52

phongddo changed the title ~~Fix Unicode named escape squence parsing in FormatString when fixing UP032~~ [pyupgrade] Fix Unicode named escape squence parsing in FormatString when fixing UP032 Dec 12, 2025

ntBre reviewed Dec 16, 2025

View reviewed changes

...f_linter/src/rules/pyupgrade/snapshots/ruff_linter__rules__pyupgrade__tests__UP032_0.py.snap Outdated Show resolved Hide resolved

crates/ruff_python_literal/src/format.rs Show resolved Hide resolved

phongddo added 3 commits December 16, 2025 19:08

Revert "fix clippy"

0cfca59

This reverts commit 10aeb6d.

Revert "update snapshot tests"

5305013

This reverts commit 99a52eb.

update snapshot test

bfbee7b

ntBre approved these changes Dec 16, 2025

View reviewed changes

ntBre changed the title ~~[pyupgrade] Fix Unicode named escape squence parsing in FormatString when fixing UP032~~ [pyupgrade] Fix parsing named Unicode escape sequences (UP032) Dec 16, 2025

ntBre merged commit b0bc990 into astral-sh:main Dec 16, 2025
37 checks passed

BrewTestBot mentioned this pull request Dec 18, 2025

ruff 0.14.10 Homebrew/homebrew-core#259273

Merged

This was referenced Dec 18, 2025

UP032 fix misinterprets \N in a raw string as an escape sequence #22060

Closed

[pyupgrade] Fix handling of \N in raw strings (UP032) #22149

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

[`pyupgrade`] Fix parsing named Unicode escape sequences (`UP032`)#21901

[`pyupgrade`] Fix parsing named Unicode escape sequences (`UP032`)#21901
ntBre merged 9 commits intoastral-sh:mainfrom
phongddo:phongddo/fstring-escape-n

phongddo commented Dec 10, 2025 •

edited

Loading

Uh oh!

astral-sh-bot bot commented Dec 10, 2025 •

edited

Loading

Uh oh!

ntBre left a comment

Uh oh!

Uh oh!

Uh oh!

astral-sh-bot bot commented Dec 11, 2025 •

edited

Loading

Uh oh!

astral-sh-bot bot commented Dec 11, 2025

Uh oh!

phongddo commented Dec 15, 2025

Uh oh!

ntBre left a comment

Uh oh!

Uh oh!

Uh oh!

ntBre left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

phongddo commented Dec 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Problem

Solution

Manual Verification

Test Plan

Uh oh!

astral-sh-bot bot commented Dec 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

ruff-ecosystem results

Linter (stable)

Linter (preview)

Uh oh!

ntBre left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

astral-sh-bot bot commented Dec 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Diagnostic diff on typing conformance tests

Uh oh!

astral-sh-bot bot commented Dec 11, 2025

mypy_primer results

Uh oh!

phongddo commented Dec 15, 2025

Uh oh!

ntBre left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ntBre left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

phongddo commented Dec 10, 2025 •

edited

Loading

astral-sh-bot bot commented Dec 10, 2025 •

edited

Loading

`ruff-ecosystem` results

astral-sh-bot bot commented Dec 11, 2025 •

edited

Loading

`mypy_primer` results