Replace verbatim text with `NOT_YET_IMPLEMENTED` #4904

MichaReiser · 2023-06-06T18:06:38Z

Summary

This PR replaces the verbatim_text builder with a not_yet_implemented builder that emits NOT_YET_IMPLEMENTED_<NodeKind> for not yet implemented nodes.

The motivation for this change is that partially formatting compound statements can result in incorrectly indented code, which is a syntax error:

def func_no_args():
  a; b; c
  if True: raise RuntimeError
  if False: ...
  for i in range(10):
    print(i)
    continue

Get's reformatted to

def func_no_args():
    a; b; c
    if True: raise RuntimeError
    if False: ...
    for i in range(10):
    print(i)
    continue

because our formatter does not yet support for statements and just inserts the text from the source.

Downsides

Using an identifier will not work in all situations. For example, an identifier is invalid in an Arguments position. That's why I kept verbatim_text around and e.g. use it in the Arguments formatting logic where incorrect indentations are impossible (to my knowledge). Meaning, verbatim_text we can opt in to verbatim_text when we want to iterate quickly on nodes that we don't want to provide a full implementation yet and using an identifier would be invalid.

Upsides

Running this on main discovered stability issues with the newline handling that were previously "hidden" because of the verbatim formatting. I guess that's an upside :)

Test Plan

None?

MichaReiser · 2023-06-06T18:06:51Z

Current dependencies on/for this PR:

main
- PR Format binary expressions #4862
  - PR Correctly handle newlines after/before comments #4895
    - PR Replace verbatim text with NOT_YET_IMPLEMENTED #4904 👈
      - PR Trailing own line comments before func or class #4921
        
        PR Simple lexer for formatter #4922

This comment was auto-generated by Graphite.

MichaReiser · 2023-06-06T18:08:10Z

...pshots/ruff_python_formatter__tests__black_test__attribute_access_on_number_literals_py.snap

-x = 0O777 .real
-x = 0.000000006  .hex()
-x = -100.0000J
+NOT_YET_IMPLEMENTED_StmtAssign


An added benefit of this change is that it is easier to spot what formatting is implemented and what is incorrectly formatted because it is not yet supported.

MichaReiser · 2023-06-06T18:25:07Z

crates/ruff_python_formatter/src/builders.rs



-three_leading_newlines = 80
+NOT_YET_IMPLEMENTED_StmtAssign


I agree, this is stupid... but it is how it is.

github-actions · 2023-06-06T18:55:20Z

PR Check Results

Ecosystem

✅ ecosystem check detected no changes.

Benchmark

Linux

group                                      main                                   pr
-----                                      ----                                   --
formatter/large/dataset.py                 1.00      6.0±0.05ms     6.7 MB/sec    1.00      6.0±0.02ms     6.8 MB/sec
formatter/numpy/ctypeslib.py               1.00   1177.2±2.60µs    14.1 MB/sec    1.00   1173.2±1.40µs    14.2 MB/sec
formatter/numpy/globals.py                 1.00    132.5±0.44µs    22.3 MB/sec    1.00    132.7±1.14µs    22.2 MB/sec
formatter/pydantic/types.py                1.00      2.6±0.00ms     9.8 MB/sec    1.00      2.6±0.01ms     9.8 MB/sec
linter/all-rules/large/dataset.py          1.00     14.8±0.05ms     2.7 MB/sec    1.01     15.0±0.07ms     2.7 MB/sec
linter/all-rules/numpy/ctypeslib.py        1.00      3.6±0.02ms     4.6 MB/sec    1.00      3.6±0.02ms     4.6 MB/sec
linter/all-rules/numpy/globals.py          1.00    365.1±2.05µs     8.1 MB/sec    1.00    365.5±1.67µs     8.1 MB/sec
linter/all-rules/pydantic/types.py         1.00      6.2±0.01ms     4.1 MB/sec    1.01      6.2±0.02ms     4.1 MB/sec
linter/default-rules/large/dataset.py      1.00      7.3±0.01ms     5.6 MB/sec    1.00      7.3±0.01ms     5.5 MB/sec
linter/default-rules/numpy/ctypeslib.py    1.00   1529.5±4.95µs    10.9 MB/sec    1.00   1530.3±4.98µs    10.9 MB/sec
linter/default-rules/numpy/globals.py      1.00    165.2±0.32µs    17.9 MB/sec    1.00    165.0±0.62µs    17.9 MB/sec
linter/default-rules/pydantic/types.py     1.00      3.3±0.00ms     7.8 MB/sec    1.01      3.3±0.03ms     7.7 MB/sec

Windows

group                                      main                                   pr
-----                                      ----                                   --
formatter/large/dataset.py                 1.01      6.8±0.07ms     6.0 MB/sec    1.00      6.7±0.07ms     6.1 MB/sec
formatter/numpy/ctypeslib.py               1.00  1305.2±20.25µs    12.8 MB/sec    1.00  1302.9±21.46µs    12.8 MB/sec
formatter/numpy/globals.py                 1.01    144.0±4.69µs    20.5 MB/sec    1.00    143.2±3.99µs    20.6 MB/sec
formatter/pydantic/types.py                1.00      2.9±0.05ms     8.7 MB/sec    1.00      2.9±0.04ms     8.7 MB/sec
linter/all-rules/large/dataset.py          1.02     17.0±0.18ms     2.4 MB/sec    1.00     16.6±0.18ms     2.4 MB/sec
linter/all-rules/numpy/ctypeslib.py        1.01      4.2±0.05ms     4.0 MB/sec    1.00      4.1±0.04ms     4.0 MB/sec
linter/all-rules/numpy/globals.py          1.00    486.7±4.86µs     6.1 MB/sec    1.01   491.7±11.42µs     6.0 MB/sec
linter/all-rules/pydantic/types.py         1.01      7.0±0.11ms     3.6 MB/sec    1.00      7.0±0.09ms     3.6 MB/sec
linter/default-rules/large/dataset.py      1.00      8.3±0.07ms     4.9 MB/sec    1.00      8.3±0.14ms     4.9 MB/sec
linter/default-rules/numpy/ctypeslib.py    1.00  1753.1±20.63µs     9.5 MB/sec    1.00  1752.9±18.94µs     9.5 MB/sec
linter/default-rules/numpy/globals.py      1.01    198.1±3.59µs    14.9 MB/sec    1.00    195.8±3.19µs    15.1 MB/sec
linter/default-rules/pydantic/types.py     1.01      3.8±0.04ms     6.8 MB/sec    1.00      3.7±0.03ms     6.9 MB/sec

MichaReiser · 2023-06-07T12:49:34Z

@MichaReiser started a stack merge that includes this pull request via Graphite.

MichaReiser · 2023-06-07T12:49:56Z

Graphite rebased this pull request as part of a merge.

MichaReiser · 2023-06-07T12:57:28Z

@MichaReiser merged this pull request with Graphite.

## Summary This PR replaces the `verbatim_text` builder with a `not_yet_implemented` builder that emits `NOT_YET_IMPLEMENTED_<NodeKind>` for not yet implemented nodes. The motivation for this change is that partially formatting compound statements can result in incorrectly indented code, which is a syntax error: ```python def func_no_args(): a; b; c if True: raise RuntimeError if False: ... for i in range(10): print(i) continue ``` Get's reformatted to ```python def func_no_args(): a; b; c if True: raise RuntimeError if False: ... for i in range(10): print(i) continue ``` because our formatter does not yet support `for` statements and just inserts the text from the source. ## Downsides Using an identifier will not work in all situations. For example, an identifier is invalid in an `Arguments ` position. That's why I kept `verbatim_text` around and e.g. use it in the `Arguments` formatting logic where incorrect indentations are impossible (to my knowledge). Meaning, `verbatim_text` we can opt in to `verbatim_text` when we want to iterate quickly on nodes that we don't want to provide a full implementation yet and using an identifier would be invalid. ## Upsides Running this on main discovered stability issues with the newline handling that were previously "hidden" because of the verbatim formatting. I guess that's an upside :) ## Test Plan None?

MichaReiser requested review from konstin and charliermarsh June 6, 2023 18:06

MichaReiser added internal An internal refactor or improvement formatter Related to the formatter labels Jun 6, 2023

MichaReiser commented Jun 6, 2023

View reviewed changes

MichaReiser marked this pull request as draft June 6, 2023 18:16

MichaReiser force-pushed the replace-verbatim-with-not-yet-implemented branch from a7c1782 to a55ccd8 Compare June 6, 2023 18:23

MichaReiser changed the base branch from main to comments-newline-handling June 6, 2023 18:23

MichaReiser marked this pull request as ready for review June 6, 2023 18:23

This was referenced Jun 6, 2023

Format binary expressions #4862

Merged

Correctly handle newlines after/before comments #4895

Merged

MichaReiser commented Jun 6, 2023

View reviewed changes

MichaReiser force-pushed the comments-newline-handling branch from b2ec055 to f23ff61 Compare June 7, 2023 07:03

MichaReiser force-pushed the replace-verbatim-with-not-yet-implemented branch from a55ccd8 to 59c3e35 Compare June 7, 2023 07:03

This was referenced Jun 7, 2023

Trailing own line comments before func or class #4921

Merged

Simple lexer for formatter #4922

Merged

konstin approved these changes Jun 7, 2023

View reviewed changes

Base automatically changed from comments-newline-handling to main June 7, 2023 12:49

Replace verbatim text with NOT_YET_IMPLEMENTED

48ce57f

MichaReiser force-pushed the replace-verbatim-with-not-yet-implemented branch from 59c3e35 to 48ce57f Compare June 7, 2023 12:49

MichaReiser merged commit bcf745c into main Jun 7, 2023

MichaReiser deleted the replace-verbatim-with-not-yet-implemented branch June 7, 2023 12:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace verbatim text with `NOT_YET_IMPLEMENTED` #4904

Replace verbatim text with `NOT_YET_IMPLEMENTED` #4904

MichaReiser commented Jun 6, 2023 •

edited

Loading

MichaReiser commented Jun 6, 2023 •

edited

Loading

MichaReiser Jun 6, 2023

MichaReiser Jun 6, 2023

github-actions bot commented Jun 6, 2023 •

edited

Loading

MichaReiser commented Jun 7, 2023

MichaReiser commented Jun 7, 2023

MichaReiser commented Jun 7, 2023



		three_leading_newlines = 80
		NOT_YET_IMPLEMENTED_StmtAssign

Replace verbatim text with NOT_YET_IMPLEMENTED #4904

Replace verbatim text with NOT_YET_IMPLEMENTED #4904

Conversation

MichaReiser commented Jun 6, 2023 • edited Loading

Summary

Downsides

Upsides

Test Plan

MichaReiser commented Jun 6, 2023 • edited Loading

MichaReiser Jun 6, 2023

Choose a reason for hiding this comment

MichaReiser Jun 6, 2023

Choose a reason for hiding this comment

github-actions bot commented Jun 6, 2023 • edited Loading

PR Check Results

Ecosystem

Benchmark

Linux

Windows

MichaReiser commented Jun 7, 2023

MichaReiser commented Jun 7, 2023

MichaReiser commented Jun 7, 2023

Replace verbatim text with `NOT_YET_IMPLEMENTED` #4904

Replace verbatim text with `NOT_YET_IMPLEMENTED` #4904

MichaReiser commented Jun 6, 2023 •

edited

Loading

MichaReiser commented Jun 6, 2023 •

edited

Loading

github-actions bot commented Jun 6, 2023 •

edited

Loading