Python/ruff - ruff - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Brent Westbrook	4ffbd496e3	Merge branch 'main' into brent/indent-lambda-params	2025-12-09 14:34:28 -05:00
Brent Westbrook	0bec5c0362	Fix comment placement in lambda parameters (#21868 ) Summary -- This PR makes two changes to comment placement in lambda parameters. First, we now insert a line break if the first parameter has a leading comment: ```py # input ( lambda * # comment 2 x: x ) # main ( lambda # comment 2 x: x ) # this PR ( lambda # comment 2 x: x ) ``` Note the missing space in the output from main. This case is currently unstable on main. Also note that the new formatting is more consistent with our stable formatting in cases where the lambda has its own dangling comment: ```py # input ( lambda # comment 1 * # comment 2 x: x ) # output ( lambda # comment 1 # comment 2 x: x ) ``` and when a parameter without a comment precedes the split `x`: ```py # input ( lambda y, * # comment 2 x: x ) # output ( lambda y, # comment 2 x: x ) ``` This does change the stable formatting, but I think such cases are rare (expecting zero hits in the ecosystem report), this fixes an existing instability, and it should not change any code we've previously formatted. Second, this PR modifies the comment placement such that `# comment 2` in these outputs is still a leading comment on the parameter. This is also not the case on main, where it becomes a [dangling lambda comment](https://play.ruff.rs/3b29bb7e-70e4-4365-88e0-e60fe1857a35?secondary=Comments). This doesn't cause any instability that I'm aware of on main, but it does cause problems when trying to adjust the placement of dangling lambda comments in #21385. Changing the placement in this way should not affect any formatting here. 
Test Plan -- New lambda tests, plus existing tests covering the cases above with multiple comments around the parameters (see lambda.py 122-143, and 122-205 or so more broadly) I also checked manually that the comments are now leading on the parameter: ```shell ❯ cargo run --bin ruff_python_formatter -- --emit stdout --target-version 3.10 --print-comments <<EOF ( lambda # comment 2 x: x ) EOF Finished `dev` profile [unoptimized + debuginfo] target(s) in 0.15s Running `target/debug/ruff_python_formatter --emit stdout --target-version 3.10 --print-comments` # Comment decoration: Range, Preceding, Following, Enclosing, Comment 21..32, None, Some((Parameters, 37..39)), (ExprLambda, 6..42), "# comment 2" { Node { kind: Parameter, range: 37..39, source: `x`, }: { "leading": [ SourceComment { text: "# comment 2", position: OwnLine, formatted: true, }, ], "dangling": [], "trailing": [], }, } ( lambda # comment 2 x: x ) ``` But I didn't see a great place to put a test like this. Is there somewhere I can assert this comment placement since it doesn't affect any formatting yet? Or is it okay to wait until we use this in #21385?	2025-12-09 14:07:48 -05:00
Brent Westbrook	21b442a4fc	accept snapshots	2025-12-09 08:53:53 -05:00
Brent Westbrook	e8540d9b08	format new dangling comments	2025-12-08 12:56:57 -05:00
Brent Westbrook	8ede14a083	move comments within lambda parameters to dangling lambda comments	2025-12-08 12:29:37 -05:00
Brent Westbrook	f20f3e0d49	fix assignment instability without parameters too	2025-12-05 16:08:37 -05:00
Brent Westbrook	0710e0bc3e	fix assignment instability with dangling comments	2025-12-05 16:03:46 -05:00
Brent Westbrook	1531c94b4e	revert the last two commits, back to a stable formatting	2025-12-05 15:44:21 -05:00
Brent Westbrook	86406c0bb1	wip	2025-12-05 15:41:49 -05:00
Brent Westbrook	8cbe03b318	add more cases without parameters	2025-12-05 13:58:06 -05:00
Brent Westbrook	25d70b408b	more tests	2025-12-05 12:50:18 -05:00
Brent Westbrook	5605387a77	add another dangling case between lambda and parameters	2025-12-05 10:22:32 -05:00
Brent Westbrook	43b53edcab	improve dangling header comment placement	2025-12-05 09:21:48 -05:00
Brent Westbrook	a4b4a82e61	add another dangling eol case	2025-12-05 08:34:55 -05:00
Brent Westbrook	dfd3460c7a	add some more tests	2025-12-04 18:02:52 -05:00
Brent Westbrook	3a20c6f196	copy mapper test case from can_omit_optional_parentheses	2025-12-04 10:25:13 -05:00
Brent Westbrook	2e84402f69	add comments and some supporting tests	2025-12-04 10:08:46 -05:00
Brent Westbrook	04963a6b6b	expand parent if the lambda body breaks	2025-12-03 14:16:16 -05:00
Brent Westbrook	258b1fd7eb	add wrapping case from the ecosystem check the lambda is hugging its enclosing parentheses when it shouldn't be. there seems to be an issue with `best_fitting!` because moving any of the options we're passing to it out of `best_fitting!` avoids this behavior. IR: ``` [ source_position(0), source_position(1), "[", group(expand: propagated, [ indent([ soft_line_break, "(", group([ indent([ soft_line_break, "lambda ", group(["eval_df, _"]), ": ", best_fitting([ [ [ <interned 0> [ "MetricValue(", group(expand: propagated, [ indent([ soft_line_break, group(expand: propagated, [ "scores=eval_df[", group([ indent([soft_line_break, "\"prediction\""]), soft_line_break ]), "].tolist", group(["()"]), ",", soft_line_break_or_space, "aggregate_results={", group([ indent([ soft_line_break, group([ "\"prediction_sum\": sum(", group([ indent([ soft_line_break, group([ "eval_df[", group([ indent([soft_line_break, "\"prediction\""]), soft_line_break ]), "]" ]) ]), soft_line_break ]), ")" ]) ]), soft_line_break ]), "}", if_group_breaks([","]), expand_parent ]) ]), soft_line_break ]), ")" ] ] ] [[group(expand: true, [<ref interned 0>])]] [ [ "(", indent([hard_line_break, <ref interned 0>]), hard_line_break, ")" ] ] ]) ]), soft_line_break ]), ")", if_group_breaks([","]), expand_parent ]), soft_line_break ]), "]", source_position(196), hard_line_break, source_position(196) ] ```	2025-12-03 11:52:48 -05:00
Brent Westbrook	a3400a017a	use parenthesize_if_expands for fluent call chains	2025-12-03 11:51:02 -05:00
Brent Westbrook	9ef9d0302d	fix another ecosystem call expansion	2025-12-03 10:06:48 -05:00
Brent Westbrook	6f6c09c72a	fix snapshot changes for cases with comments	2025-12-03 09:45:00 -05:00
Brent Westbrook	efa372b379	apply Micha's patch, fixing everything? Co-authored-by: Micha Reiser <micha@reiser.io>	2025-12-03 09:14:52 -05:00
Brent Westbrook	97850661fd	add too-eagerly parenthesized case from ecosystem the initial code here is only 83 characters wide, so the call expression should fit without wrapping the whole body in parens	2025-12-02 15:34:32 -05:00
Brent Westbrook	89dc1ada39	add a couple more test cases I tried to get Claude to come up with tests, but most of them weren't very interesting. I think these two additional types of assignments might be worth having, though.	2025-12-02 11:47:41 -05:00
Brent Westbrook	e9f9507dc8	add some assignment tests with parentheses and comments	2025-12-02 11:47:04 -05:00
Brent Westbrook	24e15bfd95	exclude call and subscript expressions from has_own_parentheses	2025-12-02 11:28:19 -05:00
Brent Westbrook	9db5d43e18	possibly bad test for triple-quoted f-strings parenthesizing these seems redundant. I would prefer our old formatting more like this: ```py def ddb(): sql = ( lambda var, table, n=N: f""" CREATE TABLE {table} AS SELECT ROW_NUMBER() OVER () AS id, {var} FROM ( SELECT {var} FROM RANGE({n}) _ ({var}) ORDER BY RANDOM() ) """ ) ``` where the `f"""` serves as the parentheses, instead of the current: ```py def ddb(): sql = lambda var, table, n=N: ( f""" CREATE TABLE {table} AS SELECT ROW_NUMBER() OVER () AS id, {var} FROM ( SELECT {var} FROM RANGE({n}) _ ({var}) ORDER BY RANDOM() ) """ ) ```	2025-12-02 11:22:55 -05:00
Brent Westbrook	ad3147703d	another bad test for long bodies with their own parens this case ends up too long at 108 columns: ```py class C: def foo(): if True: transaction_count = self._query_txs_for_range( get_count_fn=lambda from_ts, to_ts, _chain_id=chain_id: db_evmtx.count_transactions_in_range( chain_id=_chain_id, from_ts=from_ts, to_ts=to_ts, ), ) ``` instead, it should be formatted like this, fitting within 88 columns: ```py class C: def foo(): if True: transaction_count = self._query_txs_for_range( get_count_fn=lambda from_ts, to_ts, _chain_id=chain_id: ( db_evmtx.count_transactions_in_range( chain_id=_chain_id, from_ts=from_ts, to_ts=to_ts, ) ), ) ``` we can fix this by removing the `has_own_parentheses` check in the new lambda formatting, but this breaks other cases. we might want to preserve this? in this specific ecosystem case, the project has a `noqa: E501` comment, so this seems to be what they want anyway, although we don't know that when formatting	2025-12-02 11:22:55 -05:00
Brent Westbrook	f634bb5247	propagate lambda layout for annotated assignments	2025-12-02 11:22:55 -05:00
Brent Westbrook	6b47664019	add another bad test case from the ecosystem report I would expect this to format as: ```py class C: _is_recognized_dtype: Callable[[DtypeObj], bool] = lambda x: ( lib.is_np_dtype(x, "M") or isinstance(x, DatetimeTZDtype) ) ``` instead of the current: ```py class C: _is_recognized_dtype: Callable[[DtypeObj], bool] = ( lambda x: lib.is_np_dtype(x, "M") or isinstance(x, DatetimeTZDtype) ) ```	2025-12-02 11:22:55 -05:00
Brent Westbrook	07bcf41a34	fix binary expression in lambda in return	2025-12-02 11:22:55 -05:00
Brent Westbrook	d68a03a519	add another bad case from the ecosystem check this should format like: ```py def foo(): if True: if True: return lambda x: ( np.exp(cs(np.log(x.to(u.MeV).value))) * u.MeV * u.cm**2 / u.g ) ``` instead of the current snapshot: ```diff def foo(): if True: if True: - return ( - lambda x: np.exp(cs(np.log(x.to(u.MeV).value))) * u.MeV * u.cm**2 / u.g - ) + return lambda x: np.exp( + cs(np.log(x.to(u.MeV).value)) + ) * u.MeV * u.cm**2 / u.g ```	2025-12-02 11:22:55 -05:00
Brent Westbrook	62c968c826	rough draft of ExprLambdaLayout::Assignment	2025-12-02 11:22:55 -05:00
Brent Westbrook	74093bdb50	add a poorly formatted case from the ecosystem report this formats as: ```py class C: function_dict: Dict[Text, Callable[[CRFToken], Any]] = { CRFEntityExtractorOptions.POS2: lambda crf_token: crf_token.pos_tag[ :2 ] if crf_token.pos_tag is not None else None, } ``` when I think it should look like: ```py class C: function_dict: Dict[Text, Callable[[CRFToken], Any]] = { CRFEntityExtractorOptions.POS2: lambda crf_token: ( crf_token.pos_tag[:2] if crf_token.pos_tag is not None else None, ) } ```	2025-12-02 11:22:55 -05:00
Brent Westbrook	1b58643040	wip: parenthesize long lambda bodies	2025-12-02 11:22:55 -05:00
Brent Westbrook	8cc884428d	keep lambda parameters on a single line	2025-12-02 11:22:55 -05:00
Brent Westbrook	1e70c991a2	baseline test cases	2025-12-02 11:22:55 -05:00
Dylan	62343a101a	Respect `fmt: skip` for compound statements on single line (#20633) Closes #11216. Essentially the approach is to implement `Format` for a new struct `FormatClause` which is just a clause header _and_ its body. We then have the information we need to see whether there is a skip suppression comment on the last child in the body and it all fits on one line.	2025-11-18 12:02:09 -06:00
Brent Westbrook	cbc6863b8c	Fix panic when formatting comments in unary expressions (#21501 ) ## Summary This is another attempt at https://github.com/astral-sh/ruff/pull/21410 that fixes https://github.com/astral-sh/ruff/issues/19226. @MichaReiser helped me get something working in a very helpful pairing session. I pushed one additional commit moving the comments back from leading comments to trailing comments, which I think retains more of the input formatting. I was inspired by Dylan's PR (#21185) to make one of these tables: <table> <thead> <tr> <th scope="col">Input</th> <th scope="col">Main</th> <th scope="col">PR</th> </tr> </thead> <tbody> <tr> <td><pre lang="python"> if ( not # comment aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb ): pass </pre></td> <td><pre lang="python"> if ( # comment not aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb ): pass </pre></td> <td><pre lang="python"> if ( not # comment aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb ): pass </pre></td> </tr> <tr> <td><pre lang="python"> if ( # unary comment not # operand comment ( # comment aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb ) ): pass </pre></td> <td><pre lang="python"> if ( # unary comment # operand comment not ( # comment aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb ) ): pass </pre></td> <td><pre lang="python"> if ( # unary comment not # operand comment ( # comment aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb ) ): pass </pre></td> </tr> <tr> <td><pre lang="python"> if ( not # comment aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb ): pass </pre></td> <td><pre lang="python"> if ( # comment not aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb ): pass </pre></td> <td><pre 
lang="python"> if ( not aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa # comment + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb ): pass </pre></td> </tr> </tbody> </table> hopefully it helps even though the snippets are much wider here. The two main differences are (1) that we now retain own-line comments between the unary operator and its operand instead of moving these to leading comments on the operator itself, and (2) that we move end-of-line comments between the operator and operand to dangling end-of-line comments on the operand (the last example in the table). ## Test Plan Existing tests, plus new ones based on the issue. As I noted below, I also ran the output from main on the unary.py file back through this branch to check that we don't reformat code from main. This made me feel a bit better about not preview-gating the changes in this PR. ```shell > git show main:crates/ruff_python_formatter/resources/test/fixtures/ruff/expression/unary.py \| ruff format - \| ./target/debug/ruff format --diff - > echo $? 0 ``` --------- Co-authored-by: Micha Reiser <micha@reiser.io> Co-authored-by: Takayuki Maeda <takoyaki0316@gmail.com>	2025-11-18 10:48:14 -05:00
Dylan	8156b45173	Avoid syntax error when formatting attribute expressions with outer parentheses, parenthesized value, and trailing comment on value (#20418) Closes #19350. This fixes a syntax error caused by formatting. However, the new tests reveal that there are some cases where formatting attributes with certain comments behaves strangely, both before and after this PR, so some more polish may be in order. For example, without parentheses around the value, and both before and after this PR, we have: ```python # unformatted variable = ( something # a comment .first_method("some string") ) # formatted variable = something.first_method("some string") # a comment ``` which is probably not where the comment ought to go.	2025-11-17 09:11:36 -06:00
Dylan	04a3ec3689	Adjust own-line comment placement between branches (#21185 ) This PR attempts to improve the placement of own-line comments between branches in the setting where the comment is more indented than the preceding node. There are two main changes. ### First change: Preceding node has leading content If the preceding node has leading content, we now regard the comment as automatically _less_ indented than the preceding node, and format accordingly. For example, ```python if True: preceding_node # leading on `else`, not trailing on `preceding_node` else: ... ``` This is more compatible with `black`, although there is a (presumably very uncommon) edge case: ```python if True: this;that # leading on `else`, but trailing in `black` else: ... ``` I'm sort of okay with this - presumably if one wanted a comment for those semi-colon separated statements, one should have put it _above_ them, and one wanted a comment only for `that` then it ought to have been on the same line? ### Second change: searching for last child in body While searching for the (recursively) last child in the body of the preceding _branch_, we implicitly assumed that the preceding node had to have a body to begin the recursion. But actually, in the base case, the preceding node _is_ the last child in the body of the preceding branch. So, for example: ```python if True: something last_child_but_no_body # leading on else for `main` but trailing in this PR else: ... ``` ### More examples The table below is an attempt to summarize the changes in behavior. The rows alternate between an example snippet with `while` and the same example with `if` - in the former case we do _not_ have an `else` node and in the latter we do. Notice that: 1. On `main` our handling of `if` vs. `while` is not consistent, whereas it is consistent in the present PR 2. 
We disagree with `black` in all cases except that last example on `main`, but agree in all cases for the present PR (though see above for a wonky edge case where we disagree). <table> <tr> <th>Original                             </th> <th><code>main</code>                               </th> <th>This PR                               </th> <th><code>black</code>                               </th> </tr> <tr> <td> <pre lang="python"> while True: pass # comment else: pass </pre> </td> <td> <pre lang="python"> while True: pass else: # comment pass </pre> </td> <td> <pre lang="python"> while True: pass # comment else: pass </pre> </td> <td> <pre lang="python"> while True: pass # comment else: pass </pre> </td> </tr> <tr> <td> <pre lang="python"> if True: pass # comment else: pass </pre> </td> <td> <pre lang="python"> if True: pass # comment else: pass </pre> </td> <td> <pre lang="python"> if True: pass # comment else: pass </pre> </td> <td> <pre lang="python"> if True: pass # comment else: pass </pre> </td> </tr> <tr> <td> <pre lang="python"> while True: pass # comment else: pass </pre> </td> <td> <pre lang="python"> while True: pass # comment else: pass </pre> </td> <td> <pre lang="python"> while True: pass # comment else: pass </pre> </td> <td> <pre lang="python"> while True: pass # comment else: pass </pre> </td> </tr> <tr> <td> <pre lang="python"> if True: pass # comment else: pass </pre> </td> <td> <pre lang="python"> if True: pass # comment else: pass </pre> </td> <td> <pre lang="python"> if True: pass # comment else: pass </pre> </td> <td> <pre lang="python"> if True: pass # comment else: pass </pre> </td> </tr> <tr> <td> <pre lang="python"> while True: pass # comment else: pass </pre> </td> <td> <pre lang="python"> while True: pass else: # comment pass </pre> </td> <td> <pre lang="python"> while True: pass # comment else: pass </pre> </td> <td> <pre lang="python"> while True: pass # comment else: pass </pre> </td> </tr> <tr> <td> <pre lang="python"> if True: 
pass # comment else: pass </pre> </td> <td> <pre lang="python"> if True: pass # comment else: pass </pre> </td> <td> <pre lang="python"> if True: pass # comment else: pass </pre> </td> <td> <pre lang="python"> if True: pass # comment else: pass </pre> </td> </tr> </table>	2025-11-17 07:30:34 -06:00
Brent Westbrook	63b1c1ea8b	Avoid extra parentheses for long `match` patterns with `as` captures (#21176 ) Summary -- This PR fixes #17796 by taking the approach mentioned in https://github.com/astral-sh/ruff/issues/17796#issuecomment-2847943862 of simply recursing into the `MatchAs` patterns when checking if we need parentheses. This allows us to reuse the parentheses in the inner pattern before also breaking the `MatchAs` pattern itself: ```diff match class_pattern: case Class(xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx) as capture: pass - case ( - Class(xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx) as capture - ): + case Class( + xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx + ) as capture: pass - case ( - Class( - xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx - ) as capture - ): + case Class( + xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx + ) as capture: pass case ( Class( @@ -685,13 +683,11 @@ match sequence_pattern_brackets: case [xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx] as capture: pass - case ( - [xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx] as capture - ): + case [ + xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx + ] as capture: pass - case ( - [ - xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx - ] as capture - ): + case [ + xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx + ] as capture: pass ``` I haven't really resolved the question of whether or not it's okay always to recurse, but I'm hoping the ecosystem check on this PR might shed some light on that. Test Plan -- New tests based on the issue and then reviewing the ecosystem check here	2025-11-03 17:06:52 -05:00
Brent Westbrook	827d8ae5d4	Allow newlines after function headers without docstrings (#21110 ) Summary -- This is a first step toward fixing #9745. After reviewing our open issues and several Black issues and PRs, I personally found the function case the most compelling, especially with very long argument lists: ```py def func( self, arg1: int, arg2: bool, arg3: bool, arg4: float, arg5: bool, ) -> tuple[...]: if arg2 and arg3: raise ValueError ``` or many annotations: ```py def function( self, data: torch.Tensor \| tuple[torch.Tensor, ...], other_argument: int ) -> torch.Tensor \| tuple[torch.Tensor, ...]: do_something(data) return something ``` I think docstrings help the situation substantially both because syntax highlighting will usually give a very clear separation between the annotations and the docstring and because we already allow a blank line _after_ the docstring: ```py def function( self, data: torch.Tensor \| tuple[torch.Tensor, ...], other_argument: int ) -> torch.Tensor \| tuple[torch.Tensor, ...]: """ A function doing something. And a longer description of the things it does. """ do_something(data) return something ``` There are still other comments on #9745, such as [this one] with 9 upvotes, where users specifically request blank lines in all block types, or at least including conditionals and loops. I'm sympathetic to that case as well, even if personally I don't find an [example] like this: ```py if blah: # Do some stuff that is logically related data = get_data() # Do some different stuff that is logically related results = calculate_results() return results ``` to be much more readable than: ```py if blah: # Do some stuff that is logically related data = get_data() # Do some different stuff that is logically related results = calculate_results() return results ``` I'm probably just used to the latter from the formatters I've used, but I do prefer it. 
I also think that functions are the least susceptible to the accidental introduction of a newline after refactoring described in Micha's [comment] on #8893. I actually considered further restricting this change to functions with multiline headers. I don't think very short functions like: ```py def foo(): return 1 ``` benefit nearly as much from the allowed newline, but I just went with any function without a docstring for now. I guess a marginal case like: ```py def foo(a_long_parameter: ALongType, b_long_parameter: BLongType) -> CLongType: return 1 ``` might be a good argument for not restricting it. I caused a couple of syntax errors before adding special handling for the ellipsis-only case, so I suspect that there are some other interesting edge cases that may need to be handled better. Test Plan -- Existing tests, plus a few simple new ones. As noted above, I suspect that we may need a few more for edge cases I haven't considered. [this one]: https://github.com/astral-sh/ruff/issues/9745#issuecomment-2876771400 [example]: https://github.com/psf/black/issues/902#issuecomment-1562154809 [comment]: https://github.com/astral-sh/ruff/issues/8893#issuecomment-1867259744	2025-10-31 14:53:40 -04:00
Dylan	116611bd39	Fix finding keyword range for clause header after statement ending with semicolon (#21067) When formatting clause headers for clauses that are not their own node, like an `else` clause or `finally` clause, we begin searching for the keyword at the end of the previous statement. However, if the previous statement ended in a semicolon this caused a panic because we only expected trivia between the end of the last statement and the keyword. This PR adjusts the starting point of our search for the keyword to begin after the optional semicolon in these cases. Closes #21065	2025-10-27 09:52:17 -05:00
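The adjusted search in #21067 can be sketched in a few lines of Python. This is an illustration only, assuming a plain string scan rather than Ruff's tokenizer; `find_keyword_start` and its parameters are hypothetical names, not part of the formatter.

```python
def find_keyword_start(source: str, search_from: int, keyword: str) -> int:
    """Return the offset of `keyword`, scanning forward from `search_from`.

    `search_from` is the end offset of the previous statement.
    """
    i = search_from
    # Step over an optional `;` that terminates the previous statement.
    j = i
    while j < len(source) and source[j] in " \t":
        j += 1
    if j < len(source) and source[j] == ";":
        i = j + 1
    # After that, only trivia (whitespace and comments) may precede the keyword.
    while i < len(source):
        ch = source[i]
        if ch in " \t\r\n":
            i += 1
        elif ch == "#":
            # Skip the comment up to the end of its line.
            while i < len(source) and source[i] != "\n":
                i += 1
        else:
            break
    if not source.startswith(keyword, i):
        raise ValueError(f"expected {keyword!r} at offset {i}")
    return i


source = "if x:\n    a = 1;\nelse:\n    pass\n"
statement_end = source.index("a = 1") + len("a = 1")
keyword_offset = find_keyword_start(source, statement_end, "else")
print(keyword_offset)
```

Without the semicolon step, the scan would stop at `;` and fail the trivia-only expectation, which mirrors the panic the commit describes.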
Micha Reiser	3c7f56f582	Restore `indent.py` (#21094)	2025-10-27 10:34:29 +00:00
Brent Westbrook	4b0fa5f270	Render a diagnostic for syntax errors introduced in formatter tests (#21021) ## Summary I spun this out from #21005 because I thought it might be helpful separately. It just renders a nice `Diagnostic` for syntax errors pointing to the source of the error. This seemed a bit more helpful to me than just the byte offset when working on #21005, and we had most of the code around after #20443 anyway. ## Test Plan This doesn't actually affect any passing tests, but here's an example of the additional output I got when I broke the spacing after the `in` token: ``` error[internal-error]: Expected 'in', found name --> /home/brent/astral/ruff/crates/ruff_python_formatter/resources/test/fixtures/black/cases/cantfit.py:50:79 | 48 | need_more_to_make_the_line_long_enough, 49 | ) 50 | del ([], name_1, name_2), [(), [], name_4, name_3], name_1[[name_2 for name_1 inname_0]] | ^^^^^^^^ 51 | del () | ``` I just appended this to the other existing output for now.	2025-10-21 13:47:26 -04:00
Brent Westbrook	0115fd3757	Avoid reusing nested, interpolated quotes before Python 3.12 (#20930 ) ## Summary Fixes #20774 by tracking whether an `InterpolatedStringState` element is nested inside of another interpolated element. This feels like kind of a naive fix, so I'm welcome to other ideas. But it resolves the problem in the issue and clears up the syntax error in the black compatibility test, without affecting many other cases. The other affected case is actually interesting too because the [input](`96b156303b/crates/ruff_python_formatter/resources/test/fixtures/ruff/expression/fstring.py (L707)`) is invalid, but the previous quote selection fixed the invalid syntax: ```pycon Python 3.11.13 (main, Sep 2 2025, 14:20:25) [Clang 20.1.4 ] on linux Type "help", "copyright", "credits" or "license" for more information. >>> f'{1: abcd "{'aa'}" }' # input File "<stdin>", line 1 f'{1: abcd "{'aa'}" }' ^^ SyntaxError: f-string: expecting '}' >>> f'{1: abcd "{"aa"}" }' # old output Traceback (most recent call last): File "<stdin>", line 1, in <module> ValueError: Invalid format specifier ' abcd "aa" ' for object of type 'int' >>> f'{1: abcd "{'aa'}" }' # new output File "<stdin>", line 1 f'{1: abcd "{'aa'}" }' ^^ SyntaxError: f-string: expecting '}' ``` We now preserve the invalid syntax in the input. Unfortunately, this also seems to be another edge case I didn't consider in https://github.com/astral-sh/ruff/pull/20867 because we don't flag this as a syntax error after 0.14.1: <details><summary>Shell output</summary> <p> ``` > uvx ruff@0.14.0 check --ignore ALL --target-version py311 - <<EOF f'{1: abcd "{'aa'}" }' EOF invalid-syntax: Cannot reuse outer quote character in f-strings on Python 3.11 (syntax was added in Python 3.12) --> -:1:14 \| 1 \| f'{1: abcd "{'aa'}" }' \| ^ \| Found 1 error. > uvx ruff@0.14.1 check --ignore ALL --target-version py311 - <<EOF f'{1: abcd "{'aa'}" }' EOF All checks passed! 
> uvx python@3.11 -m ast <<EOF f'{1: abcd "{'aa'}" }' EOF Traceback (most recent call last): File "<frozen runpy>", line 198, in _run_module_as_main File "<frozen runpy>", line 88, in _run_code File "/home/brent/.local/share/uv/python/cpython-3.11.13-linux-x86_64-gnu/lib/python3.11/ast.py", line 1752, in <module> main() File "/home/brent/.local/share/uv/python/cpython-3.11.13-linux-x86_64-gnu/lib/python3.11/ast.py", line 1748, in main tree = parse(source, args.infile.name, args.mode, type_comments=args.no_type_comments) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/home/brent/.local/share/uv/python/cpython-3.11.13-linux-x86_64-gnu/lib/python3.11/ast.py", line 50, in parse return compile(source, filename, mode, flags, ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "<stdin>", line 1 f'{1: abcd "{'aa'}" }' ^^ SyntaxError: f-string: expecting '}' ``` </p> </details> I assumed that was the same `ParseError` as the one caused by `f"{1:""}"`, but this is a nested interpolation inside of the format spec. ## Test Plan New test copied from the black compatibility test. I guess this is a duplicate now, I started working on this branch before the new black tests were imported, so I could delete the separate test in our fixtures if that's preferable.	2025-10-17 08:49:16 -04:00
Brent Westbrook	8b9ab48ac6	Fix syntax error false positives for escapes and quotes in f-strings (#20867 ) Summary -- Fixes #20844 by refining the unsupported syntax error check for [PEP 701] f-strings before Python 3.12 to allow backslash escapes and escaped outer quotes in the format spec part of f-strings. These are only disallowed within the f-string expression part on earlier versions. Using the examples from the PR: ```pycon >>> f"{1:\x64}" '1' >>> f"{1:\"d\"}" Traceback (most recent call last): File "<stdin>", line 1, in <module> ValueError: Invalid format specifier '"d"' for object of type 'int' ``` Note that the second case is a runtime error, but this is actually avoidable if you override `__format__`, so despite being pretty weird, this could actually be a valid use case. ```pycon >>> class C: ... def __format__(args, *kwargs): return "<C>" ... >>> f"{C():\"d\"}" '<C>' ``` At first I thought narrowing the range we check to exclude the format spec would only work for escapes, but it turns out that cases like `f"{1:""}"` are already covered by an existing `ParseError`, so we can just narrow the range of both our escape and quote checks. Our comment check also seems to be working correctly because it's based on the actual tokens. A case like [this](https://play.ruff.rs/9f1c2ff2-cd8e-4ad7-9f40-56c0a524209f): ```python f"""{1:# }""" ``` doesn't include a comment token, instead the `#` is part of an `InterpolatedStringLiteralElement`. Test Plan -- New inline parser tests [PEP 701]: https://peps.python.org/pep-0701/	2025-10-15 09:23:16 -04:00
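The point in #20867 that a quoted format spec is only a runtime error for `int`, and is valid once `__format__` is overridden, can be reproduced without any f-string syntax by calling the built-in `format()` directly. This is a minimal sketch; the class name `AlwaysC` is made up for illustration.

```python
class AlwaysC:
    """Accepts any format spec, like the `C` class in the commit message."""

    def __format__(self, format_spec: str) -> str:
        return "<C>"


spec = '"d"'  # the format spec that f"{...:\"d\"}" would pass along

int_result = None
try:
    format(1, spec)
except ValueError as err:
    # `int` rejects the spec at runtime; the syntax itself is fine.
    int_result = f"ValueError: {err}"

custom_result = format(AlwaysC(), spec)
print(int_result)
print(custom_result)
```

Because the failure depends on the object's `__format__` rather than on the parser, a syntax-error check has no basis for rejecting such specs, which is consistent with narrowing the checked range to the expression part.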
Brent Westbrook	591e9bbccb	Remove parentheses around multiple exception types on Python 3.14+ (#20768 ) Summary -- This PR implements the black preview style from https://github.com/psf/black/pull/4720. As of Python 3.14, you're allowed to omit the parentheses around groups of exceptions, as long as there's no `as` binding: 3.13 ```pycon Python 3.13.4 (main, Jun 4 2025, 17:37:06) [Clang 20.1.4 ] on linux Type "help", "copyright", "credits" or "license" for more information. >>> try: ... ... except (Exception, BaseException): ... ... Ellipsis >>> try: ... ... except Exception, BaseException: ... ... File "<python-input-1>", line 2 except Exception, BaseException: ... ^^^^^^^^^^^^^^^^^^^^^^^^ SyntaxError: multiple exception types must be parenthesized ``` 3.14 ```pycon Python 3.14.0rc2 (main, Sep 2 2025, 14:20:56) [Clang 20.1.4 ] on linux Type "help", "copyright", "credits" or "license" for more information. >>> try: ... ... except Exception, BaseException: ... ... Ellipsis >>> try: ... ... except (Exception, BaseException): ... ... Ellipsis >>> try: ... ... except Exception, BaseException as e: ... ... File "<python-input-2>", line 2 except Exception, BaseException as e: ... ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ SyntaxError: multiple exception types must be parenthesized when using 'as' ``` I think this ended up being pretty straightforward, at least once Micha showed me where to start :) Test Plan -- New tests At first I thought we were deviating from black in how we handle comments within the exception type tuple, but I think this applies to how we format all tuples, not specifically with the new preview style.	2025-10-14 11:17:45 -04:00
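The rule from #20768 reduces to a two-part condition, sketched below in Python. The function name and the `(major, minor)` tuple representation of the target version are assumptions for illustration; the real formatter is written in Rust and may structure this check differently.

```python
def can_omit_except_parentheses(
    target_version: tuple[int, int], has_as_binding: bool
) -> bool:
    """Hypothetical check: may `except (A, B):` be written as `except A, B:`?"""
    # Unparenthesized exception groups are only valid on Python 3.14+,
    # and only when the handler has no `as` binding.
    return target_version >= (3, 14) and not has_as_binding


print(can_omit_except_parentheses((3, 14), has_as_binding=False))
print(can_omit_except_parentheses((3, 13), has_as_binding=False))
print(can_omit_except_parentheses((3, 14), has_as_binding=True))
```

Both conditions correspond to the two `SyntaxError` messages shown in the REPL sessions above: one for versions before 3.14 and one for the `as` case on 3.14.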


439 Commits