Python/ruff - ruff - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Charlie Marsh	d616c9b870	Avoid omitting optional parentheses for argument-less parentheses (#6484 ) ## Summary This PR fixes some misformattings around optional parentheses for expressions. I first noticed that we were misformatting this: ```python return ( unicodedata.normalize("NFKC", s1).casefold() == unicodedata.normalize("NFKC", s2).casefold() ) ``` The above is stable Black formatting, but we were doing: ```python return unicodedata.normalize("NFKC", s1).casefold() == unicodedata.normalize( "NFKC", s2 ).casefold() ``` Above, the "last" expression is a function call, so our `can_omit_optional_parentheses` was returning `true`... However, it turns out that Black treats function calls differently depending on whether or not they have arguments -- presumedly because they'll never split empty parentheses, and so they're functionally non-useful. On further investigation, I believe this applies to all parenthesized expressions. If Black can't split on the parentheses, it doesn't leverage them when removing optional parentheses. ## Test Plan Nice increase in similarity scores. Before: - `zulip`: 0.99702 - `django`: 0.99784 - `warehouse`: 0.99585 - `build`: 0.75623 - `transformers`: 0.99470 - `cpython`: 0.75989 - `typeshed`: 0.74853 After: - `zulip`: 0.99705 - `django`: 0.99795 - `warehouse`: 0.99600 - `build`: 0.75623 - `transformers`: 0.99471 - `cpython`: 0.75989 - `typeshed`: 0.74853	2023-08-11 17:58:42 +00:00
Dhruv Manilawala	c434bdd2bd	Add formatting for `MatchCase` (#6360 ) ## Summary This PR adds formatting support for `MatchCase` node with subs for the `Pattern` nodes. ## Test Plan Added test cases for case node handling with comments, newlines. resolves: #6299	2023-08-11 19:20:25 +05:30
Charlie Marsh	f2939c678b	Avoid breaking call chains unnecessarily (#6488 ) ## Summary This PR attempts to fix the formatting of the following expression: ```python max_message_id = ( Message.objects.filter(recipient=recipient).order_by("id").reverse()[0].id ) ``` Specifically, Black preserves _that_ formatting, while we do: ```python max_message_id = ( Message.objects.filter(recipient=recipient) .order_by("id") .reverse()[0] .id ) ``` The fix here is to add a group around the entire call chain. ## Test Plan Before: - `zulip`: 0.99702 - `django`: 0.99784 - `warehouse`: 0.99585 - `build`: 0.75623 - `transformers`: 0.99470 - `cpython`: 0.75989 - `typeshed`: 0.74853 After: - `zulip`: 0.99703 - `django`: 0.99791 - `warehouse`: 0.99586 - `build`: 0.75623 - `transformers`: 0.99470 - `cpython`: 0.75989 - `typeshed`: 0.74853	2023-08-11 13:33:15 +00:00
Victor Hugo Gomes	b05574babd	Fix formatter instability with half-indented comment (#6460 ) ## Summary The bug was happening in this [loop](`75f402eb82/crates/ruff_python_formatter/src/comments/placement.rs (L545)`). Basically, In the first iteration of the loop, the `comment_indentation` is bigger than `child_indentation` (`comment_indentation` is 7 and `child_indentation` is 4) making the `Ordering::Greater` branch execute. Inside the `Ordering::Greater` branch, the `if` block gets executed, resulting in the update of these variables. ```rust parent_body = current_body; current_body = Some(last_child_in_current_body); last_child_in_current_body = nested_child; ``` In the second iteration of the loop, `comment_indentation` is smaller than `child_indentation` (`comment_indentation` is 7 and `child_indentation` is 8) making the `Ordering::Less` branch execute. Inside the `Ordering::Less` branch, the `if` block gets executed, this is where the bug was happening. At this point `parent_body` should be a `StmtFunctionDef` but it was a `StmtClassDef`. Causing the comment to be incorrectly formatted. That happened for the following code: ```python class A: def f(): pass # strangely indented comment print() ``` There is only one problem that I couldn't figure it out a solution, the variable `current_body` in this [line](`75f402eb82/crates/ruff_python_formatter/src/comments/placement.rs (L542C5-L542C49)`) now gives this warning _"value assigned to `current_body` is never read maybe it is overwritten before being read?"_ Any tips on how to solve that? Closes #5337 ## Test Plan Add new test case. --------- Co-authored-by: konstin <konstin@mailbox.org>	2023-08-11 11:21:16 +00:00
konsti	0ef6af807b	Implement DerefMut for WithNodeLevel (#6443 ) Summary Implement `DerefMut` for `WithNodeLevel` so it can be used in the same way as `PyFormatter`. I want this for my WIP upstack branch to enable `.fmt(f)` on `WithNodeLevel` context. We could extend this to remove the other two method from `WithNodeLevel`.	2023-08-11 10:41:48 +00:00
David Szotten	f091b46497	move comments from expressions in f-strings out (#6481 )	2023-08-11 09:22:30 +02:00
Charlie Marsh	2cedb401bd	Force parentheses for named expressions in more contexts (#6494 ) See: https://github.com/astral-sh/ruff/pull/6436#issuecomment-1673583888.	2023-08-11 01:54:46 -04:00
magic-akari	dc3275fe7f	Improve Ruff Formatter Interoperability (#6472 )	2023-08-10 14:39:53 +02:00
konsti	4811af0f0b	Formatter: Add test cases for comments after opening parentheses (#6420 ) Summary I collected all examples of end-of-line comments after opening parentheses that i could think of so we get a comprehensive view at the state of their formatting (#6390). This PR intentionally only adds tests cases without any changes in formatting. We need to decide which exact formatting we want, ideally in terms of these test files, and implement this in follow-up PRs. ~~One stability check is still deactivated pending https://github.com/astral-sh/ruff/pull/6386.~~	2023-08-10 08:34:03 +00:00
konsti	39beeb61f7	Track formatting all comments We currently don't format all comments as match statements are not yet implemented. We can work around this for the top level match statement by setting them manually formatted but the mocked-out top level match doesn't call into its children so they would still have unformatted comments	2023-08-10 09:19:27 +02:00
Micha Reiser	e2f7862404	Preserve dangling f-string comments <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR fixes the issue where the FString formatting dropped dangling comments between the string parts. ```python result_f = ( f' File "{__file__}", line {lineno_f+1}, in f\n' ' f()\n' # XXX: The following line changes depending on whether the tests # are run through the interactive interpreter or with -m # It also varies depending on the platform (stack size) # Fortunately, we don't care about exactness here, so we use regex r' \[Previous line repeated (\d+) more times\]' '\n' 'RecursionError: maximum recursion depth exceeded\n' ) ``` The solution here isn't ideal because it re-introduces the `enclosing_parent` on `DecoratedComment` but it is the easiest fix that I could come up. I didn't spend more time finding another solution becaues I think we have to re-write most of the fstring formatting with the upcoming Python 3.12 support (because lexing the individual parts as we do now will no longer work). closes #6440 <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan `cargo test` The child PR testing that all comments are formatted should now pass	2023-08-10 09:11:25 +02:00
Micha Reiser	c1bc67686c	Use SimpleTokenizer in `max_lines` (#6451 )	2023-08-10 08:13:14 +02:00
Dhruv Manilawala	6a64f2289b	Rename `Magic` to `IpyEscape` (#6395 ) ## Summary This PR renames the `MagicCommand` token to `IpyEscapeCommand` token and `MagicKind` to `IpyEscapeKind` type to better reflect the purpose of the token and type. Similarly, it renames the AST nodes from `LineMagic` to `IpyEscapeCommand` prefixed with `Stmt`/`Expr` wherever necessary. It also makes renames from using `jupyter_magic` to `ipython_escape_commands` in various function names. The mode value is still `Mode::Jupyter` because the escape commands are part of the IPython syntax but the lexing/parsing is done for a Jupyter notebook. ### Motivation behind the rename: * IPython codebase defines it as "EscapeCommand" / "Escape Sequences": * Escape Sequences: `292e3a2345/IPython/core/inputtransformer2.py (L329-L333)` * Escape command: `292e3a2345/IPython/core/inputtransformer2.py (L410-L411)` * The word "magic" is used mainly for the actual magic commands i.e., the ones starting with `%`/`%%` (https://ipython.readthedocs.io/en/stable/interactive/reference.html#magic-command-system). So, this avoids any confusion between the Magic token (`%`, `%%`) and the escape command itself. ## Test Plan * `cargo test` to make sure all renames are done correctly. * `grep` for `jupyter_escape`/`magic` to make sure all renames are done correctly.	2023-08-09 13:28:18 +00:00
Charlie Marsh	3bf1c66cda	Group function definition parameters with return type annotations (#6410 ) ## Summary This PR removes the group around function definition parameters, instead grouping the parameters with the type parameters and return type annotation. This increases Zulip's similarity score from 0.99385 to 0.99699, so it's a meaningful improvement. However, there's at least one stability error that I'm working on, and I'm really just looking for high-level feedback at this point, because I'm not happy with the solution. Closes https://github.com/astral-sh/ruff/issues/6352. ## Test Plan Before: - `zulip`: 0.99396 - `django`: 0.99784 - `warehouse`: 0.99578 - `build`: 0.75436 - `transformers`: 0.99407 - `cpython`: 0.75987 - `typeshed`: 0.74432 After: - `zulip`: 0.99702 - `django`: 0.99784 - `warehouse`: 0.99585 - `build`: 0.75623 - `transformers`: 0.99470 - `cpython`: 0.75988 - `typeshed`: 0.74853	2023-08-09 12:13:58 +00:00
Micha Reiser	a39dd76d95	Add `enter` and `leave_node` methods to Preoder visitor (#6422 )	2023-08-09 09:09:00 +00:00
Charlie Marsh	55d6fd53cd	Treat comments on open parentheses in return annotations as dangling (#6413 ) ## Summary Given: ```python def double(a: int) -> ( # Hello int ): return 2a ``` We currently treat `# Hello` as a trailing comment on the parameters (`(a: int)`). This PR adds a placement method to instead treat it as a dangling comment on the function definition itself, so that it gets formatted at the end of the definition, like: ```python def double(a: int) -> int: # Hello return 2a ``` The formatting in this case is unchanged, but it's incorrect IMO for that to be a trailing comment on the parameters, and that placement leads to an instability after changing the grouping in #6410. Fixing this led to a _different_ instability related to tuple return type annotations, like: ```python def zrevrangebylex(self, name: _Key, max: _Value, min: _Value, start: int \| None = None, num: int \| None = None) -> ( # type: ignore[override] ): ... ``` (This is a real example.) To fix, I had to special-case tuples in that spot, though I'm not certain that's correct.	2023-08-08 16:48:38 -04:00
Charlie Marsh	c7703e205d	Move `empty_parenthesized` into the `parentheses.rs` (#6403 ) ## Summary This PR moves `empty_parenthesized` such that it's peer to `parenthesized`, and changes the API to better match that of `parenthesized` (takes `&str` rather than `StaticText`, has a `with_dangling_comments` method, etc.). It may be intentionally _not_ part of `parentheses.rs`, but to me they're so similar that it makes more sense for them to be in the same module, with the same API, etc.	2023-08-08 19:17:17 +00:00
Dhruv Manilawala	d815a25b11	Update `StmtMatch` formatting snapshots (#6427 )	2023-08-08 16:45:02 +02:00
Dhruv Manilawala	001aa486df	Add formatting for `StmtMatch` (#6286 ) ## Summary This PR adds support for `StmtMatch` with subs for `MatchCase`. ## Test Plan Add a few additional test cases around `match` statement, comments, line breaks. resolves: #6298	2023-08-08 18:48:49 +05:30
Charlie Marsh	87984e9ac7	Expand parents whenever open-parenthesis comments are present (#6389 ) ## Summary This PR modifies our dangling-open-parenthesis handling to _always_ expand the parent expression. So, for example, given: ```python a = int( # type: ignore int( # type: ignore int( # type: ignore 6 ) ) ) ``` We now retain that as stable formatting, instead of truncating like: ```python a = int(int(int(6))) # comment # comment # comment ``` Note that Black _does_ collapse comments like this _unless_ they're `# type: ignore` comments, and perhaps in some other cases, so this is an intentional deviation ([playground](https://black.vercel.app/?version=main&state=_Td6WFoAAATm1rRGAgAhARYAAAB0L-Wj4AFEAHpdAD2IimZxl1N_WlOfrjryFgvD4ScVsKPztqdHDGJUg5knO0JCdpUfW1IrWSNmIJPx95s0hP-pRNkCQNH64-eIznIvXjeWBQ5-qax0oNw4yMOuhwr2azvMRZaEB5r8IXVPHmRCJp7fe7y4290u1zzxqK_nAi6q_5sI-jsAAAAA8HgZ9V7hG3QAAZYBxQIAAGnCHXexxGf7AgAAAAAEWVo=)).	2023-08-08 08:45:20 -04:00
konsti	90ba40c23c	Fix zulip unstable formatting with end-of-line comments (#6386 ) ## Bug Given ```python x = () - (# ) ``` the comment is a dangling comment of the empty tuple. This is an end-of-line comment so it may move after the expression. It still expands the parent, so the operator breaks: ```python x = ( () - () # ) ``` In the next formatting pass, the comment is not a trailing tuple but a trailing bin op comment, so the bin op doesn't break anymore. The comment again expands the parent, so we still add the superfluous parentheses ```python x = ( () - () # ) ``` ## Fix The new formatting is to keep the comment on the empty tuple. This is a log uglier and again has additional outer parentheses, but it's stable: ```python x = ( () - ( # ) ) ``` ## Alternatives Black formats all the examples above as ```python x = () - () # ``` which i find better. I would be happy about any suggestions for better solutions than the current one. I'd mainly need a workaround for expand parent having an effect on the bin op instead of first moving the comment to the end and then applying expand parent to the assign statement.	2023-08-08 09:15:35 +00:00
Micha Reiser	2bd345358f	Simplify `parenthesized` formatting (#6419 )	2023-08-08 08:50:57 +00:00
Charlie Marsh	404e334fec	Rename `ArgumentSeparator` to `ParameterSeparator` (#6404 ) To mirror the rename from `Arguments` to `Parameters`.	2023-08-07 15:46:28 -04:00
Charlie Marsh	8919b6ad9a	Add a `with_dangling_comments` to the parenthesized formatter (#6402 ) See: https://github.com/astral-sh/ruff/pull/6376#discussion_r1285514328.	2023-08-07 19:12:12 +00:00
Charlie Marsh	df1591b3c2	Remove outdated TODO (#6400 ) See: https://github.com/astral-sh/ruff/pull/6376#discussion_r1285539278.	2023-08-07 18:33:18 +00:00
Charlie Marsh	a637b8b3a3	Fixup comment handling on opening parenthesis in function definition (#6381 ) ## Summary I noticed some deviations in how we treat dangling comments that hug the opening parenthesis for function definitions. For example, given: ```python def f( # first # second ): # third ... ``` We currently format as: ```python def f( # first # second ): # third ... ``` This PR adds the proper opening-parenthesis dangling comment handling for function parameters. Specifically, as with all other parenthesized nodes, we now detect that dangling comment in `placement.rs` and handle it in `parameters.rs`. We have to take some care in that file, since we have multiple "kinds" of dangling comments, but I added a bunch of test cases that we now format identically to Black. ## Test Plan `cargo test` Before: - `zulip`: 0.99388 - `django`: 0.99784 - `warehouse`: 0.99504 - `transformers`: 0.99404 - `cpython`: 0.75913 - `typeshed`: 0.74364 After: - `zulip`: 0.99386 - `django`: 0.99784 - `warehouse`: 0.99504 - `transformers`: 0.99404 - `cpython`: 0.75913 - `typeshed`: 0.74409 Meaningful improvement on `typeshed`, minor decrease on `zulip`.	2023-08-07 14:04:56 -04:00
Charlie Marsh	3f0eea6d87	Rename `JoinedStr` to `FString` in the AST (#6379 ) ## Summary Per the proposal in https://github.com/astral-sh/ruff/discussions/6183, this PR renames the `JoinedStr` node to `FString`.	2023-08-07 17:33:17 +00:00
Zanie Blue	999d88e773	Fix formatting of chained boolean operations (#6394 ) Closes https://github.com/astral-sh/ruff/issues/6068 These commits are kind of a mess as I did some stumbling around here. Unrolls formatting of chained boolean operations to prevent nested grouping which gives us Black-compatible formatting where each boolean operation is on a new line.	2023-08-07 12:22:33 -05:00
Charlie Marsh	63ffadf0b8	Avoid omitting parentheses for trailing attributes on call expressions (#6322 ) ## Summary This PR modifies our `can_omit_optional_parentheses` rules to ensure that if we see a call followed by an attribute, we treat that as an attribute access rather than a splittable call expression. This in turn ensures that we wrap like: ```python ct_match = aaaaaaaaaaact_id == self.get_content_type( obj=rel_obj, using=instance._state.db ) ``` For calls, but: ```python ct_match = ( aaaaaaaaaaact_id == self.get_content_type(obj=rel_obj, using=instance._state.db).id ) ``` For calls with trailing attribute accesses. Closes https://github.com/astral-sh/ruff/issues/6065. ## Test Plan Similarity index before: - `zulip`: 0.99436 - `django`: 0.99779 - `warehouse`: 0.99504 - `transformers`: 0.99403 - `cpython`: 0.75912 - `typeshed`: 0.72293 And after: - `zulip`: 0.99436 - `django`: 0.99780 - `warehouse`: 0.99504 - `transformers`: 0.99404 - `cpython`: 0.75913 - `typeshed`: 0.72293	2023-08-07 13:18:58 -04:00
Charlie Marsh	daefa74e9a	Remove async AST node variants for `with`, `for`, and `def` (#6369 ) ## Summary Per the suggestion in https://github.com/astral-sh/ruff/discussions/6183, this PR removes `AsyncWith`, `AsyncFor`, and `AsyncFunctionDef`, replacing them with an `is_async` field on the non-async variants of those structs. Unlike an interpreter, we _generally_ have identical handling for these nodes, so separating them into distinct variants adds complexity from which we don't really benefit. This can be seen below, where we get to remove a _ton_ of code related to adding generic `Any*` wrappers, and a ton of duplicate branches for these cases. ## Test Plan `cargo test` is unchanged, apart from parser snapshots.	2023-08-07 16:36:02 +00:00
Charlie Marsh	b763973357	Avoid hard line break after dangling open-parenthesis comments (#6380 ) ## Summary Given: ```python [ # comment first, second, third ] # another comment ``` We were adding a hard line break as part of the formatting of `# comment`, which led to the following formatting: ```python [first, second, third] # comment # another comment ``` Closes https://github.com/astral-sh/ruff/issues/6367.	2023-08-07 14:15:32 +00:00
Charlie Marsh	63692b3798	Use `parenthesized_with_dangling_comments` in arguments formatter (#6376 ) ## Summary Fixes an instability whereby this: ```python def get_recent_deployments(threshold_days: int) -> Set[str]: # Returns a list of deployments not older than threshold days # including `/root/zulip` directory if it exists. recent = set() threshold_date = datetime.datetime.now() - datetime.timedelta( # noqa: DTZ005 days=threshold_days ) ``` Was being formatted as: ```python def get_recent_deployments(threshold_days: int) -> Set[str]: # Returns a list of deployments not older than threshold days # including `/root/zulip` directory if it exists. recent = set() threshold_date = ( datetime.datetime.now() - datetime.timedelta(days=threshold_days) # noqa: DTZ005 ) ``` Which was in turn being formatted as: ```python def get_recent_deployments(threshold_days: int) -> Set[str]: # Returns a list of deployments not older than threshold days # including `/root/zulip` directory if it exists. recent = set() threshold_date = ( datetime.datetime.now() - datetime.timedelta(days=threshold_days) # noqa: DTZ005 ) ``` The second-to-third formattings still differs from Black because we aren't taking the line suffix into account when splitting (https://github.com/astral-sh/ruff/issues/6377), but the first formatting is correct and should be unchanged (i.e., the first-to-second formattings is incorrect, and fixed here). ## Test Plan `cargo run --bin ruff_dev -- format-dev --stability-check ../zulip`	2023-08-07 09:43:57 -04:00
Charlie Marsh	4d47dfd6c0	Tweak breaking groups for comprehensions (#6321 ) ## Summary Fixes some comprehension formatting by avoiding creating the group for the comprehension itself (so that if it breaks, all parts break on their own lines, e.g. the `for` and the `if` clauses). Closes https://github.com/astral-sh/ruff/issues/6063. ## Test Plan Bunch of new fixtures.	2023-08-04 14:00:54 +00:00
konsti	99baad12d8	Call chain formatting in fluent style (#6151 ) Implement fluent style/call chains. See the `call_chains.py` formatting for examples. This isn't fully like black because in `raise A from B` they allow `A` breaking can influence the formatting of `B` even if it is already multiline. Similarity index: \| project \| main \| PR \| \|--------------\|-------\|-------\| \| build \| ??? \| 0.753 \| \| django \| 0.991 \| 0.998 \| \| transformers \| 0.993 \| 0.994 \| \| typeshed \| 0.723 \| 0.723 \| \| warehouse \| 0.978 \| 0.994 \| \| zulip \| 0.992 \| 0.994 \| Call chain formatting is affected by https://github.com/astral-sh/ruff/issues/627, but i'm cutting scope here. Closes #5343 Test Plan: * Added a dedicated call chains test file * The ecosystem checks found some bugs * I manually check django and zulip formatting --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-08-04 13:58:01 +00:00
Charlie Marsh	3a985dd71e	Rename `CommentPlacement#then_with` to `or_else` (#6341 ) Per nits in the PR.	2023-08-04 13:50:57 +00:00
Charlie Marsh	1e3fe67ca5	Refactor and rename `skip_trailing_trivia` (#6312 ) Based on feedback here: https://github.com/astral-sh/ruff/pull/6274#discussion_r1282747964.	2023-08-04 13:30:53 +00:00
Micha Reiser	f4831d5a26	Formatter comment handling nits (#6339 )	2023-08-04 13:22:16 +00:00
konsti	1031bb6550	Formatter: Add SourceType to context to enable special formatting for stub files (#6331 ) Summary This adds the information whether we're in a .py python source file or in a .pyi stub file to enable people working on #5822 and related issues. I'm not completely happy with `Default` for something that depends on the input. Test Plan None, this is currently unused, i'm leaving this to first implementation of stub file specific formatting. --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-08-04 11:52:26 +00:00
David Szotten	fe97a2a302	Fix panic with empty attribute inner comment (#6332 ) Fixes https://github.com/astral-sh/ruff/issues/6181	2023-08-04 11:59:55 +02:00
konsti	a48d16e025	Replace `Formatter<PyFormatContext<'_>>` with `PyFormatter` (#6330 ) This is a refactoring to use the type alias in more places. In the process, I had to fix and run generate.py. There are no functional changes.	2023-08-04 10:48:58 +02:00
Charlie Marsh	1d8759d5df	Generalize comment-after-bracket handling to lists, sets, etc. (#6320 ) ## Summary We already support preserving the end-of-line comment in calls and type parameters, as in: ```python foo( # comment bar, ) ``` This PR adds the same behavior for lists, sets, comprehensions, etc., such that we preserve: ```python [ # comment 1, 2, 3, ] ``` And related cases.	2023-08-04 01:28:05 +00:00
Charlie Marsh	d3aa8b4ee0	Add API to chain comment placement operations (#6319 ) ## Summary This PR adds an API for chaining comment placement methods based on the [`then_with`](https://doc.rust-lang.org/std/cmp/enum.Ordering.html#method.then_with) from `Ordering` in the standard library. For example, you can now do: ```rust try_some_case(comment).then_with(\|comment\| try_some_other_case_if_still_default(comment)) ``` This lets us avoid this kind of pattern, which I've seen in `placement.rs` and used myself before: ```rust let comment = match handle_own_line_comment_between_branches(comment, preceding, locator) { CommentPlacement::Default(comment) => comment, placement => return placement, }; ```	2023-08-03 21:08:50 -04:00
Charlie Marsh	5f225b18ab	Generalize bracketed end-of-line comment handling (#6315 ) Micha suggested this in https://github.com/astral-sh/ruff/pull/6274#discussion_r1282774151, and it allows us to unify the implementations for arguments and type params.	2023-08-03 20:51:03 +00:00
Charlie Marsh	1705fcef36	Mark trailing comments in parenthesized tests (#6287 ) ## Summary This ensures that we treat `# comment` as parenthesized in contexts like: ```python while ( True # comment ): pass ``` The same logic applies equally to `for`, `async for`, `if`, `with`, and `async with`. The general pattern is that you have an expression which precedes a colon-separated suite.	2023-08-03 20:45:03 +00:00
Charlie Marsh	b3f3529499	Improve comments around `Arguments` handling in classes (#6310 ) ## Summary Based on the confusion here: https://github.com/astral-sh/ruff/pull/6274#discussion_r1282754515. I looked into moving this logic into `placement.rs`, but I think it's trickier than it may appear.	2023-08-03 12:34:03 -04:00
Charlie Marsh	c75e8a8dab	Move `ExprCall`'s `NeedsParentheses` impl into `expr_call.rs` (#6309 ) Accidental move.	2023-08-03 16:01:01 +00:00
Zanie Blue	5b2e973fa5	Add formatting of type alias statements (#6162 ) Part of #5062 Extends https://github.com/astral-sh/ruff/pull/6161 Closes #5929	2023-08-02 20:40:32 +00:00
Zanie Blue	1a60d1e3c6	Add formatting of type parameters in class and function definitions (#6161 ) Part of #5062 Closes https://github.com/astral-sh/ruff/issues/5931 Implements formatting of a sequence of type parameters in a dedicated struct for reuse by classes, functions, and type aliases (preparing for #5929). Adds formatting of type parameters in class and function definitions — previously, they were just elided.	2023-08-02 20:29:28 +00:00
Charlie Marsh	9425ed72a0	Break global and nonlocal statements over continuation lines (#6172 ) ## Summary Builds on #6170 to break `global` and `nonlocal` statements, such that we get: ```python def f(): global \ analyze_featuremap_layer, \ analyze_featuremapcompression_layer, \ analyze_latencies_post, \ analyze_motions_layer, \ analyze_size_model ``` Instead of: ```python def f(): global analyze_featuremap_layer, analyze_featuremapcompression_layer, analyze_latencies_post, analyze_motions_layer, analyze_size_model ``` Notably, we avoid applying this formatting if the statement ends in a comment. Otherwise, the comment would _need_ to be placed after the last item, like: ```python def f(): global \ analyze_featuremap_layer, \ analyze_featuremapcompression_layer, \ analyze_latencies_post, \ analyze_motions_layer, \ analyze_size_model # noqa ``` To me, this seems wrong (and would break the `# noqa` comment). Ideally, the items would be parenthesized, and the comment would be on the inner parenthesis, like: ```python def f(): global ( # noqa analyze_featuremap_layer, analyze_featuremapcompression_layer, analyze_latencies_post, analyze_motions_layer, analyze_size_model ) ``` But that's not valid syntax.	2023-08-02 19:55:00 +00:00
Victor Hugo Gomes	7c5791fb77	Fix formatting of `lambda` star arguments (#6257 ) ## Summary Previously, the ruff formatter was removing the star argument of `lambda` expressions when formatting. Given the following code snippet ```python lambda a: () lambda *b: () ``` it would be formatted to ```python lambda: () lambda: () ``` We fix this by checking for the presence of `args`, `vararg` or `kwarg` in the `lambda` expression, before we were only checking for the presence of `args`. Fixes #5894 ## Test Plan Add new tests cases. --------- Co-authored-by: Charlie Marsh <charlie.r.marsh@gmail.com>	2023-08-02 19:31:20 +00:00
Charlie Marsh	8a0f844642	Box type params and arguments fields on the class definition node (#6275 ) ## Summary This PR boxes the `TypeParams` and `Arguments` fields on the class definition node. These fields are optional and often emitted, and given that class definition is our largest enum variant, we pay the cost of including them for every statement in the AST. Boxing these types reduces the statement size by 40 bytes, which seems like a good tradeoff given how infrequently these are accessed. ## Test Plan Need to benchmark, but no behavior changes.	2023-08-02 16:47:06 +00:00
Charlie Marsh	4c53bfe896	Add formatter support for call and class definition `Arguments` (#6274 ) ## Summary This PR leverages the `Arguments` AST node introduced in #6259 in the formatter, which ensures that we correctly handle trailing comments in calls, like: ```python f( 1, # comment ) pass ``` (Previously, this was treated as a leading comment on `pass`.) This also allows us to unify the argument handling across calls and class definitions. ## Test Plan A bunch of new fixture tests, plus improved Black compatibility.	2023-08-02 11:54:22 -04:00
Charlie Marsh	981e64f82b	Introduce an `Arguments` AST node for function calls and class definitions (#6259 ) ## Summary This PR adds a new `Arguments` AST node, which we can use for function calls and class definitions. The `Arguments` node spans from the left (open) to right (close) parentheses inclusive. In the case of classes, the `Arguments` is an option, to differentiate between: ```python # None class C: ... # Some, with empty vectors class C(): ... ``` In this PR, we don't really leverage this change (except that a few rules get much simpler, since we don't need to lex to find the start and end ranges of the parentheses, e.g., `crates/ruff/src/rules/pyupgrade/rules/lru_cache_without_parameters.rs`, `crates/ruff/src/rules/pyupgrade/rules/unnecessary_class_parentheses.rs`). In future PRs, this will be especially helpful for the formatter, since we can track comments enclosed on the node itself. ## Test Plan `cargo test`	2023-08-02 10:01:13 -04:00
Charlie Marsh	7842c82a0a	Preserve end-of-line comments on import-from statements (#6216 ) ## Summary Ensures that we keep comments at the end-of-line in cases like: ```python from foo import ( # comment bar, ) ``` Closes https://github.com/astral-sh/ruff/issues/6067.	2023-08-01 18:58:05 +00:00
Charlie Marsh	9c708d8fc1	Rename `Parameter#arg` and `ParameterWithDefault#def` fields (#6255 ) ## Summary This PR renames... - `Parameter#arg` to `Parameter#name` - `ParameterWithDefault#def` to `ParameterWithDefault#parameter` (such that `ParameterWithDefault` has a `default` and a `parameter`) ## Test Plan `cargo test`	2023-08-01 14:28:34 -04:00
Charlie Marsh	adc8bb7821	Rename `Arguments` to `Parameters` in the AST (#6253 ) ## Summary This PR renames a few AST nodes for clarity: - `Arguments` is now `Parameters` - `Arg` is now `Parameter` - `ArgWithDefault` is now `ParameterWithDefault` For now, the attribute names that reference `Parameters` directly are changed (e.g., on `StmtFunctionDef`), but the attributes on `Parameters` itself are not (e.g., `vararg`). We may revisit that decision in the future. For context, the AST node formerly known as `Arguments` is used in function definitions. Formally (outside of the Python context), "arguments" typically refers to "the values passed to a function", while "parameters" typically refers to "the variables used in a function definition". E.g., if you Google "arguments vs parameters", you'll get some explanation like: > A parameter is a variable in a function definition. It is a placeholder and hence does not have a concrete value. An argument is a value passed during function invocation. We're thus deviating from Python's nomenclature in favor of a scheme that we find to be more precise.	2023-08-01 13:53:28 -04:00
Charlie Marsh	a82eb9544c	Implement Black's rules around newlines before and after class docstrings (#6209 ) ## Summary Black allows up to one blank line _before_ a class docstring, and enforces one blank line _after_ a class docstring. This PR implements that handling. The cases in `crates/ruff_python_formatter/resources/test/fixtures/ruff/statement/class_definition.py` match Black identically.	2023-08-01 13:33:01 -04:00
konsti	1df7e9831b	Replace `.map_or(false, $closure)` with `.is_some_and(closure)` (#6244 ) Summary [Option::is_some_and](https://doc.rust-lang.org/stable/std/option/enum.Option.html#method.is_some_and) and [Result::is_ok_and](https://doc.rust-lang.org/std/result/enum.Result.html#method.is_ok_and) are new methods is rust 1.70. I find them way more readable than `.map_or(false, ...)`. The changes are `s/.map_or(false,/.is_some_and(/g`, then manually switching to `is_ok_and` where the value is a Result rather than an Option. Test Plan n/a^	2023-08-01 19:29:42 +02:00
Micha Reiser	debfca3a11	Remove `Parse` trait (#6235 )	2023-08-01 18:35:03 +02:00
Charlie Marsh	928ab63a64	Add empty lines before nested functions and classes (#6206 ) ## Summary This PR ensures that if a function or class is the first statement in a nested suite that _isn't_ a function or class body, we insert a leading newline. For example, given: ```python def f(): if True: def register_type(): pass ``` We _want_ to preserve the newline, whereas today, we remove it. Note that this only applies when the function or class doesn't have any leading comments. Closes https://github.com/astral-sh/ruff/issues/6066.	2023-08-01 15:30:59 +00:00
Micha Reiser	f45e8645d7	Remove unused parser modes <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR removes the `Interactive` and `FunctionType` parser modes that are unused by ruff <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan `cargo test` <!-- How was it tested? -->	2023-08-01 13:10:07 +02:00
Micha Reiser	7c7231db2e	Remove unsupported `type_comment` field <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR removes the `type_comment` field which our parser doesn't support. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan `cargo test` <!-- How was it tested? -->	2023-08-01 12:53:13 +02:00
Micha Reiser	4ad5903ef6	Delete type-ignore node <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR removes the type ignore node from the AST because our parser doesn't support it, and just having it around is confusing. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan `cargo build` <!-- How was it tested? -->	2023-08-01 12:34:50 +02:00
konsti	c6986ac95d	Consistent `CommentPlacement` conversion signatures (#6231 ) Summary Allow passing any node to `CommentPlacement::{leading, trailing, dangling}` without manually converting. Conversely, Restrict the comment to the only type we actually pass. Test Plan No changes.	2023-08-01 12:01:17 +02:00
David Szotten	07468f8be9	format ExprJoinedStr (#5932 )	2023-08-01 08:26:30 +02:00
Micha Reiser	38b5726948	formatter: `WithNodeLevel` helper (#6212 )	2023-07-31 21:22:17 +00:00
Charlie Marsh	615337a54d	Remove newline-insertion logic from `JoinNodesBuilder` (#6205 ) ## Summary This PR moves the "insert empty lines" behavior out of `JoinNodesBuilder` and into the `Suite` formatter. I find it a little confusing that the logic is split between those two formatters right now, and since this is _only_ used in that one place, IMO it is a bit simpler to just inline it and use a single approach to tracking state (right now, both are stateful). The only other place this was used was for decorators. As a side effect, we now remove blank lines in both of these cases, which is a known but intentional deviation from Black (which preserves the empty line before the comment in the first case): ```python @foo # Hello @bar def baz(): pass @foo @bar def baz(): pass ```	2023-07-31 16:58:15 -04:00
konsti	a7aa3caaae	Rename formatter_progress to formatter_ecosystem_checks (#6194 ) Rename the `scripts/formatter_progress.sh` to `formatter/formatter_ecosysytem_checks.sh` since it fits the actual task better.	2023-07-31 18:33:12 +00:00
konsti	9063f4524d	Fix formatting of trailing unescaped quotes in raw triple quoted strings (#6202 ) Summary This prevents us from turning `r'''\""'''` into `r"""\"""""`, which is invalid syntax. This PR fixes CI, which is currently broken on main (in a way that still passes on linter PRs and allows merging formatter PRs, but it's bad to have a job be red). Once merged, i'll make the formatted ecosystem checks a required check. Test Plan Added a regression test.	2023-07-31 19:25:16 +02:00
Charlie Marsh	7eb2ba47cc	Add empty line after `import` block (#6200 ) ## Summary Ensures that, given: ```python import os x = 1 ``` We format like: ```python import os x = 1 ```	2023-07-31 12:01:45 -04:00
Harutaka Kawamura	0274de1fff	Preserve backslash in raw string literal (#6152 )	2023-07-31 12:48:17 +00:00
konsti	a540933bc9	Print log when formatter ecosystem checks fail (#6187 ) Summary Print the errors when the formatter ecosystem checks failed. Im not happy that we current collect the log in the first place, but this is the less invasive change and we need it to unblock reviewing #6152. Test Plan https://github.com/astral-sh/ruff/actions/runs/5713112075/job/15477879403?pr=6188	2023-07-31 14:45:38 +02:00
Micha Reiser	311a1f9ec4	Remove `len` from `JoinCommaSeparatedBuilder` (#6185 )	2023-07-31 12:19:47 +00:00
Luc Khai Hai	b95fc6d162	Format bytes string (#6166 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary Format bytes string Closes #6064 ## Test Plan Added a fixture based on string's one	2023-07-31 10:46:40 +02:00
Charlie Marsh	76741cac77	Add `global` and `nonlocal` formatting (#6170 ) ## Summary Adds `global` and `nonlocal` formatting, without the "deviation from black" outlined in the linked issue, which I'll do separately. See: https://github.com/astral-sh/ruff/issues/4798. ## Test Plan Added a fixture in the Ruff-specific directory since the Black fixtures don't seem to cover this.	2023-07-29 14:39:42 +00:00
Charlie Marsh	5d9814d84d	Remove parentheses around some walrus operators (#6173 ) ## Summary Closes https://github.com/astral-sh/ruff/issues/5781 ## Test Plan Added cases to `crates/ruff_python_formatter/resources/test/fixtures/ruff/expression/named_expr.py` one-by-one and adjusted the condition as needed.	2023-07-29 10:06:26 -04:00
Charlie Marsh	646ff6497c	Ignore end-of-line file exemption comments (#6160 ) ## Summary This PR protects against code like: ```python from typing import Optional import bar # ruff: noqa import baz class Foo: x: Optional[str] = None ``` In which the user wrote `# ruff: noqa` to ignore a specific error, not realizing that it was a file-level exemption that thus turned off all lint rules. Specifically, if a `# ruff: noqa` directive is not at the start of a line, we now ignore it and warn, since this is almost certainly a mistake.	2023-07-29 00:40:32 +00:00
qdegraaf	0638a26347	Add `AnyExpressionYield` to consolidate `ExprYield` and `ExprYieldFrom` (#6127 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-07-27 16:01:16 +00:00
Micha Reiser	6bf6646c5d	Respect indent when measuring with `MeasureMode::AllLines` (#6120 )	2023-07-27 10:22:13 -04:00
konsti	9574ff3dc7	Unbreak main (#6123 ) This fixes main breaking due to two merges.	2023-07-27 10:22:13 -04:00
konsti	06d9ff9577	Don't format trailing comma for lambda arguments (#5946 ) Summary lambda arguments don't have parentheses, so they shouldn't get a magic trailing comma either. This fixes some unstable formatting Test Plan Added a regression test. 89 (from previously 145) instances of unstable formatting remaining. ``` $ cargo run --bin ruff_dev --release -- format-dev --stability-check --error-file formatter-ecosystem-errors.txt --multi-project target/checkouts > formatter-ecosystem-progress.txt $ rg "Unstable formatting" target/formatter-ecosystem-errors.txt \| wc -l 89 ``` Closes #5892	2023-07-27 10:22:13 -04:00
Micha Reiser	40f54375cb	Pull in RustPython parser (#6099 )	2023-07-27 09:29:11 +00:00
konsti	13f9a16e33	Rewrite placement logic (#6040 ) ## Summary This is a rewrite of the main comment placement logic. `place_comment` now has three parts: - place own line comments - between branches - after a branch - place end-of-line comments - after colon - after a branch - place comments for specific nodes (that include module level comments) The rewrite fixed three bugs: `class A: # trailing comment` comments now stay end-of-line, `try: # comment` remains end-of-line and deeply indented try-else-finally comments remain with the right nested statement. It will be much easier to give more alternative branches nodes since this is abstracted away by `is_node_with_body` and the first/last child helpers. Adding new node types can now be done by adding an entry to the `place_comment` match. The code went from 1526 lines before #6033 to 1213 lines now. It thinks it easier to just read the new `placement.rs` rather than reviewing the diff. ## Test Plan The existing fixtures staying the same or improving plus new ones for the bug fixes.	2023-07-26 16:21:23 +00:00
Micha Reiser	2cf00fee96	Remove parser dependency from ruff-python-ast (#6096 )	2023-07-26 17:47:22 +02:00
Dhruv Manilawala	025fa4eba8	Integrate the new Jupyter AST nodes in Ruff (#6086 ) ## Summary This PR adds the implementation for the new Jupyter AST nodes i.e., `ExprLineMagic` and `StmtLineMagic`. ## Test Plan Add test cases for `unparse` containing magic commands resolves: #6087	2023-07-26 08:20:30 +00:00
Harutaka Kawamura	62f821daaa	Avoid raising PT012 for simple `with` statements (#6081 )	2023-07-26 01:43:31 +00:00
Zanie Blue	389fe13c93	Implement visitation of type aliases and parameters (#5927 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary <!-- What's the purpose of the change? What does it do, and why? --> Part of #5062 Requires https://github.com/astral-sh/RustPython-Parser/pull/32 Adds visitation of type alias statements and type parameters in class and function definitions. Duplicates tests for `PreorderVisitor` into `Visitor` with new snapshots. Testing required node implementations for the `TypeParam` enum, which is a chunk of the diff and the reason we need `Ranged` implementations in https://github.com/astral-sh/RustPython-Parser/pull/32. ## Test Plan <!-- How was it tested? --> Adds unit tests with snapshots.	2023-07-25 17:11:26 +00:00
Chris Pryer	f5c69c1b34	Update `ArgumentsParentheses` usage (#6070 )	2023-07-25 18:03:48 +02:00
konsti	e7f228f781	Placement refactor (#6034 ) ## Summary This PR is a refactoring of placement.rs. The code got more consistent, some comments were updated and some dead code was removed or replaced with debug assertions. It also contains a bugfix for the placement of end-of-branch comments with nested bodies inside try statements that occurred when refactoring the nested body loop. ## Test Plan The existing test cases don't change. I added a couple of cases that i think should be tested but weren't, and a regression test for the bugfix	2023-07-25 11:49:05 +02:00
konsti	7f3797185c	Fix formatter with-statement after-as own line comment instability (#6033 ) Summary Fix an instability in with statement formatter when there is an own line comment as the `as` ```python with ( a as # bad comment b): ``` Test Plan Added the comment to the test cases.	2023-07-24 18:12:07 +00:00
konsti	a9f535997d	Document formatter progress scripts (#6035 ) ## Summary Add documentation to the formatter progress scripts ## Test Plan n/a	2023-07-24 19:42:20 +02:00
Micha Reiser	fdb3c8852f	Prefer breaking the implicit string concatenation over breaking before `%` (#5947 )	2023-07-24 18:30:42 +02:00
Chris Pryer	8eadacda33	Update `TupleParentheses` usage (#5810 )	2023-07-24 14:44:36 +00:00
Luc Khai Hai	dfa81b6fe0	Format numeric constants (#5972 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-07-24 07:04:40 +00:00
konsti	46f8961292	Formatter: Add EmptyWithDanglingComments helper (#5951 ) Summary Add a `EmptyWithDanglingComments` format helper that formats comments inside empty parentheses, brackets or curly braces. Previously, this was implemented separately, and partially incorrectly, for each use case. Empty `()`, `[]` and `{}` are special because there can be dangling comments, and they can be in two positions: ```python x = [ # end-of-line # own line ] ``` These comments are dangling because they can't be assigned to any element inside as they would in all other cases. Test Plan Added a regression test. 145 (from previously 149) instances of unstable formatting remaining. ``` $ cargo run --bin ruff_dev --release -- format-dev --stability-check --error-file formatter-ecosystem-errors.txt --multi-project target/checkouts > formatter-ecosystem-progress.txt $ rg "Unstable formatting" target/formatter-ecosystem-errors.txt \| wc -l 145 ```	2023-07-23 14:32:16 +02:00
konsti	972f9a9c15	Fix formatting lambda with empty arguments (#5944 ) Summary Fix implemented in https://github.com/astral-sh/RustPython-Parser/pull/35: Previously, empty lambda arguments (e.g. `lambda: 1`) would get the range of the entire expression, which leads to incorrect comment placement. Now empty lambda arguments get an empty range between the `lambda` and the `:` tokens. Test Plan Added a regression test. 149 instances of unstable formatting remaining. ``` $ cargo run --bin ruff_dev --release -- format-dev --stability-check --error-file formatter-ecosystem-errors.txt --multi-project target/checkouts > formatter-ecosystem-progress.txt $ rg "Unstable formatting" target/formatter-ecosystem-errors.txt \| wc -l 149 ```	2023-07-21 15:48:45 +02:00
qdegraaf	519dbdffaa	Format `ExprYield`/`ExprYieldFrom` (#5921 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-07-21 12:07:51 +00:00
konsti	c3b506fca6	Add script to shrink all formatter errors (#5943 ) Summary Add script to shrink all formatter errors: This started as a fun idea and turned out really useful: This script gives us a single Python file with all formatter stability errors. I want to keep it around to occasionally update #5828 so I added it to the git. Test Plan None, this is a helper script	2023-07-21 11:32:35 +02:00
konsti	f6b40a021f	Document shrinking script (#5942 ) Summary Document shrinking script: I thinks it's both in a good enough state and valuable enough to document it's usage.	2023-07-21 11:32:26 +02:00
Luc Khai Hai	b866cbb33d	Improve slice formatting (#5922 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary - Remove space when start of slice is empty - Treat unary op except `not` as simple expression ## Test Plan Add some simple tests for unary op expressions in slice Closes #5673	2023-07-20 15:05:18 +00:00
Micha Reiser	eeb8a5fe0a	Avoid line break before `for` in comprehension if outer expression expands (#5912 )	2023-07-20 10:07:22 +00:00
Micha Reiser	76e9ce6dc0	Fix `SimpleTokenizer`'s backward lexing of `# ` (#5878 )	2023-07-20 11:54:18 +02:00
konsti	8c5f8a8aef	Formatter: Small RParen refactoring (#5885 ) ## Summary A bit more consistency inspired by https://github.com/astral-sh/ruff/pull/5882#discussion_r1268182403 ## Test Plan Existing tests (refactoring)	2023-07-20 11:30:39 +02:00
Chris Pryer	9e32585cb1	Use `dangling_node_comments` in `lambda` formatting (#5903 )	2023-07-20 08:52:32 +02:00
Charlie Marsh	5f3da9955a	Rename `ruff_python_whitespace` to `ruff_python_trivia` (#5886 ) ## Summary This crate now contains utilities for dealing with trivia more broadly: whitespace, newlines, "simple" trivia lexing, etc. So renaming it to reflect its increased responsibilities. To avoid conflicts, I've also renamed `Token` and `TokenKind` to `SimpleToken` and `SimpleTokenKind`.	2023-07-19 11:48:27 -04:00
konsti	a227775f62	Type alias stub for formatter (#5880 ) Summary This replaces the `todo!()` with a type alias stub in the formatter. I added the tests from `704eb40108/parser/src/parser.rs (L901-L936)` as ruff python formatter tests. Test Plan None, testing is part of the actual implementation	2023-07-19 17:28:07 +02:00
konsti	a51606a10a	Handle parentheses when formatting slice expressions (#5882 ) Summary Fix the formatter crash with `x[(1) :: ]` and related code. Problem For assigning comments in slices in subscripts, we need to find the positions of the colons to assign comments before and after the colon to the respective lower/upper/step node (or dangling in that section). Formatting `x[(1) :: ]` was broken because we were looking for a `:` after the `1` but didn't consider that there could be a `)` outside the range of the lower node, which contains just the `1` and no optional parentheses. Solution Use the simple tokenizer directly and skip all closing parentheses. Test Plan I added regression tests. Closes #5733	2023-07-19 15:25:25 +00:00
konsti	63ed7a31e8	Add message to formatter SyntaxError (#5881 ) Summary Add a static string error message to the formatter syntax error so we can disambiguate where the syntax error came from Test Plan No fixed tests, we don't expect this to occur, but it helped with transformers syntax error debugging: ``` Error: Failed to format node Caused by: syntax error: slice first colon token was not a colon ```	2023-07-19 17:15:26 +02:00
Chris Pryer	9fb8d6e999	Omit tuple parentheses inside comprehensions (#5790 )	2023-07-19 12:05:38 +00:00
Chris Pryer	38678142ed	Format `lambda` expression (#5806 )	2023-07-19 11:47:56 +00:00
David Szotten	5d68ad9008	Format expr generator exp (#5804 )	2023-07-19 13:01:58 +02:00
Charlie Marsh	4204fc002d	Remove exception-handler lexing from `unused-bound-exception` fix (#5851 ) ## Summary The motivation here is that it will make this rule easier to rewrite as a deferred check. Right now, we can't run this rule in the deferred phase, because it depends on the `except_handler` to power its autofix. Instead of lexing the `except_handler`, we can use the `SimpleTokenizer` from the formatter, and just lex forwards and backwards. For context, this rule detects the unused `e` in: ```python try: pass except ValueError as e: pass ```	2023-07-18 18:27:46 +00:00
konsti	5d41c832ad	Formatter: Run generate.py for ElifElseClauses (#5864 ) Summary This removes the diff for the next user of `generate.py`. It's effectively a refactoring. Test Plan No functional changes	2023-07-18 17:17:17 +02:00
Micha Reiser	3b32e3a8fe	perf(formatter): Improve `is_expression_parenthesized` performance (#5825 )	2023-07-18 15:48:49 +02:00
konsti	730e6b2b4c	Refactor `StmtIf`: Formatter and Linter (#5459 ) ## Summary Previously, `StmtIf` was defined recursively as ```rust pub struct StmtIf { pub range: TextRange, pub test: Box<Expr>, pub body: Vec<Stmt>, pub orelse: Vec<Stmt>, } ``` Every `elif` was represented as an `orelse` with a single `StmtIf`. This means that this representation couldn't differentiate between ```python if cond1: x = 1 else: if cond2: x = 2 ``` and ```python if cond1: x = 1 elif cond2: x = 2 ``` It also makes many checks harder than they need to be because we have to recurse just to iterate over an entire if-elif-else and because we're lacking nodes and ranges on the `elif` and `else` branches. We change the representation to a flat ```rust pub struct StmtIf { pub range: TextRange, pub test: Box<Expr>, pub body: Vec<Stmt>, pub elif_else_clauses: Vec<ElifElseClause>, } pub struct ElifElseClause { pub range: TextRange, pub test: Option<Expr>, pub body: Vec<Stmt>, } ``` where `test: Some(_)` represents an `elif` and `test: None` an else. This representation is different tradeoff, e.g. we need to allocate the `Vec<ElifElseClause>`, the `elif`s are now different than the `if`s (which matters in rules where want to check both `if`s and `elif`s) and the type system doesn't guarantee that the `test: None` else is actually last. We're also now a bit more inconsistent since all other `else`, those from `for`, `while` and `try`, still don't have nodes. With the new representation some things became easier, e.g. finding the `elif` token (we can use the start of the `ElifElseClause`) and formatting comments for if-elif-else (no more dangling comments splitting, we only have to insert the dangling comment after the colon manually and set `leading_alternate_branch_comments`, everything else is taken of by having nodes for each branch and the usual placement.rs fixups). ## Merge Plan This PR requires coordination between the parser repo and the main ruff repo. I've split the ruff part, into two stacked PRs which have to be merged together (only the second one fixes all tests), the first for the formatter to be reviewed by @michareiser and the second for the linter to be reviewed by @charliermarsh. * MH: Review and merge https://github.com/astral-sh/RustPython-Parser/pull/20 * MH: Review and merge or move later in stack https://github.com/astral-sh/RustPython-Parser/pull/21 * MH: Review and approve https://github.com/astral-sh/RustPython-Parser/pull/22 * MH: Review and approve formatter PR https://github.com/astral-sh/ruff/pull/5459 * CM: Review and approve linter PR https://github.com/astral-sh/ruff/pull/5460 * Merge linter PR in formatter PR, fix ecosystem checks (ecosystem checks can't run on the formatter PR and won't run on the linter PR, so we need to merge them first) * Merge https://github.com/astral-sh/RustPython-Parser/pull/22 * Create tag in the parser, update linter+formatter PR * Merge linter+formatter PR https://github.com/astral-sh/ruff/pull/5459 --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-07-18 13:40:15 +02:00
Chris Pryer	167b9356fa	Update from `join_with` example to `join_comma_separated` (#5843 ) ## Summary Originally `join_with` was used in the formatters README.md. Now it uses ```rs f.join_comma_separated(item.end()) .nodes(elts.iter()) .finish() ``` ## Test Plan None	2023-07-18 11:03:16 +02:00
konsti	d098256c96	Add a tool for shrinking failing examples (#5731 ) ## Summary For formatter instabilities, the message we get look something like this: ```text Unstable formatting /home/konsti/ruff/target/checkouts/deepmodeling:dpdispatcher/dpdispatcher/slurm.py @@ -47,9 +47,9 @@ - script_header_dict["slurm_partition_line"] = ( - NOT_YET_IMPLEMENTED_ExprJoinedStr - ) + script_header_dict[ + "slurm_partition_line" + ] = NOT_YET_IMPLEMENTED_ExprJoinedStr Unstable formatting /home/konsti/ruff/target/checkouts/deepmodeling:dpdispatcher/dpdispatcher/pbs.py @@ -26,9 +26,9 @@ - pbs_script_header_dict["select_node_line"] += ( - NOT_YET_IMPLEMENTED_ExprJoinedStr - ) + pbs_script_header_dict[ + "select_node_line" + ] += NOT_YET_IMPLEMENTED_ExprJoinedStr ``` For ruff crashes. you don't even get that but just the file that crashed it. To extract the actual bug, you'd need to manually remove parts of the file, rerun to see if the bug still occurs (and revert if it doesn't) until you have a minimal example. With this script, you run ```shell cargo run --bin ruff_shrinking -- target/checkouts/deepmodeling:dpdispatcher/dpdispatcher/slurm.py target/minirepo/code.py "Unstable formatting" "target/debug/ruff_dev format-dev --stability-check target/minirepo" ``` and get ```python class Slurm(): def gen_script_header(self, job): if resources.queue_name != "": script_header_dict["slurm_partition_line"] = f"#SBATCH --partition {resources.queue_name}" ``` which is an nice minimal example. I've been using this script and it would be easier for me if this were part of main. The main disadvantage to merging is that it adds additional dependencies. ## Test Plan I've been using this for a number of minimization. This is an internal helper script you only run manually. I could add a test that minimizes a rule violation if required. --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-07-18 08:03:35 +00:00
David Szotten	52aa2fc875	upgrade rustpython to remove tuple-constants (#5840 ) c.f. https://github.com/astral-sh/RustPython-Parser/pull/28 Tests: No snapshots changed --------- Co-authored-by: Zanie <contact@zanie.dev>	2023-07-17 22:50:31 +00:00
konsti	7dd30f0270	Read black options in format_dev script (#5827 ) ## Summary Comparing repos with black requires that we use the settings as black, notably line length and magic trailing comma behaviour. Excludes and preserving quotes (vs. a preference for either quote style) is not yet implemented because they weren't needed for the test projects. In the other two commits i fixed the output when the progress bar is hidden (this way is recommonded in the indicatif docs), added a `scratch.pyi` file to gitignore because black formats stub files differently and also updated the ecosystem readme with the projects json without forks. ## Test Plan I added a `line-length` vs `line_length` test. Otherwise only my personal usage atm, a PR to integrate the script into the CI to check some projects will follow.	2023-07-17 13:29:43 +00:00
Micha Reiser	21063544f7	Fix formatter `generate.py` (#5829 )	2023-07-17 10:41:27 +00:00
Luc Khai Hai	fb336898a5	Format `AsyncFor` (#5808 )	2023-07-17 10:38:59 +02:00
Chris Pryer	1dd52ad139	Update generate.py comment (#5809 ) ## Summary The generated comment is different from the generate files current comment. ## Test Plan None	2023-07-16 11:51:30 -04:00
Micha Reiser	df2efe81c8	Respect magic trailing comma for set expression (#5782 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR uses the `join_comma_separated` builder for formatting set expressions to ensure the formatting preserves magic commas, if the setting is enabled. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan See the fixed black tests <!-- How was it tested? -->	2023-07-15 16:40:38 +00:00
Chris Pryer	fa4855e6fe	Format `DictComp` expression (#5771 ) ## Summary Format `DictComp` like `ListComp` from #5600. It's not 100%, but I figured maybe it's worth starting to explore. ## Test Plan Added ruff fixture based on `ListComp`'s.	2023-07-15 17:35:23 +01:00
Micha Reiser	3cda89ecaf	Parenthesize with statements (#5758 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR improves the parentheses handling for with items to get closer to black's formatting. ### Case 1: ```python # Black / Input with ( [ "aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa", "bbbbbbbbbb", "cccccccccccccccccccccccccccccccccccccccccc", dddddddddddddddddddddddddddddddd, ] as example1, aaaaaaaaaaaaaaaaaaaaaaaaaa + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb + cccccccccccccccccccccccccccc + ddddddddddddddddd as example2, CtxManager2() as example2, CtxManager2() as example2, CtxManager2() as example2, ): ... # Before with ( [ "aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa", "bbbbbbbbbb", "cccccccccccccccccccccccccccccccccccccccccc", dddddddddddddddddddddddddddddddd, ] as example1, ( aaaaaaaaaaaaaaaaaaaaaaaaaa + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb + cccccccccccccccccccccccccccc + ddddddddddddddddd ) as example2, CtxManager2() as example2, CtxManager2() as example2, CtxManager2() as example2, ): ... ``` Notice how Ruff wraps the binary expression in an extra set of parentheses ### Case 2: Black does not expand the with-items if the with has no parentheses: ```python # Black / Input with aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb as c: ... # Before with ( aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb as c ): ... ``` Or ```python # Black / Input with [ "aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa", "bbbbbbbbbb", "cccccccccccccccccccccccccccccccccccccccccc", dddddddddddddddddddddddddddddddd, ] as example1, aaaaaaaaaaaaaaaaaaaaaaaaaa * bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb * cccccccccccccccccccccccccccc + ddddddddddddddddd as example2, CtxManager222222222222222() as example2: ... # Before (Same as Case 1) with ( [ "aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa", "bbbbbbbbbb", "cccccccccccccccccccccccccccccccccccccccccc", dddddddddddddddddddddddddddddddd, ] as example1, ( aaaaaaaaaaaaaaaaaaaaaaaaaa * bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb * cccccccccccccccccccccccccccc + ddddddddddddddddd ) as example2, CtxManager222222222222222() as example2, ): ... ``` ## Test Plan I added new snapshot tests Improves the django similarity index from 0.973 to 0.977	2023-07-15 16:03:09 +01:00
Luc Khai Hai	e1c119fde3	Format `SetComp` (#5774 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary Format `SetComp` like `ListComp`. ## Test Plan Derived from `ListComp`'s fixture.	2023-07-15 15:50:47 +01:00
Micha Reiser	8187bf9f7e	Cover Black's `is_aritmetic_like` formatting (#5738 )	2023-07-14 17:54:58 +02:00
konsti	fb46579d30	Add Regression test for #5605 , where formatting `x[:,]` failed. (#5759 ) #5605 has been fixed, i added the failing example from the issue as a regression test. Closes #5605	2023-07-14 11:55:05 +02:00
Chris Pryer	a961f75e13	Format `assert` statement (#5168 )	2023-07-14 09:01:33 +02:00
Charlie Marsh	e7b059cc5c	Fix nested lists in CONTRIBUTING.md (#5721 ) ## Summary We have a lot of two-space-indented stuff, but apparently it needs to be four-space indented to render as expected in MkDocs.	2023-07-13 16:32:59 +00:00
Micha Reiser	5dd5ee0c5b	Properly group assignment targets (#5728 )	2023-07-13 16:00:49 +02:00
konsti	549173b395	Fix `StmtAnnAssign` formatting by mirroring `StmtAssign` (#5732 ) ## Summary `StmtAnnAssign` would not insert parentheses when breaking the same way `StmtAssign` does, causing unstable formatting and likely some syntax errors. ## Test Plan I added a regression test.	2023-07-13 10:51:25 +00:00
konsti	68e0f97354	Formatter: Better f-string dummy (#5730 ) ## Summary The previous dummy was causing instabilities since it turned a string into a variable. E.g. ```python script_header_dict[ "slurm_partition_line" ] = f"#SBATCH --partition {resources.queue_name}" ``` has an instability as ```python - script_header_dict["slurm_partition_line"] = ( - NOT_YET_IMPLEMENTED_ExprJoinedStr - ) + script_header_dict[ + "slurm_partition_line" + ] = NOT_YET_IMPLEMENTED_ExprJoinedStr ``` ## Test Plan The instability is gone, otherwise it's still a dummy	2023-07-13 09:27:25 +00:00
Micha Reiser	067b2a6ce6	Pass parent to `NeedsParentheses` (#5708 )	2023-07-13 08:57:29 +02:00
Charlie Marsh	6dbc6d2e59	Use shared `Cursor` across crates (#5715 ) ## Summary We have two `Cursor` implementations. This PR moves the implementation from the formatter into `ruff_python_whitespace` (kind of a poorly-named crate now) and uses it for both use-cases.	2023-07-12 21:09:27 +00:00
Micha Reiser	653429bef9	Handle right parens in join comma builder (#5711 )	2023-07-12 18:21:28 +02:00
konsti	f0aa6bd4d3	Document ruff_dev and format_dev (#5648 ) ## Summary Document all `ruff_dev` subcommands and document the `format_dev` flags in the formatter readme. CC @zanieb please flag everything that isn't clear or missing ## Test Plan n/a	2023-07-12 16:18:22 +02:00
Micha Reiser	30bec3fcfa	Only omit optinal parens if the expression ends or starts with a parenthesized expression <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR matches Black' behavior where it only omits the optional parentheses if the expression starts or ends with a parenthesized expression: ```python a + [aaa, bbb, cccc] * c # Don't omit [aaa, bbb, cccc] + a * c # Split a + c * [aaa, bbb, ccc] # Split ``` <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan This improves the Jaccard index from 0.945 to 0.946	2023-07-11 17:05:25 +02:00
Micha Reiser	8b9193ab1f	Improve comprehension line break beheavior <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR improves the Black compatibility when it comes to breaking comprehensions. We want to avoid line breaks before the target and `in` whenever possible. Furthermore, `if X is not None` should be grouped together, similar to other binary like expressions <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan `cargo test` <!-- How was it tested? -->	2023-07-11 16:51:24 +02:00
konsti	62a24e1028	Format `ModExpression` (#5689 ) ## Summary We don't use `ModExpression` anywhere but it's part of the AST, removes one `not_implemented_yet` and is a trivial 2-liner, so i implemented formatting for `ModExpression`. ## Test Plan None, this kind of node does not occur in file input. Otherwise all the tests for expressions	2023-07-11 16:41:10 +02:00
Micha Reiser	f1d367655b	Format `target: annotation = value?` expressions (#5661 )	2023-07-11 16:40:28 +02:00
konsti	0c8ec80d7b	Change lambda dummy to NOT_YET_IMPLEMENTED_lambda (#5687 ) This only changes the dummy to be easier to identify.	2023-07-11 13:16:18 +00:00
Micha Reiser	8665a1a19d	Pass `FormatContext` to `NeedsParentheses` <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary I started working on this because I assumed that I would need access to options inside of `NeedsParantheses` but it then turned out that I won't. Anyway, it kind of felt nice to pass fewer arguments. So I'm gonna put this out here to get your feedback if you prefer this over passing individual fiels. Oh, I sneeked in another change. I renamed `context.contents` to `source`. `contents` is too generic and doesn't tell you anything. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan It compiles	2023-07-11 14:28:50 +02:00
Micha Reiser	715250a179	Prefer expanding parenthesized expressions before operands <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR implements Black's behavior where it first splits off parenthesized expressions before splitting before operands to avoid unnecessary parentheses: ```python # We want if a + [ b, c ]: pass # Rather than if ( a + [b, c] ): pass ``` This is implemented by using the new IR elements introduced in #5596. * We give the group wrapping the optional parentheses an ID (`parentheses_id`) * We use `conditional_group` for the lower priority groups (all non-parenthesized expressions) with the condition that the `parentheses_id` group breaks (we want to split before operands only if the parentheses are necessary) * We use `fits_expanded` to wrap all other parenthesized expressions (lists, dicts, sets), to prevent that expanding e.g. a list expands the `parentheses_id` group. We gate the `fits_expand` to only apply if the `parentheses_id` group fits (because we prefer `a\n+[b, c]` over expanding `[b, c]` if the whole expression gets parenthesized). We limit using `fits_expanded` and `conditional_group` only to expressions that themselves are not in parentheses (checking the conditions isn't free) ## Test Plan It increases the Jaccard index for Django from 0.915 to 0.917 ## Incompatibilites There are two incompatibilities left that I'm aware of (there may be more, I didn't go through all snapshot differences). ### Long string literals I commented on the regression. The issue is that a very long string (or any content without a split point) may not fit when only breaking the right side. The formatter than inserts the optional parentheses. But this is kind of useless because the overlong string will still not fit, because there are no new split points. I think we should ignore this incompatibility for now ### Expressions on statement level I don't fully understand the logic behind this yet, but black doesn't break before the operators for the following example even though the expression exceeds the configured line width ```python aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa < bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb > ccccccccccccccccccccccccccccc == ddddddddddddddddddddd ``` But it would if the expression is used inside of a condition. What I understand so far is that Black doesn't insert optional parentheses on the expression statement level (and a few other places) and, therefore, only breaks after opening parentheses. I propose to keep this deviation for now to avoid overlong-lines and use the compatibility report to make a decision if we should implement the same behavior.	2023-07-11 14:07:39 +02:00
Micha Reiser	d30e9125eb	Extend formatter IR to support Black's expression formatting (#5596 )	2023-07-11 11:20:04 +00:00
konsti	212fd86bf0	Switch from jaccard index to similarity index (#5679 ) ## Summary The similarity index, the fraction of unchanged lines, is easier to understand than the jaccard index, the fraction between intersection and union. ## Test Plan I ran this on django and git a 0.945 index, meaning 5.5% of lines are currently reformatted when compared to black	2023-07-11 13:03:44 +02:00
David Szotten	4b58a9c092	formatter: tidy: list_comp is an expression, not a statement (#5677 )	2023-07-11 08:00:10 +00:00
konsti	b7794f855b	Format StmtAugAssign (#5655 ) ## Summary Format statements such as `tree_depth += 1`. This is a statement that does not allow any line breaks, the only thing to be mindful of is to parenthesize the assigned expression Jaccard index on django: 0.915 -> 0.918 ## Test Plan black tests, and two new tests, a basic one and one that ensures that the child gets parentheses. I ran the django stability check.	2023-07-11 09:06:23 +02:00
Chris Pryer	15c7b6bcf7	Format `delete` statement (#5169 )	2023-07-11 08:36:26 +02:00
David Szotten	1782fb8c30	format ExprListComp (#5600 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-07-11 06:35:51 +00:00
Micha Reiser	987111f5fb	Format `ExpressionStarred` nodes (#5654 )	2023-07-11 06:08:08 +00:00
Charlie Marsh	4dee49d6fa	Run nightly Clippy over the Ruff repo (#5670 ) ## Summary This is the result of running `cargo +nightly clippy --workspace --all-targets --all-features -- -D warnings` and fixing all violations. Just wanted to see if there were any interesting new checks on nightly 👀	2023-07-10 23:44:38 -04:00
Louis Dispa	e7e2f44440	Format `raise` statement (#5595 ) ## Summary This PR implements the formatting of `raise` statements. I haven't looked at the black implementation, this is inspired from from the `return` statements formatting. ## Test Plan The black differences with insta. I also compared manually some edge cases with very long string and call chaining and it seems to do the same formatting as black. There is one issue: ```python # input raise OsError( "aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa" ) from a.aaaaa(aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa).a(aaaa) # black raise OsError( "aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa" ) from a.aaaaa( aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa ).a( aaaa ) # ruff raise OsError( "aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa" ) from a.aaaaa( aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa ).a(aaaa) ``` But I'm not sure this diff is the raise formatting implementation. --------- Co-authored-by: Louis Dispa <ldispa@deezer.com>	2023-07-10 21:23:49 +02:00
konsti	cab3a507bc	Fix find_only_token_in_range with expression parentheses (#5645 ) ## Summary Fix an oversight in `find_only_token_in_range` where the following code would panic due do the closing and opening parentheses being in the range we scan: ```python d1 = [ ("a") if # 1 ("b") else # 2 ("c") ] ``` Closing and opening parentheses respectively are now correctly skipped. ## Test Plan I added a regression test	2023-07-10 15:55:19 +02:00
Micha Reiser	089a671adb	Fix Black compatible snapshot deletion (#5646 )	2023-07-10 15:00:18 +02:00
konsti	bd8f65814c	Format named expressions (walrus operator) (#5642 ) ## Summary Format named expressions (walrus operator) such a `value := f()`. Unlike tuples, named expression parentheses are not part of the range even when mandatory, so mapping optional parentheses to always gives us decent formatting without implementing all [PEP 572](https://peps.python.org/pep-0572/) rules on when we need parentheses where other expressions wouldn't. We might want to revisit this decision later and implement special cases, but for now this gives us what we need. ## Test Plan black fixtures, i added some fixtures and checked django and cpython for stability. Closes #5613	2023-07-10 12:32:15 +00:00
David Szotten	1e894f328c	formatter: multi char tokens in SimpleTokenizer (#5610 )	2023-07-10 09:00:59 +01:00
Dimitri Papadopoulos Orfanos	efe7c393d1	Fix typos found by codespell (#5607 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary Fix typos found by [codespell](https://github.com/codespell-project/codespell). I have left out `memoize` for now (see #5606). <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan CI tests. <!-- How was it tested? -->	2023-07-08 12:33:18 +02:00
konsti	0b9af031fb	Format ExprIfExp (ternary operator) (#5597 ) ## Summary Format `ExprIfExp`, also known as the ternary operator or inline `if`. It can look like ```python a1 = 1 if True else 2 ``` but also ```python b1 = ( # We return "a" ... "a" # that's our True value # ... if this condition matches ... if True # that's our test # ... otherwise we return "b§ else "b" # that's our False value ) ``` This also fixes a visitor order bug. The jaccard index on django goes from 0.911 to 0.915. ## Test Plan I added fixtures without and with comments in strange places.	2023-07-07 19:11:52 +00:00
konsti	0f9d7283e7	Add format-dev contributor docs (#5594 ) ## Summary This adds markdown-level docs for #5492 ## Test Plan n/a --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-07-07 16:52:13 +00:00
konsti	b22e6c3d38	Extend ruff_dev formatter script to compute statistics and format a project (#5492 ) ## Summary This extends the `ruff_dev` formatter script util. Instead of only doing stability checks, you can now choose different compatible options on the CLI and get statistics. * It adds an option the formats all files that ruff would check to allow looking at an entire black-formatted repository with `git diff` * It computes the [Jaccard index](https://en.wikipedia.org/wiki/Jaccard_index) as a measure of deviation between input and output, which is useful as single number metric for assessing our current deviations from black. * It adds progress bars to both the single projects as well as the multi-project mode. * It adds an option to write the multi-project output to a file Sample usage: ``` $ cargo run --bin ruff_dev -- format-dev --stability-check crates/ruff/resources/test/cpython $ cargo run --bin ruff_dev -- format-dev --stability-check /home/konsti/projects/django Syntax error in /home/konsti/projects/django/tests/test_runner_apps/tagged/tests_syntax_error.py: source contains syntax errors (parser error): BaseError { error: UnrecognizedToken(Name { name: "syntax_error" }, None), offset: 131, source_path: "<filename>" } Found 0 stability errors in 2755 files (jaccard index 0.911) in 9.75s $ cargo run --bin ruff_dev -- format-dev --write /home/konsti/projects/django ``` Options: ``` Several utils related to the formatter which can be run on one or more repositories. The selected set of files in a repository is the same as for `ruff check`. * Check formatter stability: Format a repository twice and ensure that it looks that the first and second formatting look the same. * Format: Format the files in a repository to be able to check them with `git diff` * Statistics: The subcommand the Jaccard index between the (assumed to be black formatted) input and the ruff formatted output Usage: ruff_dev format-dev [OPTIONS] [FILES]... Arguments: [FILES]... Like `ruff check`'s files. See `--multi-project` if you want to format an ecosystem checkout Options: --stability-check Check stability We want to ensure that once formatted content stays the same when formatted again, which is known as formatter stability or formatter idempotency, and that the formatter prints syntactically valid code. As our test cases cover only a limited amount of code, this allows checking entire repositories. --write Format the files. Without this flag, the python files are not modified --format <FORMAT> Control the verbosity of the output [default: default] Possible values: - minimal: Filenames only - default: Filenames and reduced diff - full: Full diff and invalid code -x, --exit-first-error Print only the first error and exit, `-x` is same as pytest --multi-project Checks each project inside a directory, useful e.g. if you want to check all of the ecosystem checkouts --error-file <ERROR_FILE> Write all errors to this file in addition to stdout. Only used in multi-project mode ``` ## Test Plan I ran this on django (2755 files, jaccard index 0.911) and discovered a magic trailing comma problem and that we really needed to implement import formatting. I ran the script on cpython to identify https://github.com/astral-sh/ruff/pull/5558.	2023-07-07 11:30:12 +00:00
Micha Reiser	40ddc1604c	Introduce `parenthesized` helper (#5565 )	2023-07-07 11:28:25 +02:00
konsti	5e5a96ca28	Fix formatter `StmtTry` test (#5568 ) For some reason this didn't turn up on CI before CC @michareiser this is the fix for the error you had	2023-07-06 18:23:53 +00:00
konsti	8184235f93	Try statements have a body: Fix formatter instability (#5558 ) ## Summary The following code was previously leading to unstable formatting: ```python try: try: pass finally: print(1) # issue7208 except A: pass ``` The comment would be formatted as a trailing comment of `try` which is unstable as an end-of-line comment gets two extra whitespaces. This was originally found in `99b00efd5e/Lib/getpass.py (L68-L91)` ## Test Plan I added a regression test	2023-07-06 16:07:47 +02:00
konsti	787e2fd49d	Format import statements (#5493 ) ## Summary Format import statements in all their variants. Specifically, this implemented formatting `StmtImport`, `StmtImportFrom` and `Alias`. ## Test Plan I added some custom snapshots, even though this has been covered well by black's tests.	2023-07-04 07:07:20 +00:00
konsti	a647f31600	Don't add a magic trailing comma for a single entry (#5463 ) ## Summary If a comma separated list has only one entry, black will respect the magic trailing comma, but it will not add a new one. The following code will remain as is: ```python b1 = [ aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa ] b2 = [ aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa, ] b3 = [ aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa, aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa ] ``` ## Test Plan This was first discovered in `7eeadc82c2/django/contrib/admin/checks.py (L674-L681)`, which i've minimized into a call test. I've added tests for the three cases (one entry + no comma, one entry + comma, more than one entry) to the list tests. The diffs from the black tests get smaller.	2023-07-03 21:48:44 +02:00
Louis Dispa	dc072537e5	Fix python_formatter generate.py with rust path (#5475 ) ## Summary This PR fix an issue with the `generate.py` file of the python formatter. Since https://github.com/astral-sh/ruff/pull/5369 the [node.rs file](`f51dc20497/crates/ruff_python_ast/src/node.rs`) used to generate the types now has `ast::` in the enum. ```rust pub enum AnyNode { ModModule(ModModule), ModInteractive(ModInteractive), ModExpression(ModExpression), ModFunctionType(ModFunctionType), ... ``` And now: ```rust pub enum AnyNode { ModModule(ast::ModModule), ModInteractive(ast::ModInteractive), ModExpression(ast::ModExpression), ModFunctionType(ast::ModFunctionType), ... ``` The python script was not parsing rust paths. This PR adds the possibility to have it. ## Test Plan This was tested locally. ### Script output Before ``` ['ast::ModModule),', 'ast::ModInteractive),', 'ast::ModExpression),', 'ast::ModFunctionType),', 'ast::StmtFunctionDef),', 'ast::StmtAsyncFunctionDef),', 'ast::StmtClassDef),', 'ast::StmtReturn),', 'ast::StmtDelete),', 'ast::StmtAssign),', 'ast::StmtAugAssign),', 'ast::StmtAnnAssign),', 'ast::StmtFor),', 'ast::StmtAsyncFor),', 'ast::StmtWhile),', 'ast::StmtIf),', 'ast::StmtWith),', 'ast::StmtAsyncWith),', 'ast::StmtMatch),', 'ast::StmtRaise),', 'ast::StmtTry),', 'ast::StmtTryStar),', 'ast::StmtAssert),', 'ast::StmtImport),', 'ast::StmtImportFrom),', 'ast::StmtGlobal),', 'ast::StmtNonlocal),', 'ast::StmtExpr),', 'ast::StmtPass),', 'ast::StmtBreak),', 'ast::StmtContinue),', 'ast::ExprBoolOp),', 'ast::ExprNamedExpr),', 'ast::ExprBinOp),', 'ast::ExprUnaryOp),', 'ast::ExprLambda),', 'ast::ExprIfExp),', 'ast::ExprDict),', 'ast::ExprSet),', 'ast::ExprListComp),', 'ast::ExprSetComp),', 'ast::ExprDictComp),', 'ast::ExprGeneratorExp),', 'ast::ExprAwait),', 'ast::ExprYield),', 'ast::ExprYieldFrom),', 'ast::ExprCompare),', 'ast::ExprCall),', 'ast::ExprFormattedValue),', 'ast::ExprJoinedStr),', 'ast::ExprConstant),', 'ast::ExprAttribute),', 'ast::ExprSubscript),', 'ast::ExprStarred),', 'ast::ExprName),', 'ast::ExprList),', 'ast::ExprTuple),', 'ast::ExprSlice),', 'ast::ExceptHandlerExceptHandler),', 'ast::PatternMatchValue),', 'ast::PatternMatchSingleton),', 'ast::PatternMatchSequence),', 'ast::PatternMatchMapping),', 'ast::PatternMatchClass),', 'ast::PatternMatchStar),', 'ast::PatternMatchAs),', 'ast::PatternMatchOr),', 'ast::TypeIgnoreTypeIgnore),', 'Comprehension),', 'Arguments),', 'Arg),', 'ArgWithDefault),', 'Keyword),', 'Alias),', 'WithItem),', 'MatchCase),', 'Decorator),'] error: unexpected closing delimiter: `)` --> <stdin>:3:55 \| 2 \| use ruff_formatter::{write, Buffer, FormatResult}; \| - this opening brace... - ...matches this closing brace 3 \| use rustpython_parser::ast::ast::ModModule),; \| ^ unexpected closing delimiter Traceback (most recent call last): File "/Users/ldispa/Documents/perso/ruff/crates/ruff_python_formatter/generate.py", line 100, in <module> node_path.write_text(rustfmt(code)) ^^^^^^^^^^^^^ File "/Users/ldispa/Documents/perso/ruff/crates/ruff_python_formatter/generate.py", line 12, in rustfmt return check_output(["rustfmt", "--emit=stdout"], input=code, text=True) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/opt/homebrew/Cellar/python@3.11/3.11.4_1/Frameworks/Python.framework/Versions/3.11/lib/python3.11/subprocess.py", line 466, in check_output return run(*popenargs, stdout=PIPE, timeout=timeout, check=True, ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/opt/homebrew/Cellar/python@3.11/3.11.4_1/Frameworks/Python.framework/Versions/3.11/lib/python3.11/subprocess.py", line 571, in run raise CalledProcessError(retcode, process.args, subprocess.CalledProcessError: Command '['rustfmt', '--emit=stdout']' returned non-zero exit status 1. ``` After: ``` ['ModModule', 'ModInteractive', 'ModExpression', 'ModFunctionType', 'StmtFunctionDef', 'StmtAsyncFunctionDef', 'StmtClassDef', 'StmtReturn', 'StmtDelete', 'StmtAssign', 'StmtAugAssign', 'StmtAnnAssign', 'StmtFor', 'StmtAsyncFor', 'StmtWhile', 'StmtIf', 'StmtWith', 'StmtAsyncWith', 'StmtMatch', 'StmtRaise', 'StmtTry', 'StmtTryStar', 'StmtAssert', 'StmtImport', 'StmtImportFrom', 'StmtGlobal', 'StmtNonlocal', 'StmtExpr', 'StmtPass', 'StmtBreak', 'StmtContinue', 'ExprBoolOp', 'ExprNamedExpr', 'ExprBinOp', 'ExprUnaryOp', 'ExprLambda', 'ExprIfExp', 'ExprDict', 'ExprSet', 'ExprListComp', 'ExprSetComp', 'ExprDictComp', 'ExprGeneratorExp', 'ExprAwait', 'ExprYield', 'ExprYieldFrom', 'ExprCompare', 'ExprCall', 'ExprFormattedValue', 'ExprJoinedStr', 'ExprConstant', 'ExprAttribute', 'ExprSubscript', 'ExprStarred', 'ExprName', 'ExprList', 'ExprTuple', 'ExprSlice', 'ExceptHandlerExceptHandler', 'PatternMatchValue', 'PatternMatchSingleton', 'PatternMatchSequence', 'PatternMatchMapping', 'PatternMatchClass', 'PatternMatchStar', 'PatternMatchAs', 'PatternMatchOr', 'TypeIgnoreTypeIgnore', 'Comprehension', 'Arguments', 'Arg', 'ArgWithDefault', 'Keyword', 'Alias', 'WithItem', 'MatchCase', 'Decorator'] ```	2023-07-03 16:07:57 +02:00
konsti	7ac9e0252e	Document Checking formatter stability and panics (#5415 ) This adds the documentation, but ideally we should add the CI first	2023-07-03 11:22:19 +02:00
konsti	ca6ff72404	Change generator formatting dummy to include NOT_YET_IMPLEMENTED (#5464 ) ## Summary Change generator formatting dummy to include `NOT_YET_IMPLEMENTED`. This makes it easier to correctly identify them as dummies ## Test Plan This is a dummy change	2023-07-03 09:11:14 +02:00
Anders Kaseorg	df13e69c3c	Format let-else with rustfmt nightly (#5461 ) Support for `let…else` formatting was just merged to nightly (rust-lang/rust#113225). Rerun `cargo fmt` with Rust nightly 2023-07-02 to pick this up. Followup to #939. Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2023-07-03 02:13:35 +00:00
Micha Reiser	f9129e435a	Normalize '\r' in string literals to '\n' <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR normalizes line endings inside of strings to `\n` as required by the printer. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan I added a new test using `\r\n` and ran the ecosystem check. There are no remaining end of line panics. https://gist.github.com/MichaReiser/8f36b1391ca7b48475b3a4f592d74ff4 <!-- How was it tested? -->	2023-06-30 10:13:23 +02:00
Micha Reiser	9c2a75284b	Preserve parentheses around left side of binary expression <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR fixes an issue where the binary expression formatting removed parentheses around the left hand side of an expression. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan I added a new regression test and re-ran the ecosystem check. It brings down the `check-formatter-stability` output from a 3.4MB file down to 900KB. <!-- How was it tested? -->	2023-06-30 09:52:14 +02:00
Micha Reiser	ae25638b0b	Update Black tests (#5438 )	2023-06-30 06:32:50 +00:00
Micha Reiser	955e9ef821	Fix invalid syntax for binary expression in unary op (#5370 )	2023-06-29 08:09:26 +02:00
Micha Reiser	38189ed913	Fix invalid printer IR error (#5422 )	2023-06-29 08:09:13 +02:00
David Szotten	ca5e10b5ea	format StmtTryStar (#5418 )	2023-06-29 08:07:33 +02:00
David Szotten	c7adb9117f	format StmtAsyncWith (#5376 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-06-28 10:21:44 +00:00
David Szotten	1979103ec0	Format `StmtTry` (#5222 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-06-28 10:02:15 +00:00
konstin	7f6cb9dfb5	Format call expressions (without call chaining) (#5341 ) ## Summary This formats call expressions with magic trailing comma and parentheses behaviour but without call chaining ## Test Plan Lots of new test fixtures, including some that don't work yet	2023-06-27 09:29:40 +00:00
David Szotten	50a7769d69	magic trailing comma for ExprList (#5365 )	2023-06-26 21:59:01 +02:00
Charlie Marsh	fa1b85b3da	Remove prelude from `ruff_python_ast` (#5369 ) ## Summary Per @MichaReiser, this is causing more confusion than it is helpful.	2023-06-26 11:43:49 -04:00
David Szotten	d00559e42a	format StmtWith (#5350 )	2023-06-26 15:09:06 +01:00
Micha Reiser	49cabca3e7	Format implicit string continuation (#5328 )	2023-06-26 12:41:47 +00:00
Micha Reiser	313711aaf9	Prefer the configured quote style <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR extends the string formatting to respect the configured quote style. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan Extended the string test with new cases and set it up to run twice: Once with the `quote_style: Doube`, and once with `quote_style: Single` single and double quotes. <!-- How was it tested? -->	2023-06-26 14:24:25 +02:00
Micha Reiser	f18a1f70de	Add tests for skip magic trailing comma <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR adds tests that verify that the magic trailing comma is not respected if disabled in the formatter options. Our test setup now allows to create a `<fixture-name>.options.json` file that contains an array of configurations that should be tested. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan It's all about tests :) <!-- How was it tested? -->	2023-06-26 14:15:55 +02:00
Micha Reiser	dd0d1afb66	Create `PyFormatOptions` <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR adds a new `PyFormatOptions` struct that stores the python formatter options. The new options aren't used yet, with the exception of magical trailing commas and the options passed to the printer. I'll follow up with more PRs that use the new options (e.g. `QuoteStyle`). <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan `cargo test` I'll follow up with a new PR that adds support for overriding the options in our fixture tests.	2023-06-26 14:02:17 +02:00
konstin	a52cd47c7f	Fix attribute chain own line comments (#5340 ) ## Motation Previously, ```python x = ( a1 .a2 # a . # b # c a3 ) ``` got formatted as ```python x = a1.a2 # a . # b # c a3 ``` which is invalid syntax. This fixes that. ## Summary This implements a basic form of attribute chaining (<https://black.readthedocs.io/en/stable/the_black_code_style/current_style.html#call-chains>) by checking if any inner attribute access contains an own line comment, and if this is the case, adds parentheses around the outermost attribute access while disabling parentheses for all inner attribute expressions. We want to replace this with an implementation that uses recursion or a stack while formatting instead of in `needs_parentheses` and also includes calls rather sooner than later, but i'm fixing this now because i'm uncomfortable with having known invalid syntax generation in the formatter. ## Test Plan I added new fixtures.	2023-06-26 09:13:07 +00:00
Micha Reiser	8879927b9a	Use `insta::glob` instead of `fixture` macro (#5364 )	2023-06-26 08:46:18 +00:00
Micha Reiser	d3d69a031e	Add `JoinCommaSeparatedBuilder` (#5342 )	2023-06-23 22:03:05 +01:00
konstin	4b65446de6	Refactor magic trailing comma (#5339 ) ## Summary This is small refactoring to reuse the code that detects the magic trailing comma across functions. I make this change now to avoid copying code in a later PR. @MichaReiser is planning on making a larger refactoring later that integrates with the join nodes builder ## Test Plan No functional changes. The magic trailing comma behaviour is checked by the fixtures.	2023-06-23 18:53:55 +02:00
Micha Reiser	2dfa6ff58d	Fix unstable set comprehension formatting (#5327 )	2023-06-23 11:50:24 +02:00
konstin	930f03de98	Don't mistake a following if for an elif (#5296 ) In the following code, the comment used to get wrongly associated with the `if False` since it looked like an elif. This fixes it by checking the indentation and adding a regression test ```python if True: pass else: # Comment if False: pass pass ``` Originally found in `1570b94a02/gradio/external.py (L478)`	2023-06-23 10:07:28 +02:00
Micha Reiser	c52aa8f065	Basic string formatting <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR implements formatting for non-f-string Strings that do not use implicit concatenation. Docstring formatting is out of the scope of this PR. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan I added a few tests for simple string literals. ## Performance Ouch. This is hitting performance somewhat hard. This is probably because we now iterate each string a couple of times: 1. To detect if it is an implicit string continuation 2. To detect if the string contains any new lines 3. To detect the preferred quote 4. To normalize the string Edit: I integrated the detection of newlines into the preferred quote detection so that we only iterate the string three time. We can probably do better by merging the implicit string continuation with the quote detection and new line detection by iterating till the end of the string part and returning the offset. We then use our simple tokenizer to skip over any comments or whitespace until we find the first non trivia token. From there we keep continue doing this in a loop until we reach the end o the string. I'll leave this improvement for later.	2023-06-23 09:46:05 +02:00
Micha Reiser	3e12bdff45	Format Compare Op <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR adds basic formatting for compare operations. The implementation currently breaks diffeently when nesting binary like expressions. I haven't yet figured out what Black's logic is in that case but I think that this by itself is already an improvement worth merging. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan I added a few new tests <!-- How was it tested? -->	2023-06-23 09:35:29 +02:00
konstin	d407165aa7	Fix formatter panic with comment after parenthesized dict value (#5293 ) ## Summary This snippet used to panic because it expected to see a comma or something similar after the `2` but met the closing parentheses that is not part of the range and panicked ```python a = { 1: (2), # comment 3: True, } ``` Originally found in `636a717ef0/testing/marionette/client/marionette_driver/geckoinstance.py (L109)` This snippet is also the test plan.	2023-06-22 16:52:48 +02:00
Micha Reiser	f7e1cf4b51	Format `class` definitions (#5289 )	2023-06-22 09:09:43 +00:00
konstin	7d4f8e59da	Improve FormatExprCall dummy (#5290 ) This solves an instability when formatting cpython. It also introduces another one, but i think it's still a worthwhile change for now. There's no proper testing since this is just a dummy.	2023-06-22 10:59:30 +02:00
Micha Reiser	ccf34aae8c	Format Attribute Expression (#5259 )	2023-06-21 21:33:53 +00:00
David Szotten	1eccbbb60e	Format StmtFor (#5163 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary format StmtFor still trying to learn how to help out with the formatter. trying something slightly more advanced than [break](#5158) mostly copied form StmtWhile ## Test Plan snapshots	2023-06-21 23:00:31 +02:00
konstin	9419d3f9c8	Special `ExprTuple` formatting option for `for`-loops (#5175 ) ## Motivation While black keeps parentheses nearly everywhere, the notable exception is in the body of for loops: ```python for (a, b) in x: pass ``` becomes ```python for a, b in x: pass ``` This currently blocks #5163, which this PR should unblock. ## Solution This changes the `ExprTuple` formatting option to include one additional option that removes the parentheses when not using magic trailing comma and not breaking. It is supposed to be used through ```rust #[derive(Debug)] struct ExprTupleWithoutParentheses<'a>(&'a Expr); impl Format<PyFormatContext<'_>> for ExprTupleWithoutParentheses<'_> { fn fmt(&self, f: &mut Formatter<PyFormatContext<'_>>) -> FormatResult<()> { match self.0 { Expr::Tuple(expr_tuple) => expr_tuple .format() .with_options(TupleParentheses::StripInsideForLoop) .fmt(f), other => other.format().with_options(Parenthesize::IfBreaks).fmt(f), } } } ``` ## Testing The for loop formatting isn't merged due to missing this (and i didn't want to create more git weirdness across two people), but I've confirmed that when applying this to while loops instead of for loops, then ```rust write!( f, [ text("while"), space(), ExprTupleWithoutParentheses(test.as_ref()), text(":"), trailing_comments(trailing_condition_comments), block_indent(&body.format()) ] )?; ``` makes ```python while (a, b): pass while ( ajssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssa, b, ): pass while (a,b,): pass ``` formatted as ```python while a, b: pass while ( ajssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssa, b, ): pass while ( a, b, ): pass ```	2023-06-21 21:17:47 +02:00
konstin	d7c7484618	Format function argument separator comments (#5211 ) ## Summary This is a complete rewrite of the handling of `/` and `*` comment handling in function signatures. The key problem is that slash and star don't have a note. We now parse out the positions of slash and star and their respective preceding and following note. I've left code comments for each possible case of function signature structure and comment placement ## Test Plan I extended the function statement fixtures with cases that i found. If you have more weird edge cases your input would be appreciated.	2023-06-21 17:56:47 +00:00
konstin	bc63cc9b3c	Fix remaining CPython formatter errors except for function argument separator comments (#5210 ) ## Summary This fixes two problems discovered when trying to format the cpython repo with `cargo run --bin ruff_dev -- check-formatter-stability projects/cpython`: The first is to ignore try/except trailing comments for now since they lead to unstable formatting on the dummy. The second is to avoid dropping trailing if comments through placement: This changes the placement to keep a comment trailing an if-elif or if-elif-else to keep the comment a trailing comment on the entire if. Previously the last comment would have been lost. ```python if "first if": pass elif "first elif": pass ``` The last remaining problem in cpython so far is function signature argument separator comment placement which is its own PR on top of this. ## Test Plan I added test fixtures of minimized examples with links back to the original cpython location	2023-06-21 19:45:53 +02:00
Micha Reiser	e47aa468d5	Format Identifier (#5255 )	2023-06-21 17:35:37 +02:00
konstin	6155fd647d	Format Slice Expressions (#5047 ) This formats slice expressions and subscript expressions. Spaces around the colons follows the same rules as black (https://black.readthedocs.io/en/stable/the_black_code_style/current_style.html#slices): ```python e00 = "e"[:] e01 = "e"[:1] e02 = "e"[: a()] e10 = "e"[1:] e11 = "e"[1:1] e12 = "e"[1 : a()] e20 = "e"[a() :] e21 = "e"[a() : 1] e22 = "e"[a() : a()] e200 = "e"[a() : :] e201 = "e"[a() :: 1] e202 = "e"[a() :: a()] e210 = "e"[a() : 1 :] ``` Comment placement is different due to our very different infrastructure. If we have explicit bounds (e.g. `x[1:2]`) all comments get assigned as leading or trailing to the bound expression. If a bound is missing `[:]`, comments get marked as dangling and placed in the same section as they were originally in: ```python x = "x"[ # a # b : # c # d ] ``` to ```python x = "x"[ # a # b : # c # d ] ``` Except for the potential trailing end-of-line comments, all comments get formatted on their own line. This can be improved by keeping end-of-line comments after the opening bracket or after a colon as such but the changes were already complex enough. I added tests for comment placement and spaces.	2023-06-21 15:09:39 +00:00
konstin	44156f6962	Improve debuggability of `place_comment` (#5209 ) ## Summary I found it hard to figure out which function decides placement for a specific comment. An explicit loop makes this easier to debug ## Test Plan There should be no functional changes, no changes to the formatting of the fixtures.	2023-06-21 09:52:13 +00:00
Micha Reiser	653dbb6d17	Format BoolOp (#4986 )	2023-06-21 09:27:57 +00:00
konstin	db301c14bd	Consistently name comment own line/end-of-line `line_position()` (#5215 ) ## Summary Previously, `DecoratedComment` used `text_position()` and `SourceComment` used `position()`. This PR unifies this to `line_position` everywhere. ## Test Plan This is a rename refactoring.	2023-06-21 11:04:56 +02:00
Micha Reiser	1336ca601b	Format `UnaryExpr` <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR adds basic formatting for unary expressions. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan I added a new `unary.py` with custom test cases	2023-06-21 10:09:47 +02:00
Micha Reiser	3973836420	Correctly handle left/right breaking of binary expression <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary Black supports for layouts when it comes to breaking binary expressions: ```rust #[derive(Copy, Clone, Debug, Eq, PartialEq)] enum BinaryLayout { /// Put each operand on their own line if either side expands Default, /// Try to expand the left to make it fit. Add parentheses if the left or right don't fit. /// ///```python /// [ /// a, /// b /// ] & c ///``` ExpandLeft, /// Try to expand the right to make it fix. Add parentheses if the left or right don't fit. /// /// ```python /// a & [ /// b, /// c /// ] /// ``` ExpandRight, /// Both the left and right side can be expanded. Try in the following order: /// * expand the right side /// * expand the left side /// * expand both sides /// /// to make the expression fit /// /// ```python /// [ /// a, /// b /// ] & [ /// c, /// d /// ] /// ``` ExpandRightThenLeft, } ``` Our current implementation only handles `ExpandRight` and `Default` correctly. This PR adds support for `ExpandRightThenLeft` and `ExpandLeft`. ## Test Plan I added tests that play through all 4 binary expression layouts.	2023-06-21 09:40:05 +02:00
Micha Reiser	e520a3a721	Fix ArgWithDefault comments handling (#5204 )	2023-06-20 20:48:07 +00:00
Micha Reiser	b369288833	Accept any `Into<AnyNodeRef>` as `Comments` arguments (#5205 )	2023-06-20 16:49:21 +00:00
Charlie Marsh	6331598511	Upgrade `RustPython` to access ranged names (#5194 ) ## Summary In https://github.com/astral-sh/RustPython-Parser/pull/8, we modified RustPython to include ranges for any identifiers that aren't `Expr::Name` (which already has an identifier). For example, the `e` in `except ValueError as e` was previously un-ranged. To extract its range, we had to do some lexing of our own. This change should improve performance and let us remove a bunch of code. ## Test Plan `cargo test`	2023-06-20 15:43:38 +00:00
David Szotten	773e79b481	basic formatting for ExprDict (#5167 )	2023-06-20 09:25:08 +00:00
Charlie Marsh	36e01ad6eb	Upgrade RustPython (#5192 ) ## Summary This PR upgrade RustPython to pull in the changes to `Arguments` (zip defaults with their identifiers) and all the renames to `CmpOp` and friends.	2023-06-19 21:09:53 +00:00
konstin	0e028142f4	Explain dangling comments in the formatter (#5170 ) This documentation change improves the section on dangling comments in the formatter. --------- Co-authored-by: David Szotten <davidszotten@gmail.com> Co-authored-by: Micha Reiser <micha@reiser.io>	2023-06-19 14:24:45 +02:00
Chris Pryer	195b36c429	Format `continue` statement (#5165 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary Format `continue` statement. ## Test Plan `continue` is used already in some tests, but if a new test is needed I could add it. --------- Co-authored-by: konstin <konstin@mailbox.org>	2023-06-18 11:25:59 +00:00
David Szotten	4b9b6829dc	format StmtBreak (#5158 ) ## Summary format `StmtBreak` trying to learn how to help out with the formatter. starting simple ## Test Plan new snapshot test	2023-06-17 10:31:29 +02:00
Charlie Marsh	5ea3e42513	Always use identifier ranges to store bindings (#5110 ) ## Summary At present, when we store a binding, we include a `TextRange` alongside it. The `TextRange` _sometimes_ matches the exact range of the identifier to which the `Binding` is linked, but... not always. For example, given: ```python x = 1 ``` The binding we create _will_ use the range of `x`, because the left-hand side is an `Expr::Name`, which has a valid range on it. However, given: ```python try: pass except ValueError as e: pass ``` When we create a binding for `e`, we don't have a `TextRange`... The AST doesn't give us one. So we end up extracting it via lexing. This PR extends that pattern to the rest of the binding kinds, to ensure that whenever we create a binding, we always use the range of the bound name. This leads to better diagnostics in cases like pattern matching, whereby the diagnostic for "unused variable `x`" here used to include `x`, instead of just `x`: ```python def f(provided: int) -> int: match provided: case [_, x]: pass ``` This is _also_ required for symbol renames, since we track writes as bindings -- so we need to know the ranges of the bound symbols. By storing these bindings precisely, we can also remove the `binding.trimmed_range` abstraction -- since bindings already use the "trimmed range". To implement this behavior, I took some of our existing utilities (like the code we had for `except ValueError as e` above), migrated them from a full lexer to a zero-allocation lexer that _only_ identifies "identifiers", and moved the behavior into a trait, so we can now do `stmt.identifier(locator)` to get the range for the identifier. Honestly, we might end up discarding much of this if we decide to put ranges on all identifiers (https://github.com/astral-sh/RustPython-Parser/pull/8). But even if we do, this will _still_ be a good change, because the lexer introduced here is useful beyond names (e.g., we use it find the `except` keyword in an exception handler, to find the `else` after a `for` loop, and so on). So, I'm fine committing this even if we end up changing our minds about the right approach. Closes #5090. ## Benchmarks No significant change, with one statistically significant improvement (-2.1654% on `linter/all-rules/large/dataset.py`): ``` linter/default-rules/numpy/globals.py time: [73.922 µs 73.955 µs 73.986 µs] thrpt: [39.882 MiB/s 39.898 MiB/s 39.916 MiB/s] change: time: [-0.5579% -0.4732% -0.3980%] (p = 0.00 < 0.05) thrpt: [+0.3996% +0.4755% +0.5611%] Change within noise threshold. Found 6 outliers among 100 measurements (6.00%) 4 (4.00%) low severe 1 (1.00%) low mild 1 (1.00%) high mild linter/default-rules/pydantic/types.py time: [1.4909 ms 1.4917 ms 1.4926 ms] thrpt: [17.087 MiB/s 17.096 MiB/s 17.106 MiB/s] change: time: [+0.2140% +0.2741% +0.3392%] (p = 0.00 < 0.05) thrpt: [-0.3380% -0.2734% -0.2136%] Change within noise threshold. Found 4 outliers among 100 measurements (4.00%) 3 (3.00%) high mild 1 (1.00%) high severe linter/default-rules/numpy/ctypeslib.py time: [688.97 µs 691.34 µs 694.15 µs] thrpt: [23.988 MiB/s 24.085 MiB/s 24.168 MiB/s] change: time: [-1.3282% -0.7298% -0.1466%] (p = 0.02 < 0.05) thrpt: [+0.1468% +0.7351% +1.3461%] Change within noise threshold. Found 15 outliers among 100 measurements (15.00%) 1 (1.00%) low mild 2 (2.00%) high mild 12 (12.00%) high severe linter/default-rules/large/dataset.py time: [3.3872 ms 3.4032 ms 3.4191 ms] thrpt: [11.899 MiB/s 11.954 MiB/s 12.011 MiB/s] change: time: [-0.6427% -0.2635% +0.0906%] (p = 0.17 > 0.05) thrpt: [-0.0905% +0.2642% +0.6469%] No change in performance detected. Found 20 outliers among 100 measurements (20.00%) 1 (1.00%) low severe 2 (2.00%) low mild 4 (4.00%) high mild 13 (13.00%) high severe linter/all-rules/numpy/globals.py time: [148.99 µs 149.21 µs 149.42 µs] thrpt: [19.748 MiB/s 19.776 MiB/s 19.805 MiB/s] change: time: [-0.7340% -0.5068% -0.2778%] (p = 0.00 < 0.05) thrpt: [+0.2785% +0.5094% +0.7395%] Change within noise threshold. Found 2 outliers among 100 measurements (2.00%) 1 (1.00%) low mild 1 (1.00%) high severe linter/all-rules/pydantic/types.py time: [3.0362 ms 3.0396 ms 3.0441 ms] thrpt: [8.3779 MiB/s 8.3903 MiB/s 8.3997 MiB/s] change: time: [-0.0957% +0.0618% +0.2125%] (p = 0.45 > 0.05) thrpt: [-0.2121% -0.0618% +0.0958%] No change in performance detected. Found 11 outliers among 100 measurements (11.00%) 1 (1.00%) low severe 3 (3.00%) low mild 5 (5.00%) high mild 2 (2.00%) high severe linter/all-rules/numpy/ctypeslib.py time: [1.6879 ms 1.6894 ms 1.6909 ms] thrpt: [9.8478 MiB/s 9.8562 MiB/s 9.8652 MiB/s] change: time: [-0.2279% -0.0888% +0.0436%] (p = 0.18 > 0.05) thrpt: [-0.0435% +0.0889% +0.2284%] No change in performance detected. Found 5 outliers among 100 measurements (5.00%) 4 (4.00%) low mild 1 (1.00%) high severe linter/all-rules/large/dataset.py time: [7.1520 ms 7.1586 ms 7.1654 ms] thrpt: [5.6777 MiB/s 5.6831 MiB/s 5.6883 MiB/s] change: time: [-2.5626% -2.1654% -1.7780%] (p = 0.00 < 0.05) thrpt: [+1.8102% +2.2133% +2.6300%] Performance has improved. Found 2 outliers among 100 measurements (2.00%) 1 (1.00%) low mild 1 (1.00%) high mild ```	2023-06-15 18:43:19 +00:00
konstin	66089e1a2e	Fix a number of formatter errors from the cpython repository (#5089 ) ## Summary This fixes a number of problems in the formatter that showed up with various files in the [cpython](https://github.com/python/cpython) repository. These problems surfaced as unstable formatting and invalid code. This is not the entirety of problems discovered through cpython, but a big enough chunk to separate it. Individual fixes are generally individual commits. They were discovered with #5055, which i update as i work through the output ## Test Plan I added regression tests with links to cpython for each entry, except for the two stubs that also got comment stubs since they'll be implemented properly later.	2023-06-15 11:24:14 +00:00
Charlie Marsh	716cab2f19	Run `rustfmt` on nightly to clean up erroneous comments (#5106 ) ## Summary This PR runs `rustfmt` with a few nightly options as a one-time fix to catch some malformatted comments. I ended up just running with: ```toml condense_wildcard_suffixes = true edition = "2021" max_width = 100 normalize_comments = true normalize_doc_attributes = true reorder_impl_items = true unstable_features = true use_field_init_shorthand = true ``` Since these all seem like reasonable things to fix, so may as well while I'm here.	2023-06-15 00:19:05 +00:00
konstin	95ee6dcb3b	Add contributor docs to formatter (#5023 ) I've written done my condensed learnings from working on the formatter so that others can have an easier start working on it. This is a pure docs change	2023-06-13 07:22:17 +00:00
Charlie Marsh	cc44349401	Use dedicated structs in `comparable.rs` (#5042 ) ## Summary Updating to match the updated AST structure, for consistency.	2023-06-13 03:57:34 +00:00
konstin	e586c27590	Format ExprTuple (#4963 ) This implements formatting ExprTuple, including magic trailing comma. I intentionally didn't change the settings mechanism but just added a dummy global const flag. Besides the snapshots, I added custom breaking/joining tests and a deeply nested test case. The diffs look better than previously, proper black compatibility depends on parentheses handling. --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-06-12 12:55:47 +00:00
Charlie Marsh	68b6d30c46	Use consistent `Cargo.toml` metadata in all crates (#5015 )	2023-06-12 00:02:40 +00:00
Charlie Marsh	f401050878	Introduce `PythonWhitespace` to confine trim operations to Python whitespace (#4994 ) ## Summary We use `.trim()` and friends in a bunch of places, to strip whitespace from source code. However, not all Unicode whitespace characters are considered "whitespace" in Python, which only supports the standard space, tab, and form-feed characters. This PR audits our usages of `.trim()`, `.trim_start()`, `.trim_end()`, and `char::is_whitespace`, and replaces them as appropriate with a new `.trim_whitespace()` analogues, powered by a `PythonWhitespace` trait. In general, the only place that should continue to use `.trim()` is content within docstrings, which don't need to adhere to Python's semantic definitions of whitespace. Closes #4991.	2023-06-09 21:44:50 -04:00
Charlie Marsh	1d756dc3a7	Move Python whitespace utilities into new `ruff_python_whitespace` crate (#4993 ) ## Summary `ruff_newlines` becomes `ruff_python_whitespace`, and includes the existing "universal newline" handlers alongside the Python whitespace-specific utilities.	2023-06-10 00:59:57 +00:00
Micha Reiser	111e1f93ca	perf(formatter): Skip bodies without comments (#4978 )	2023-06-09 11:33:57 +02:00
Micha Reiser	68d52da43b	Track formatted comments (#4979 )	2023-06-09 09:09:45 +00:00
Micha Reiser	646ab64850	Fix binary expression formatting with leading comments (#4964 )	2023-06-09 09:02:50 +00:00
Micha Reiser	1accbeffd6	Format `if` statements (#4961 )	2023-06-09 10:55:14 +02:00
Micha Reiser	68969240c5	Format Function definitions (#4951 )	2023-06-08 16:07:33 +00:00
Micha Reiser	9c3fb23ace	Simple lexer for formatter (#4922 )	2023-06-08 17:37:39 +02:00
konstin	467df23e65	Implement StmtReturn (#4960 ) * Implement StmtPass This implements StmtPass as `pass`. The snapshot diff is small because pass mainly occurs in bodies and function (#4951) and if/for bodies. * Implement StmtReturn This implements StmtReturn as `return` or `return {value}`. The snapshot diff is small because return occurs in functions (#4951)	2023-06-08 16:29:39 +02:00
konstin	c8442e91ce	Implement StmtPass (#4959 ) This implements StmtPass as `pass`. The snapshot diff is small because pass mainly occurs in bodies and function (#4951) and if/for bodies.	2023-06-08 16:29:27 +02:00
Micha Reiser	6bef347a8e	Trailing own line comments before func or class (#4921 )	2023-06-08 12:50:25 +00:00
Micha Reiser	c1cc6f3be1	Add basic Constant formatting (#4954 )	2023-06-08 11:42:44 +00:00
Micha Reiser	83cf6d6e2f	Implement Binary expression without `best_fitting` (#4952 )	2023-06-08 12:45:03 +02:00
konstin	23abad0bd5	A basic StmtAssign formatter and better dummies for expressions (#4938 ) * A basic StmtAssign formatter and better dummies for expressions The goal of this PR was formatting StmtAssign since many nodes in the black tests (and in python in general) are after an assignment. This caused unstable formatting: The spacing of power op spacing depends on the type of the two involved expressions, but each expression was formatted as dummy string and re-parsed as a ExprName, so in the second round the different rules of ExprName were applied, causing unstable formatting. This PR does not necessarily bring us closer to black's style, but it unlocks a good porting of black's test suite and is a basis for implementing the Expr nodes. * fmt * Review	2023-06-08 12:20:25 +02:00
Micha Reiser	39a1f3980f	Upgrade RustPython (#4900 )	2023-06-08 05:53:14 +00:00
Micha Reiser	bcf745c5ba	Replace verbatim text with `NOT_YET_IMPLEMENTED` (#4904 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR replaces the `verbatim_text` builder with a `not_yet_implemented` builder that emits `NOT_YET_IMPLEMENTED_<NodeKind>` for not yet implemented nodes. The motivation for this change is that partially formatting compound statements can result in incorrectly indented code, which is a syntax error: ```python def func_no_args(): a; b; c if True: raise RuntimeError if False: ... for i in range(10): print(i) continue ``` Get's reformatted to ```python def func_no_args(): a; b; c if True: raise RuntimeError if False: ... for i in range(10): print(i) continue ``` because our formatter does not yet support `for` statements and just inserts the text from the source. ## Downsides Using an identifier will not work in all situations. For example, an identifier is invalid in an `Arguments ` position. That's why I kept `verbatim_text` around and e.g. use it in the `Arguments` formatting logic where incorrect indentations are impossible (to my knowledge). Meaning, `verbatim_text` we can opt in to `verbatim_text` when we want to iterate quickly on nodes that we don't want to provide a full implementation yet and using an identifier would be invalid. ## Upsides Running this on main discovered stability issues with the newline handling that were previously "hidden" because of the verbatim formatting. I guess that's an upside :) ## Test Plan None?	2023-06-07 14:57:25 +02:00
Micha Reiser	6ab3fc60f4	Correctly handle newlines after/before comments (#4895 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This issue fixes the removal of empty lines between a leading comment and the previous statement: ```python a = 20 # leading comment b = 10 ``` Ruff removed the empty line between `a` and `b` because: * The leading comments formatting does not preserve leading newlines (to avoid adding new lines at the top of a body) * The `JoinNodesBuilder` counted the lines before `b`, which is 1 -> Doesn't insert a new line This is fixed by changing the `JoinNodesBuilder` to count the lines instead after the last node. This correctly gives 1, and the `# leading comment` will insert the empty lines between any other leading comment or the node. ## Test Plan I added a new test for empty lines.	2023-06-07 14:49:43 +02:00
Micha Reiser	3f032cf09d	Format binary expressions (#4862 ) * Format Binary Expressions * Extract NeedsParentheses trait	2023-06-06 08:34:53 +00:00
Micha Reiser	913b9d1fcf	Normalize newlines in `verbatim_text` (#4850 )	2023-06-05 19:30:28 +00:00
Micha Reiser	33434fcb9c	Add Formatter benchmark (#4860 )	2023-06-05 21:05:42 +02:00
konstin	209aaa5add	Ensure type_ignores for Module are empty (#4861 ) According to https://docs.python.org/3/library/ast.html#ast-helpers, we expect type_ignores to be always be empty, so this adds a debug assert. Test plan: I confirmed that the assertion holdes for the file below and for all the black tests which include a number of `type: ignore` comments. ```python # type: ignore if 1: print("1") # type: ignore # elsebranch # type: ignore else: # type: ignore print("2") # type: ignore while 1: print() # type: ignore ```	2023-06-05 11:38:08 +02:00
konstin	ff37d7af23	Implement module formatting using JoinNodesBuilder (#4808 ) * Implement module formatting using JoinNodesBuilder This uses JoinNodesBuilder to implement module formatting for #4800 See the snapshots for the changed behaviour. See one PR up for a CLI that i used to verify the trailing new line behaviour	2023-06-05 08:35:05 +00:00
Micha Reiser	c65f47d7c4	Format `while` Statement (#4810 )	2023-06-05 08:24:00 +00:00
konstin	d1d06960f0	Add a formatter CLI for debugging (#4809 ) * Add a formatter CLI for debugging This adds a ruff_python_formatter cli modelled aber `rustfmt` that i use for debugging * clippy * Add print IR and print comments options Tested with `cargo run --bin ruff_python_formatter -- --print-ir --print-comments scratch.py`	2023-06-05 07:33:33 +00:00
Micha Reiser	2c41c54e0c	Format `ExprName` (#4803 )	2023-06-03 16:06:14 +02:00
Micha Reiser	d6daa61563	Handle trailing end-of-line comments in-between-bodies (#4812 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary And more custom logic around comments in bodies... uff. Let's say we have the following code ```python if x == y: pass # trailing comment of pass else: # trailing comment of `else` print("I have no comments") ``` Right now, the formatter attaches the `# trailing comment of `else` as a trailing comment of `pass` because it doesn't "see" that there's an `else` keyword in between (because the else body is just a Vec and not a node). This PR adds custom logic that attaches the trailing comments after the `else` as dangling comments to the `if` statement. The if statement must then split the dangling comments by `comments.text_position()`: * All comments up to the first end-of-line comment are leading comments of the `else` keyword. * All end-of-line comments coming after are `trailing` comments for the `else` keyword. ## Test Plan I added new unit tests.	2023-06-03 15:29:22 +02:00
Micha Reiser	cb6788ab5f	Handle trailing body end-of-line comments (#4811 ) ### Summary This PR adds custom logic to handle end-of-line comments of the last statement in a body. For example: ```python while True: if something.changed: do.stuff() # trailing comment b ``` The `# trailing comment` is a trailing comment of the `do.stuff()` expression statement. We incorrectly attached the comment as a trailing comment of the enclosing `while` statement because the comment is between the end of the while statement (the `while` statement ends right after `do.stuff()`) and before the `b` statement. This PR fixes the placement to correctly attach these comments to the last statement in a body (recursively). ## Test Plan I reviewed the snapshots and they now look correct. This may appear odd because a lot comments have now disappeared. This is the expected result because we use `verbatim` formatting for the block statements (like `while`) and that means that it only formats the inner content of the block, but not any trailing comments. The comments were visible before, because they were associated with the block statement (e.g. `while`).	2023-06-03 15:17:33 +02:00
Micha Reiser	ebdc4afc33	Suite formatting and `JoinNodesBuilder` (#4805 )	2023-06-02 14:14:38 +00:00
Micha Reiser	a401989b7a	Format StmtExpr (#4788 )	2023-06-02 12:52:38 +00:00
Micha Reiser	4cd4b37e74	Format the comment content (#4786 )	2023-06-02 11:22:34 +00:00
konstin	c4fdbf8903	Switch PyFormatter lifetimes (#4804 ) Stylistic change to have the input lifetime first and the output lifetime second. I'll rebase my other PR on top of this. Test plan: `cargo clippy`	2023-06-02 12:26:39 +02:00
Micha Reiser	5d939222db	Leading, Dangling, and Trailing comments formatting (#4785 )	2023-06-02 09:26:36 +02:00
konstin	63d892f1e4	Implement basic module formatting (#4784 ) * Add Format for Stmt * Implement basic module formatting This implements formatting each statement in a module with a hard line break in between, so that we can start formatting statements. Basic testing is done by the snapshots	2023-06-01 15:25:50 +02:00
Micha Reiser	4ea4fd1984	Introduce `lines_before` helper (#4780 )	2023-06-01 11:56:43 +02:00
konstin	d4027d8b65	Use new formatter infrastructure in CLI and test (#4767 ) * Use dummy verbatim formatter for all nodes * Use new formatter infrastructure in CLI and test * Expose the new formatter in the CLI * Merge import blocks	2023-06-01 11:55:04 +02:00
konstin	9bf168c0a4	Use dummy verbatim formatter for all nodes (#4755 )	2023-06-01 08:25:26 +00:00
Micha Reiser	59148344be	Place comments of left and right binary expression operands (#4751 )	2023-06-01 07:01:32 +00:00
konstin	0945803427	Generate FormatRule definitions (#4724 ) * Generate FormatRule definitions * Generate verbatim output * pub(crate) everything * clippy fix * Update crates/ruff_python_formatter/src/lib.rs Co-authored-by: Micha Reiser <micha@reiser.io> * Update crates/ruff_python_formatter/src/lib.rs Co-authored-by: Micha Reiser <micha@reiser.io> * stub out with Ok(()) again * Update crates/ruff_python_formatter/src/lib.rs Co-authored-by: Micha Reiser <micha@reiser.io> * PyFormatContext::{contents, locator} with `#[allow(unused)]` * Can't leak private type * remove commented code * Fix ruff errors * pub struct Format{node} due to rust rules --------- Co-authored-by: Julian LaNeve <lanevejulian@gmail.com> Co-authored-by: Micha Reiser <micha@reiser.io>	2023-06-01 08:38:53 +02:00
Micha Reiser	b7294b48e7	Handle positional-only-arguments separator comments (#4748 )	2023-06-01 06:22:49 +00:00
Micha Reiser	be31d71849	Correctly associate own-line comments in bodies (#4671 )	2023-06-01 08:12:53 +02:00
Charlie Marsh	9d0ffd33ca	Move universal newline handling into its own crate (#4729 )	2023-05-31 12:00:47 -04:00
Micha Reiser	e209b5fc5f	Add reformat check (#4753 )	2023-05-31 17:36:15 +02:00
Micha Reiser	6c1ff6a85f	Upgrade RustPython (#4747 )	2023-05-31 08:26:35 +00:00
Micha Reiser	06bcb85f81	formatter: Remove CST and old formatting (#4730 )	2023-05-31 08:27:23 +02:00
Micha Reiser	0cd453bdf0	Generic "comment to node" association logic (#4642 )	2023-05-30 09:28:01 +00:00
Micha Reiser	84a5584888	Add `Comments` data structure (#4641 )	2023-05-30 08:54:55 +00:00
Micha Reiser	6146b75dd0	Add `MultiMap` implementation for storing comments (#4639 )	2023-05-30 09:51:25 +02:00
Micha Reiser	edc6c4058f	Move `shared_traits` to `ruff_formatter` (#4632 )	2023-05-24 17:38:11 +02:00
Micha Reiser	86ced3516b	Introduce `SourceCodeSlice` to reduce the size of `FormatElement` (#4622 ) Introduce `SourceCodeSlice` to reduce the size of `FormatElement`	2023-05-24 15:04:52 +00:00
Micha Reiser	6943beee66	Remove source position from `FormatElement::DynamicText` (#4619 )	2023-05-24 16:36:14 +02:00
Micha Reiser	daadd24bde	Include decorators in `Function` and `Class` definition ranges (#4467 )	2023-05-22 17:50:42 +02:00
Charlie Marsh	e8e66f3824	Remove unnecessary path prefixes (#4492 )	2023-05-18 10:19:09 -04:00
Micha Reiser	ddf7de7e86	Prototype Black's string joining/splitting (#4449 )	2023-05-16 18:42:40 +01:00
Jeong, YunWon	4b05ca1198	Specialize ConversionFlag (#4450 )	2023-05-16 18:00:13 +02:00
Charlie Marsh	f0465bf106	Emit non-logical newlines for "empty" lines (#4444 )	2023-05-16 14:58:56 +00:00
Micha Reiser	fa26860296	Refactor range from `Attributed` to `Node`s (#4422 )	2023-05-16 06:36:32 +00:00
Jonathan Plasse	c10a4535b9	Disallow `unreachable_pub` (#4314 )	2023-05-11 18:00:00 -04:00
Micha Reiser	1ccef5150d	Remove lifetime from FormatContext (#4376 )	2023-05-11 15:43:42 +00:00
Jeong, YunWon	be6e00ef6e	Re-integrate RustPython parser repository (#4359 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-05-11 07:47:17 +00:00
Calum Young	f0f4bf2929	Move typos to pre-commit config (#4148 )	2023-04-29 12:13:35 -04:00
Micha Reiser	cab65b25da	Replace row/column based `Location` with byte-offsets. (#3931 )	2023-04-26 18:11:02 +00:00
Micha Reiser	381203c084	Store source code on message (#3897 )	2023-04-11 07:57:36 +00:00
Micha Reiser	76c47a9a43	Cheap cloneable LineIndex (#3896 )	2023-04-11 07:33:40 +00:00
Charlie Marsh	d919adc13c	Introduce a `ruff_python_semantic` crate (#3865 )	2023-04-04 16:50:47 +00:00
Charlie Marsh	cf7e1ddd08	Remove some `usize` references (#3819 )	2023-03-30 17:35:42 -04:00
Charlie Marsh	c2750a59ab	Implement an iterator for universal newlines (#3454 ) # Summary We need to support CR line endings (as opposed to LF and CRLF line endings, which are already supported). They're rare, but they do appear in Python code, and we tend to panic on any file that uses them. Our `Locator` abstraction now supports CR line endings. However, Rust's `str#lines` implementation does _not_. This PR adds a `UniversalNewlineIterator` implementation that respects all of CR, LF, and CRLF line endings, and plugs it into most of the `.lines()` call sites. As an alternative design, it could be nice if we could leverage `Locator` for this. We've already computed all of the line endings, so we could probably iterate much more efficiently? # Test Plan Largely relying on automated testing, however, also ran over some known failure cases, like #3404.	2023-03-13 00:01:29 -04:00
Charlie Marsh	da1f83fe32	Remove `core` module from `ruff_python_formatter` (#3373 )	2023-03-08 19:11:39 +00:00
Charlie Marsh	0a9d259f9c	Remove copied `core` modules from `ruff_python_formatter` (#3371 )	2023-03-08 19:03:40 +00:00
Charlie Marsh	130e733023	Implement `From<Located>` for `Range` (#3377 )	2023-03-08 18:50:20 +00:00
Charlie Marsh	ff2c0dd491	Use shared `leading_quote` implementation in ruff_python_formatter (#3396 )	2023-03-08 18:21:59 +00:00
Charlie Marsh	d1c48016eb	Rename `ruff_python` crate to `ruff_python_stdlib` (#3354 ) In hindsight, `ruff_python` is too general. A good giveaway is that it's actually a prefix of some other crates. The intent of this crate is to reimplement pieces of the Python standard library and CPython itself, so `ruff_python_stdlib` feels appropriate.	2023-03-06 13:43:22 +00:00
Jonathan Plasse	8828e12283	Bump dependencies and move more shared dependencies into workspace (#3340 )	2023-03-04 12:36:26 -05:00
Charlie Marsh	f5f09b489b	Introduce dedicated CST tokens for other operator kinds (#3267 )	2023-02-27 23:54:57 -05:00
Charlie Marsh	061495a9eb	Make BoolOp its own located token (#3265 )	2023-02-28 03:43:28 +00:00
Charlie Marsh	470e1c1754	Preserve comments on non-defaulted arguments (#3264 )	2023-02-27 23:41:40 +00:00
Charlie Marsh	16be691712	Enable more non-panicking formatter tests (#3262 )	2023-02-27 18:21:53 -05:00
Charlie Marsh	2261e194a0	Create dedicated `Body` nodes in the formatter CST (#3223 )	2023-02-27 22:55:05 +00:00
Charlie Marsh	1c75071136	Implement basic rendering of remaining AST nodes (#3233 )	2023-02-26 05:05:56 +00:00
Charlie Marsh	51bca19c1d	Add builders for common comment rendering (#3232 )	2023-02-26 04:16:24 +00:00
Jeong YunWon	84e96cdcd9	More enum work (#3212 )	2023-02-25 11:40:16 -05:00
Charlie Marsh	159422071e	Handle end-of-line comments on `excepthandler` and `alias` (#3196 )	2023-02-23 22:35:39 -05:00
Charlie Marsh	6eaacf96be	Introduce a new CST element for slice segments (#3195 )	2023-02-24 00:49:41 +00:00
Charlie Marsh	eb15371453	Make Locator available in AST-to-CST conversion pass (#3194 )	2023-02-23 19:43:03 -05:00
Charlie Marsh	bda2a0007a	Parenthesize numbers during attribute accesses (#3189 )	2023-02-23 14:57:23 -05:00
Charlie Marsh	32d165b7ad	Implement complex literal formatting (#3186 )	2023-02-23 19:09:33 +00:00
Charlie Marsh	ac79bf4ee9	Implement float literal formatting (#3184 )	2023-02-23 14:02:23 -05:00
Charlie Marsh	376eab3a53	Implement integer literal formatting (#3183 )	2023-02-23 18:31:56 +00:00
Charlie Marsh	08be7bd285	Add a TODO to string_literal (#3181 )	2023-02-23 12:46:20 -05:00
Charlie Marsh	1e7233a8eb	Add support for reformatting byte strings (#3176 )	2023-02-23 16:50:24 +00:00
Charlie Marsh	f967f344fc	Add support for basic `Constant::Str` formatting (#3173 ) This PR enables us to apply the proper quotation marks, including support for escapes. There are some significant TODOs, especially around implicit concatenations like: ```py ( "abc" "def" ) ``` Which are represented as a single AST node, which requires us to tokenize _within_ the formatter to identify all the individual string parts.	2023-02-23 16:23:10 +00:00
Charlie Marsh	095f005bf4	Move RustPython vendored and helper code into its own crate (#3171 )	2023-02-23 14:14:16 +00:00
Charlie Marsh	e5c1f95545	Check-in updated snapshot (#3161 )	2023-02-23 03:42:27 +00:00
Charlie Marsh	227ff62a4e	Don't touch tuple brackets after `in` (#3160 )	2023-02-23 03:10:24 +00:00
Charlie Marsh	d8e4902516	Un-modify `tupleassign` and `function2` tests (#3158 ) I manually changed these in #3080 and #3083 to get the tests passing (with notes around the deviations) -- but that's no longer necessary, now that we have proper testing that takes deviations into account.	2023-02-23 02:37:25 +00:00
Charlie Marsh	5fd827545b	Add a trailing newline to all .py.expect files (#3156 ) This just re-formats all the `.py.expect` files with Black, both to add a trailing newline and be doubly-certain that they're correctly formatted. I also ensured that we add a hard line break after each statement, and that we avoid including an extra newline in the generated Markdown (since the code should contain the exact expected newlines).	2023-02-23 02:29:27 +00:00
Charlie Marsh	2f9de335db	Upgrade RustPython to match new flattened exports (#3141 )	2023-02-22 19:36:13 +00:00
Charlie Marsh	1efa2e07ad	Avoid match statement misidentification in token rules (#3129 )	2023-02-22 15:44:45 +00:00
Micha Reiser	ffd8e958fc	chore: Upgrade Rust to 1.67.0 (#3125 )	2023-02-22 10:03:17 -05:00
Micha Reiser	ed33b75bad	test(ruff_python_formatter): Run all Black tests (#2993 ) This PR changes the testing infrastructure to run all black tests and: * Pass if Ruff and Black generate the same formatting * Fail and write a markdown snapshot that shows the input code, the differences between Black and Ruff, Ruffs output, and Blacks output This is achieved by introducing a new `fixture` macro (open to better name suggestions) that "duplicates" the attributed test for every file that matches the specified glob pattern. Creating a new test for each file over having a test that iterates over all files has the advantage that you can run a single test, and that test failures indicate which case is failing. The `fixture` macro also makes it straightforward to e.g. setup our own spec tests that test very specific formatting by creating a new folder and use insta to assert the formatted output.	2023-02-22 09:25:06 -05:00
Charlie Marsh	cdc4e86158	Add support for TryStar (#3089 )	2023-02-21 13:42:20 -05:00
Charlie Marsh	a6eb60cdd5	Enable `function2` test (#3083 )	2023-02-21 04:37:50 +00:00
Charlie Marsh	90c04b9cff	Enable `tupleassign` test (#3080 )	2023-02-21 00:42:23 +00:00
Charlie Marsh	b701cca779	Enable some already-passing Black tests (#3079 )	2023-02-21 00:10:35 +00:00
Charlie Marsh	ce8953442d	Add support for trailing colons in slice expressions (#3077 )	2023-02-20 23:24:32 +00:00
Charlie Marsh	6e02405bd6	Add `StmtKind::Try`; fix trailing newlines (#3074 )	2023-02-20 22:55:32 +00:00
Jeong YunWon	35606d7b05	clean up to fix nightly clippy warnings and dedents (#3057 )	2023-02-20 09:33:47 -05:00
Charlie Marsh	c297d46899	Remove unused `AsFormat` trait for `Option<T>` (#3041 ) We should re-add this, but it's currently unused and doesn't compile under 1.66.0. See: #3039.	2023-02-19 20:19:35 +00:00
Jonathan Plasse	b75663be6d	Add missing rust-version in crates (#3009 )	2023-02-19 15:07:17 +00:00
Charlie Marsh	180541a924	Unify comment terminology with that of `rome_formatter` (#2979 )	2023-02-17 03:02:25 +00:00
Charlie Marsh	6088a36cd3	Use `line_suffix` for end-of-line comments (#2975 )	2023-02-16 18:37:40 -05:00
Charlie Marsh	5157f584ab	Improve pow operator spacing (#2970 ) Ensure that we add spaces to expressions like `foo.bar() ** 2`.	2023-02-16 15:17:32 -05:00
Charlie Marsh	1c01ec21cb	Regenerate expected Black snapshots (#2968 )	2023-02-16 19:39:17 +00:00
Charlie Marsh	cb971d3a48	Respect self as positional-only argument in annotation rules (#2927 )	2023-02-15 15:25:17 +00:00
Charlie Marsh	57a5071b4e	Rename some methods on `Locator` (#2926 )	2023-02-15 10:21:49 -05:00
Charlie Marsh	ca49b00e55	Add initial formatter implementation (#2883 ) # Summary This PR contains the code for the autoformatter proof-of-concept. ## Crate structure The primary formatting hook is the `fmt` function in `crates/ruff_python_formatter/src/lib.rs`. The current formatter approach is outlined in `crates/ruff_python_formatter/src/lib.rs`, and is structured as follows: - Tokenize the code using the RustPython lexer. - In `crates/ruff_python_formatter/src/trivia.rs`, extract a variety of trivia tokens from the token stream. These include comments, trailing commas, and empty lines. - Generate the AST via the RustPython parser. - In `crates/ruff_python_formatter/src/cst.rs`, convert the AST to a CST structure. As of now, the CST is nearly identical to the AST, except that every node gets a `trivia` vector. But we might want to modify it further. - In `crates/ruff_python_formatter/src/attachment.rs`, attach each trivia token to the corresponding CST node. The logic for this is mostly in `decorate_trivia` and is ported almost directly from Prettier (given each token, find its preceding, following, and enclosing nodes, then attach the token to the appropriate node in a second pass). - In `crates/ruff_python_formatter/src/newlines.rs`, normalize newlines to match Black’s preferences. This involves traversing the CST and inserting or removing `TriviaToken` values as we go. - Call `format!` on the CST, which delegates to type-specific formatter implementations (e.g., `crates/ruff_python_formatter/src/format/stmt.rs` for `Stmt` nodes, and similar for `Expr` nodes; the others are trivial). Those type-specific implementations delegate to kind-specific functions (e.g., `format_func_def`). ## Testing and iteration The formatter is being developed against the Black test suite, which was copied over in-full to `crates/ruff_python_formatter/resources/test/fixtures/black`. The Black fixtures had to be modified to create `[insta](https://github.com/mitsuhiko/insta)`-compatible snapshots, which now exist in the repo. My approach thus far has been to try and improve coverage by tackling fixtures one-by-one. ## What works, and what doesn’t - Most nodes are supported at a basic level (though there are a few stragglers at time of writing, like `StmtKind::Try`). - Newlines are properly preserved in most cases. - Magic trailing commas are properly preserved in some (but not all) cases. - Trivial leading and trailing standalone comments mostly work (although maybe not at the end of a file). - Inline comments, and comments within expressions, often don’t work -- they work in a few cases, but it’s one-off right now. (We’re probably associating them with the “right” nodes more often than we are actually rendering them in the right place.) - We don’t properly normalize string quotes. (At present, we just repeat any constants verbatim.) - We’re mishandling a bunch of wrapping cases (if we treat Black as the reference implementation). Here are a few examples (demonstrating Black's stable behavior): ```py # In some cases, if the end expression is "self-closing" (functions, # lists, dictionaries, sets, subscript accesses, and any length-two # boolean operations that end in these elments), Black # will wrap like this... if some_expression and f( b, c, d, ): pass # ...whereas we do this: if ( some_expression and f( b, c, d, ) ): pass # If function arguments can fit on a single line, then Black will # format them like this, rather than exploding them vertically. if f( a, b, c, d, e, f, g, ... ): pass ``` - We don’t properly preserve parentheses in all cases. Black preserves parentheses in some but not all cases.	2023-02-15 04:06:35 +00:00

... 9 10 11 12 13 ...

839 Commits