Python/ruff - ruff - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
David Szotten	1eccbbb60e	Format StmtFor (#5163 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary format StmtFor still trying to learn how to help out with the formatter. trying something slightly more advanced than [break](#5158) mostly copied form StmtWhile ## Test Plan snapshots	2023-06-21 23:00:31 +02:00
konstin	9419d3f9c8	Special `ExprTuple` formatting option for `for`-loops (#5175 ) ## Motivation While black keeps parentheses nearly everywhere, the notable exception is in the body of for loops: ```python for (a, b) in x: pass ``` becomes ```python for a, b in x: pass ``` This currently blocks #5163, which this PR should unblock. ## Solution This changes the `ExprTuple` formatting option to include one additional option that removes the parentheses when not using magic trailing comma and not breaking. It is supposed to be used through ```rust #[derive(Debug)] struct ExprTupleWithoutParentheses<'a>(&'a Expr); impl Format<PyFormatContext<'_>> for ExprTupleWithoutParentheses<'_> { fn fmt(&self, f: &mut Formatter<PyFormatContext<'_>>) -> FormatResult<()> { match self.0 { Expr::Tuple(expr_tuple) => expr_tuple .format() .with_options(TupleParentheses::StripInsideForLoop) .fmt(f), other => other.format().with_options(Parenthesize::IfBreaks).fmt(f), } } } ``` ## Testing The for loop formatting isn't merged due to missing this (and i didn't want to create more git weirdness across two people), but I've confirmed that when applying this to while loops instead of for loops, then ```rust write!( f, [ text("while"), space(), ExprTupleWithoutParentheses(test.as_ref()), text(":"), trailing_comments(trailing_condition_comments), block_indent(&body.format()) ] )?; ``` makes ```python while (a, b): pass while ( ajssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssa, b, ): pass while (a,b,): pass ``` formatted as ```python while a, b: pass while ( ajssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssa, b, ): pass while ( a, b, ): pass ```	2023-06-21 21:17:47 +02:00
konstin	d7c7484618	Format function argument separator comments (#5211 ) ## Summary This is a complete rewrite of the handling of `/` and `*` comment handling in function signatures. The key problem is that slash and star don't have a note. We now parse out the positions of slash and star and their respective preceding and following note. I've left code comments for each possible case of function signature structure and comment placement ## Test Plan I extended the function statement fixtures with cases that i found. If you have more weird edge cases your input would be appreciated.	2023-06-21 17:56:47 +00:00
konstin	bc63cc9b3c	Fix remaining CPython formatter errors except for function argument separator comments (#5210 ) ## Summary This fixes two problems discovered when trying to format the cpython repo with `cargo run --bin ruff_dev -- check-formatter-stability projects/cpython`: The first is to ignore try/except trailing comments for now since they lead to unstable formatting on the dummy. The second is to avoid dropping trailing if comments through placement: This changes the placement to keep a comment trailing an if-elif or if-elif-else to keep the comment a trailing comment on the entire if. Previously the last comment would have been lost. ```python if "first if": pass elif "first elif": pass ``` The last remaining problem in cpython so far is function signature argument separator comment placement which is its own PR on top of this. ## Test Plan I added test fixtures of minimized examples with links back to the original cpython location	2023-06-21 19:45:53 +02:00
Micha Reiser	e47aa468d5	Format Identifier (#5255 )	2023-06-21 17:35:37 +02:00
konstin	6155fd647d	Format Slice Expressions (#5047 ) This formats slice expressions and subscript expressions. Spaces around the colons follows the same rules as black (https://black.readthedocs.io/en/stable/the_black_code_style/current_style.html#slices): ```python e00 = "e"[:] e01 = "e"[:1] e02 = "e"[: a()] e10 = "e"[1:] e11 = "e"[1:1] e12 = "e"[1 : a()] e20 = "e"[a() :] e21 = "e"[a() : 1] e22 = "e"[a() : a()] e200 = "e"[a() : :] e201 = "e"[a() :: 1] e202 = "e"[a() :: a()] e210 = "e"[a() : 1 :] ``` Comment placement is different due to our very different infrastructure. If we have explicit bounds (e.g. `x[1:2]`) all comments get assigned as leading or trailing to the bound expression. If a bound is missing `[:]`, comments get marked as dangling and placed in the same section as they were originally in: ```python x = "x"[ # a # b : # c # d ] ``` to ```python x = "x"[ # a # b : # c # d ] ``` Except for the potential trailing end-of-line comments, all comments get formatted on their own line. This can be improved by keeping end-of-line comments after the opening bracket or after a colon as such but the changes were already complex enough. I added tests for comment placement and spaces.	2023-06-21 15:09:39 +00:00
konstin	44156f6962	Improve debuggability of `place_comment` (#5209 ) ## Summary I found it hard to figure out which function decides placement for a specific comment. An explicit loop makes this easier to debug ## Test Plan There should be no functional changes, no changes to the formatting of the fixtures.	2023-06-21 09:52:13 +00:00
Micha Reiser	653dbb6d17	Format BoolOp (#4986 )	2023-06-21 09:27:57 +00:00
konstin	db301c14bd	Consistently name comment own line/end-of-line `line_position()` (#5215 ) ## Summary Previously, `DecoratedComment` used `text_position()` and `SourceComment` used `position()`. This PR unifies this to `line_position` everywhere. ## Test Plan This is a rename refactoring.	2023-06-21 11:04:56 +02:00
Micha Reiser	1336ca601b	Format `UnaryExpr` <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR adds basic formatting for unary expressions. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan I added a new `unary.py` with custom test cases	2023-06-21 10:09:47 +02:00
Micha Reiser	3973836420	Correctly handle left/right breaking of binary expression <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary Black supports for layouts when it comes to breaking binary expressions: ```rust #[derive(Copy, Clone, Debug, Eq, PartialEq)] enum BinaryLayout { /// Put each operand on their own line if either side expands Default, /// Try to expand the left to make it fit. Add parentheses if the left or right don't fit. /// ///```python /// [ /// a, /// b /// ] & c ///``` ExpandLeft, /// Try to expand the right to make it fix. Add parentheses if the left or right don't fit. /// /// ```python /// a & [ /// b, /// c /// ] /// ``` ExpandRight, /// Both the left and right side can be expanded. Try in the following order: /// * expand the right side /// * expand the left side /// * expand both sides /// /// to make the expression fit /// /// ```python /// [ /// a, /// b /// ] & [ /// c, /// d /// ] /// ``` ExpandRightThenLeft, } ``` Our current implementation only handles `ExpandRight` and `Default` correctly. This PR adds support for `ExpandRightThenLeft` and `ExpandLeft`. ## Test Plan I added tests that play through all 4 binary expression layouts.	2023-06-21 09:40:05 +02:00
Micha Reiser	e520a3a721	Fix ArgWithDefault comments handling (#5204 )	2023-06-20 20:48:07 +00:00
Micha Reiser	b369288833	Accept any `Into<AnyNodeRef>` as `Comments` arguments (#5205 )	2023-06-20 16:49:21 +00:00
Charlie Marsh	6331598511	Upgrade `RustPython` to access ranged names (#5194 ) ## Summary In https://github.com/astral-sh/RustPython-Parser/pull/8, we modified RustPython to include ranges for any identifiers that aren't `Expr::Name` (which already has an identifier). For example, the `e` in `except ValueError as e` was previously un-ranged. To extract its range, we had to do some lexing of our own. This change should improve performance and let us remove a bunch of code. ## Test Plan `cargo test`	2023-06-20 15:43:38 +00:00
David Szotten	773e79b481	basic formatting for ExprDict (#5167 )	2023-06-20 09:25:08 +00:00
Charlie Marsh	36e01ad6eb	Upgrade RustPython (#5192 ) ## Summary This PR upgrade RustPython to pull in the changes to `Arguments` (zip defaults with their identifiers) and all the renames to `CmpOp` and friends.	2023-06-19 21:09:53 +00:00
konstin	0e028142f4	Explain dangling comments in the formatter (#5170 ) This documentation change improves the section on dangling comments in the formatter. --------- Co-authored-by: David Szotten <davidszotten@gmail.com> Co-authored-by: Micha Reiser <micha@reiser.io>	2023-06-19 14:24:45 +02:00
Chris Pryer	195b36c429	Format `continue` statement (#5165 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary Format `continue` statement. ## Test Plan `continue` is used already in some tests, but if a new test is needed I could add it. --------- Co-authored-by: konstin <konstin@mailbox.org>	2023-06-18 11:25:59 +00:00
David Szotten	4b9b6829dc	format StmtBreak (#5158 ) ## Summary format `StmtBreak` trying to learn how to help out with the formatter. starting simple ## Test Plan new snapshot test	2023-06-17 10:31:29 +02:00
Charlie Marsh	5ea3e42513	Always use identifier ranges to store bindings (#5110 ) ## Summary At present, when we store a binding, we include a `TextRange` alongside it. The `TextRange` _sometimes_ matches the exact range of the identifier to which the `Binding` is linked, but... not always. For example, given: ```python x = 1 ``` The binding we create _will_ use the range of `x`, because the left-hand side is an `Expr::Name`, which has a valid range on it. However, given: ```python try: pass except ValueError as e: pass ``` When we create a binding for `e`, we don't have a `TextRange`... The AST doesn't give us one. So we end up extracting it via lexing. This PR extends that pattern to the rest of the binding kinds, to ensure that whenever we create a binding, we always use the range of the bound name. This leads to better diagnostics in cases like pattern matching, whereby the diagnostic for "unused variable `x`" here used to include `x`, instead of just `x`: ```python def f(provided: int) -> int: match provided: case [_, x]: pass ``` This is _also_ required for symbol renames, since we track writes as bindings -- so we need to know the ranges of the bound symbols. By storing these bindings precisely, we can also remove the `binding.trimmed_range` abstraction -- since bindings already use the "trimmed range". To implement this behavior, I took some of our existing utilities (like the code we had for `except ValueError as e` above), migrated them from a full lexer to a zero-allocation lexer that _only_ identifies "identifiers", and moved the behavior into a trait, so we can now do `stmt.identifier(locator)` to get the range for the identifier. Honestly, we might end up discarding much of this if we decide to put ranges on all identifiers (https://github.com/astral-sh/RustPython-Parser/pull/8). But even if we do, this will _still_ be a good change, because the lexer introduced here is useful beyond names (e.g., we use it find the `except` keyword in an exception handler, to find the `else` after a `for` loop, and so on). So, I'm fine committing this even if we end up changing our minds about the right approach. Closes #5090. ## Benchmarks No significant change, with one statistically significant improvement (-2.1654% on `linter/all-rules/large/dataset.py`): ``` linter/default-rules/numpy/globals.py time: [73.922 µs 73.955 µs 73.986 µs] thrpt: [39.882 MiB/s 39.898 MiB/s 39.916 MiB/s] change: time: [-0.5579% -0.4732% -0.3980%] (p = 0.00 < 0.05) thrpt: [+0.3996% +0.4755% +0.5611%] Change within noise threshold. Found 6 outliers among 100 measurements (6.00%) 4 (4.00%) low severe 1 (1.00%) low mild 1 (1.00%) high mild linter/default-rules/pydantic/types.py time: [1.4909 ms 1.4917 ms 1.4926 ms] thrpt: [17.087 MiB/s 17.096 MiB/s 17.106 MiB/s] change: time: [+0.2140% +0.2741% +0.3392%] (p = 0.00 < 0.05) thrpt: [-0.3380% -0.2734% -0.2136%] Change within noise threshold. Found 4 outliers among 100 measurements (4.00%) 3 (3.00%) high mild 1 (1.00%) high severe linter/default-rules/numpy/ctypeslib.py time: [688.97 µs 691.34 µs 694.15 µs] thrpt: [23.988 MiB/s 24.085 MiB/s 24.168 MiB/s] change: time: [-1.3282% -0.7298% -0.1466%] (p = 0.02 < 0.05) thrpt: [+0.1468% +0.7351% +1.3461%] Change within noise threshold. Found 15 outliers among 100 measurements (15.00%) 1 (1.00%) low mild 2 (2.00%) high mild 12 (12.00%) high severe linter/default-rules/large/dataset.py time: [3.3872 ms 3.4032 ms 3.4191 ms] thrpt: [11.899 MiB/s 11.954 MiB/s 12.011 MiB/s] change: time: [-0.6427% -0.2635% +0.0906%] (p = 0.17 > 0.05) thrpt: [-0.0905% +0.2642% +0.6469%] No change in performance detected. Found 20 outliers among 100 measurements (20.00%) 1 (1.00%) low severe 2 (2.00%) low mild 4 (4.00%) high mild 13 (13.00%) high severe linter/all-rules/numpy/globals.py time: [148.99 µs 149.21 µs 149.42 µs] thrpt: [19.748 MiB/s 19.776 MiB/s 19.805 MiB/s] change: time: [-0.7340% -0.5068% -0.2778%] (p = 0.00 < 0.05) thrpt: [+0.2785% +0.5094% +0.7395%] Change within noise threshold. Found 2 outliers among 100 measurements (2.00%) 1 (1.00%) low mild 1 (1.00%) high severe linter/all-rules/pydantic/types.py time: [3.0362 ms 3.0396 ms 3.0441 ms] thrpt: [8.3779 MiB/s 8.3903 MiB/s 8.3997 MiB/s] change: time: [-0.0957% +0.0618% +0.2125%] (p = 0.45 > 0.05) thrpt: [-0.2121% -0.0618% +0.0958%] No change in performance detected. Found 11 outliers among 100 measurements (11.00%) 1 (1.00%) low severe 3 (3.00%) low mild 5 (5.00%) high mild 2 (2.00%) high severe linter/all-rules/numpy/ctypeslib.py time: [1.6879 ms 1.6894 ms 1.6909 ms] thrpt: [9.8478 MiB/s 9.8562 MiB/s 9.8652 MiB/s] change: time: [-0.2279% -0.0888% +0.0436%] (p = 0.18 > 0.05) thrpt: [-0.0435% +0.0889% +0.2284%] No change in performance detected. Found 5 outliers among 100 measurements (5.00%) 4 (4.00%) low mild 1 (1.00%) high severe linter/all-rules/large/dataset.py time: [7.1520 ms 7.1586 ms 7.1654 ms] thrpt: [5.6777 MiB/s 5.6831 MiB/s 5.6883 MiB/s] change: time: [-2.5626% -2.1654% -1.7780%] (p = 0.00 < 0.05) thrpt: [+1.8102% +2.2133% +2.6300%] Performance has improved. Found 2 outliers among 100 measurements (2.00%) 1 (1.00%) low mild 1 (1.00%) high mild ```	2023-06-15 18:43:19 +00:00
konstin	66089e1a2e	Fix a number of formatter errors from the cpython repository (#5089 ) ## Summary This fixes a number of problems in the formatter that showed up with various files in the [cpython](https://github.com/python/cpython) repository. These problems surfaced as unstable formatting and invalid code. This is not the entirety of problems discovered through cpython, but a big enough chunk to separate it. Individual fixes are generally individual commits. They were discovered with #5055, which i update as i work through the output ## Test Plan I added regression tests with links to cpython for each entry, except for the two stubs that also got comment stubs since they'll be implemented properly later.	2023-06-15 11:24:14 +00:00
Charlie Marsh	716cab2f19	Run `rustfmt` on nightly to clean up erroneous comments (#5106 ) ## Summary This PR runs `rustfmt` with a few nightly options as a one-time fix to catch some malformatted comments. I ended up just running with: ```toml condense_wildcard_suffixes = true edition = "2021" max_width = 100 normalize_comments = true normalize_doc_attributes = true reorder_impl_items = true unstable_features = true use_field_init_shorthand = true ``` Since these all seem like reasonable things to fix, so may as well while I'm here.	2023-06-15 00:19:05 +00:00
konstin	95ee6dcb3b	Add contributor docs to formatter (#5023 ) I've written done my condensed learnings from working on the formatter so that others can have an easier start working on it. This is a pure docs change	2023-06-13 07:22:17 +00:00
Charlie Marsh	cc44349401	Use dedicated structs in `comparable.rs` (#5042 ) ## Summary Updating to match the updated AST structure, for consistency.	2023-06-13 03:57:34 +00:00
konstin	e586c27590	Format ExprTuple (#4963 ) This implements formatting ExprTuple, including magic trailing comma. I intentionally didn't change the settings mechanism but just added a dummy global const flag. Besides the snapshots, I added custom breaking/joining tests and a deeply nested test case. The diffs look better than previously, proper black compatibility depends on parentheses handling. --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-06-12 12:55:47 +00:00
Charlie Marsh	68b6d30c46	Use consistent `Cargo.toml` metadata in all crates (#5015 )	2023-06-12 00:02:40 +00:00
Charlie Marsh	f401050878	Introduce `PythonWhitespace` to confine trim operations to Python whitespace (#4994 ) ## Summary We use `.trim()` and friends in a bunch of places, to strip whitespace from source code. However, not all Unicode whitespace characters are considered "whitespace" in Python, which only supports the standard space, tab, and form-feed characters. This PR audits our usages of `.trim()`, `.trim_start()`, `.trim_end()`, and `char::is_whitespace`, and replaces them as appropriate with a new `.trim_whitespace()` analogues, powered by a `PythonWhitespace` trait. In general, the only place that should continue to use `.trim()` is content within docstrings, which don't need to adhere to Python's semantic definitions of whitespace. Closes #4991.	2023-06-09 21:44:50 -04:00
Charlie Marsh	1d756dc3a7	Move Python whitespace utilities into new `ruff_python_whitespace` crate (#4993 ) ## Summary `ruff_newlines` becomes `ruff_python_whitespace`, and includes the existing "universal newline" handlers alongside the Python whitespace-specific utilities.	2023-06-10 00:59:57 +00:00
Micha Reiser	111e1f93ca	perf(formatter): Skip bodies without comments (#4978 )	2023-06-09 11:33:57 +02:00
Micha Reiser	68d52da43b	Track formatted comments (#4979 )	2023-06-09 09:09:45 +00:00
Micha Reiser	646ab64850	Fix binary expression formatting with leading comments (#4964 )	2023-06-09 09:02:50 +00:00
Micha Reiser	1accbeffd6	Format `if` statements (#4961 )	2023-06-09 10:55:14 +02:00
Micha Reiser	68969240c5	Format Function definitions (#4951 )	2023-06-08 16:07:33 +00:00
Micha Reiser	9c3fb23ace	Simple lexer for formatter (#4922 )	2023-06-08 17:37:39 +02:00
konstin	467df23e65	Implement StmtReturn (#4960 ) * Implement StmtPass This implements StmtPass as `pass`. The snapshot diff is small because pass mainly occurs in bodies and function (#4951) and if/for bodies. * Implement StmtReturn This implements StmtReturn as `return` or `return {value}`. The snapshot diff is small because return occurs in functions (#4951)	2023-06-08 16:29:39 +02:00
konstin	c8442e91ce	Implement StmtPass (#4959 ) This implements StmtPass as `pass`. The snapshot diff is small because pass mainly occurs in bodies and function (#4951) and if/for bodies.	2023-06-08 16:29:27 +02:00
Micha Reiser	6bef347a8e	Trailing own line comments before func or class (#4921 )	2023-06-08 12:50:25 +00:00
Micha Reiser	c1cc6f3be1	Add basic Constant formatting (#4954 )	2023-06-08 11:42:44 +00:00
Micha Reiser	83cf6d6e2f	Implement Binary expression without `best_fitting` (#4952 )	2023-06-08 12:45:03 +02:00
konstin	23abad0bd5	A basic StmtAssign formatter and better dummies for expressions (#4938 ) * A basic StmtAssign formatter and better dummies for expressions The goal of this PR was formatting StmtAssign since many nodes in the black tests (and in python in general) are after an assignment. This caused unstable formatting: The spacing of power op spacing depends on the type of the two involved expressions, but each expression was formatted as dummy string and re-parsed as a ExprName, so in the second round the different rules of ExprName were applied, causing unstable formatting. This PR does not necessarily bring us closer to black's style, but it unlocks a good porting of black's test suite and is a basis for implementing the Expr nodes. * fmt * Review	2023-06-08 12:20:25 +02:00
Micha Reiser	39a1f3980f	Upgrade RustPython (#4900 )	2023-06-08 05:53:14 +00:00
Micha Reiser	bcf745c5ba	Replace verbatim text with `NOT_YET_IMPLEMENTED` (#4904 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR replaces the `verbatim_text` builder with a `not_yet_implemented` builder that emits `NOT_YET_IMPLEMENTED_<NodeKind>` for not yet implemented nodes. The motivation for this change is that partially formatting compound statements can result in incorrectly indented code, which is a syntax error: ```python def func_no_args(): a; b; c if True: raise RuntimeError if False: ... for i in range(10): print(i) continue ``` Get's reformatted to ```python def func_no_args(): a; b; c if True: raise RuntimeError if False: ... for i in range(10): print(i) continue ``` because our formatter does not yet support `for` statements and just inserts the text from the source. ## Downsides Using an identifier will not work in all situations. For example, an identifier is invalid in an `Arguments ` position. That's why I kept `verbatim_text` around and e.g. use it in the `Arguments` formatting logic where incorrect indentations are impossible (to my knowledge). Meaning, `verbatim_text` we can opt in to `verbatim_text` when we want to iterate quickly on nodes that we don't want to provide a full implementation yet and using an identifier would be invalid. ## Upsides Running this on main discovered stability issues with the newline handling that were previously "hidden" because of the verbatim formatting. I guess that's an upside :) ## Test Plan None?	2023-06-07 14:57:25 +02:00
Micha Reiser	6ab3fc60f4	Correctly handle newlines after/before comments (#4895 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This issue fixes the removal of empty lines between a leading comment and the previous statement: ```python a = 20 # leading comment b = 10 ``` Ruff removed the empty line between `a` and `b` because: * The leading comments formatting does not preserve leading newlines (to avoid adding new lines at the top of a body) * The `JoinNodesBuilder` counted the lines before `b`, which is 1 -> Doesn't insert a new line This is fixed by changing the `JoinNodesBuilder` to count the lines instead after the last node. This correctly gives 1, and the `# leading comment` will insert the empty lines between any other leading comment or the node. ## Test Plan I added a new test for empty lines.	2023-06-07 14:49:43 +02:00
Micha Reiser	3f032cf09d	Format binary expressions (#4862 ) * Format Binary Expressions * Extract NeedsParentheses trait	2023-06-06 08:34:53 +00:00
Micha Reiser	913b9d1fcf	Normalize newlines in `verbatim_text` (#4850 )	2023-06-05 19:30:28 +00:00
Micha Reiser	33434fcb9c	Add Formatter benchmark (#4860 )	2023-06-05 21:05:42 +02:00
konstin	209aaa5add	Ensure type_ignores for Module are empty (#4861 ) According to https://docs.python.org/3/library/ast.html#ast-helpers, we expect type_ignores to be always be empty, so this adds a debug assert. Test plan: I confirmed that the assertion holdes for the file below and for all the black tests which include a number of `type: ignore` comments. ```python # type: ignore if 1: print("1") # type: ignore # elsebranch # type: ignore else: # type: ignore print("2") # type: ignore while 1: print() # type: ignore ```	2023-06-05 11:38:08 +02:00
konstin	ff37d7af23	Implement module formatting using JoinNodesBuilder (#4808 ) * Implement module formatting using JoinNodesBuilder This uses JoinNodesBuilder to implement module formatting for #4800 See the snapshots for the changed behaviour. See one PR up for a CLI that i used to verify the trailing new line behaviour	2023-06-05 08:35:05 +00:00
Micha Reiser	c65f47d7c4	Format `while` Statement (#4810 )	2023-06-05 08:24:00 +00:00
konstin	d1d06960f0	Add a formatter CLI for debugging (#4809 ) * Add a formatter CLI for debugging This adds a ruff_python_formatter cli modelled aber `rustfmt` that i use for debugging * clippy * Add print IR and print comments options Tested with `cargo run --bin ruff_python_formatter -- --print-ir --print-comments scratch.py`	2023-06-05 07:33:33 +00:00
Micha Reiser	2c41c54e0c	Format `ExprName` (#4803 )	2023-06-03 16:06:14 +02:00
Micha Reiser	d6daa61563	Handle trailing end-of-line comments in-between-bodies (#4812 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary And more custom logic around comments in bodies... uff. Let's say we have the following code ```python if x == y: pass # trailing comment of pass else: # trailing comment of `else` print("I have no comments") ``` Right now, the formatter attaches the `# trailing comment of `else` as a trailing comment of `pass` because it doesn't "see" that there's an `else` keyword in between (because the else body is just a Vec and not a node). This PR adds custom logic that attaches the trailing comments after the `else` as dangling comments to the `if` statement. The if statement must then split the dangling comments by `comments.text_position()`: * All comments up to the first end-of-line comment are leading comments of the `else` keyword. * All end-of-line comments coming after are `trailing` comments for the `else` keyword. ## Test Plan I added new unit tests.	2023-06-03 15:29:22 +02:00
Micha Reiser	cb6788ab5f	Handle trailing body end-of-line comments (#4811 ) ### Summary This PR adds custom logic to handle end-of-line comments of the last statement in a body. For example: ```python while True: if something.changed: do.stuff() # trailing comment b ``` The `# trailing comment` is a trailing comment of the `do.stuff()` expression statement. We incorrectly attached the comment as a trailing comment of the enclosing `while` statement because the comment is between the end of the while statement (the `while` statement ends right after `do.stuff()`) and before the `b` statement. This PR fixes the placement to correctly attach these comments to the last statement in a body (recursively). ## Test Plan I reviewed the snapshots and they now look correct. This may appear odd because a lot comments have now disappeared. This is the expected result because we use `verbatim` formatting for the block statements (like `while`) and that means that it only formats the inner content of the block, but not any trailing comments. The comments were visible before, because they were associated with the block statement (e.g. `while`).	2023-06-03 15:17:33 +02:00
Micha Reiser	ebdc4afc33	Suite formatting and `JoinNodesBuilder` (#4805 )	2023-06-02 14:14:38 +00:00
Micha Reiser	a401989b7a	Format StmtExpr (#4788 )	2023-06-02 12:52:38 +00:00
Micha Reiser	4cd4b37e74	Format the comment content (#4786 )	2023-06-02 11:22:34 +00:00
konstin	c4fdbf8903	Switch PyFormatter lifetimes (#4804 ) Stylistic change to have the input lifetime first and the output lifetime second. I'll rebase my other PR on top of this. Test plan: `cargo clippy`	2023-06-02 12:26:39 +02:00
Micha Reiser	5d939222db	Leading, Dangling, and Trailing comments formatting (#4785 )	2023-06-02 09:26:36 +02:00
konstin	63d892f1e4	Implement basic module formatting (#4784 ) * Add Format for Stmt * Implement basic module formatting This implements formatting each statement in a module with a hard line break in between, so that we can start formatting statements. Basic testing is done by the snapshots	2023-06-01 15:25:50 +02:00
Micha Reiser	4ea4fd1984	Introduce `lines_before` helper (#4780 )	2023-06-01 11:56:43 +02:00
konstin	d4027d8b65	Use new formatter infrastructure in CLI and test (#4767 ) * Use dummy verbatim formatter for all nodes * Use new formatter infrastructure in CLI and test * Expose the new formatter in the CLI * Merge import blocks	2023-06-01 11:55:04 +02:00
konstin	9bf168c0a4	Use dummy verbatim formatter for all nodes (#4755 )	2023-06-01 08:25:26 +00:00
Micha Reiser	59148344be	Place comments of left and right binary expression operands (#4751 )	2023-06-01 07:01:32 +00:00
konstin	0945803427	Generate FormatRule definitions (#4724 ) * Generate FormatRule definitions * Generate verbatim output * pub(crate) everything * clippy fix * Update crates/ruff_python_formatter/src/lib.rs Co-authored-by: Micha Reiser <micha@reiser.io> * Update crates/ruff_python_formatter/src/lib.rs Co-authored-by: Micha Reiser <micha@reiser.io> * stub out with Ok(()) again * Update crates/ruff_python_formatter/src/lib.rs Co-authored-by: Micha Reiser <micha@reiser.io> * PyFormatContext::{contents, locator} with `#[allow(unused)]` * Can't leak private type * remove commented code * Fix ruff errors * pub struct Format{node} due to rust rules --------- Co-authored-by: Julian LaNeve <lanevejulian@gmail.com> Co-authored-by: Micha Reiser <micha@reiser.io>	2023-06-01 08:38:53 +02:00
Micha Reiser	b7294b48e7	Handle positional-only-arguments separator comments (#4748 )	2023-06-01 06:22:49 +00:00
Micha Reiser	be31d71849	Correctly associate own-line comments in bodies (#4671 )	2023-06-01 08:12:53 +02:00
Charlie Marsh	9d0ffd33ca	Move universal newline handling into its own crate (#4729 )	2023-05-31 12:00:47 -04:00
Micha Reiser	e209b5fc5f	Add reformat check (#4753 )	2023-05-31 17:36:15 +02:00
Micha Reiser	6c1ff6a85f	Upgrade RustPython (#4747 )	2023-05-31 08:26:35 +00:00
Micha Reiser	06bcb85f81	formatter: Remove CST and old formatting (#4730 )	2023-05-31 08:27:23 +02:00
Micha Reiser	0cd453bdf0	Generic "comment to node" association logic (#4642 )	2023-05-30 09:28:01 +00:00
Micha Reiser	84a5584888	Add `Comments` data structure (#4641 )	2023-05-30 08:54:55 +00:00
Micha Reiser	6146b75dd0	Add `MultiMap` implementation for storing comments (#4639 )	2023-05-30 09:51:25 +02:00
Micha Reiser	edc6c4058f	Move `shared_traits` to `ruff_formatter` (#4632 )	2023-05-24 17:38:11 +02:00
Micha Reiser	86ced3516b	Introduce `SourceCodeSlice` to reduce the size of `FormatElement` (#4622 ) Introduce `SourceCodeSlice` to reduce the size of `FormatElement`	2023-05-24 15:04:52 +00:00
Micha Reiser	6943beee66	Remove source position from `FormatElement::DynamicText` (#4619 )	2023-05-24 16:36:14 +02:00
Micha Reiser	daadd24bde	Include decorators in `Function` and `Class` definition ranges (#4467 )	2023-05-22 17:50:42 +02:00
Charlie Marsh	e8e66f3824	Remove unnecessary path prefixes (#4492 )	2023-05-18 10:19:09 -04:00
Micha Reiser	ddf7de7e86	Prototype Black's string joining/splitting (#4449 )	2023-05-16 18:42:40 +01:00
Jeong, YunWon	4b05ca1198	Specialize ConversionFlag (#4450 )	2023-05-16 18:00:13 +02:00
Charlie Marsh	f0465bf106	Emit non-logical newlines for "empty" lines (#4444 )	2023-05-16 14:58:56 +00:00
Micha Reiser	fa26860296	Refactor range from `Attributed` to `Node`s (#4422 )	2023-05-16 06:36:32 +00:00
Jonathan Plasse	c10a4535b9	Disallow `unreachable_pub` (#4314 )	2023-05-11 18:00:00 -04:00
Micha Reiser	1ccef5150d	Remove lifetime from FormatContext (#4376 )	2023-05-11 15:43:42 +00:00
Jeong, YunWon	be6e00ef6e	Re-integrate RustPython parser repository (#4359 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-05-11 07:47:17 +00:00
Calum Young	f0f4bf2929	Move typos to pre-commit config (#4148 )	2023-04-29 12:13:35 -04:00
Micha Reiser	cab65b25da	Replace row/column based `Location` with byte-offsets. (#3931 )	2023-04-26 18:11:02 +00:00
Micha Reiser	381203c084	Store source code on message (#3897 )	2023-04-11 07:57:36 +00:00
Micha Reiser	76c47a9a43	Cheap cloneable LineIndex (#3896 )	2023-04-11 07:33:40 +00:00
Charlie Marsh	d919adc13c	Introduce a `ruff_python_semantic` crate (#3865 )	2023-04-04 16:50:47 +00:00
Charlie Marsh	cf7e1ddd08	Remove some `usize` references (#3819 )	2023-03-30 17:35:42 -04:00
Charlie Marsh	c2750a59ab	Implement an iterator for universal newlines (#3454 ) # Summary We need to support CR line endings (as opposed to LF and CRLF line endings, which are already supported). They're rare, but they do appear in Python code, and we tend to panic on any file that uses them. Our `Locator` abstraction now supports CR line endings. However, Rust's `str#lines` implementation does _not_. This PR adds a `UniversalNewlineIterator` implementation that respects all of CR, LF, and CRLF line endings, and plugs it into most of the `.lines()` call sites. As an alternative design, it could be nice if we could leverage `Locator` for this. We've already computed all of the line endings, so we could probably iterate much more efficiently? # Test Plan Largely relying on automated testing, however, also ran over some known failure cases, like #3404.	2023-03-13 00:01:29 -04:00
Charlie Marsh	da1f83fe32	Remove `core` module from `ruff_python_formatter` (#3373 )	2023-03-08 19:11:39 +00:00
Charlie Marsh	0a9d259f9c	Remove copied `core` modules from `ruff_python_formatter` (#3371 )	2023-03-08 19:03:40 +00:00
Charlie Marsh	130e733023	Implement `From<Located>` for `Range` (#3377 )	2023-03-08 18:50:20 +00:00
Charlie Marsh	ff2c0dd491	Use shared `leading_quote` implementation in ruff_python_formatter (#3396 )	2023-03-08 18:21:59 +00:00
Charlie Marsh	d1c48016eb	Rename `ruff_python` crate to `ruff_python_stdlib` (#3354 ) In hindsight, `ruff_python` is too general. A good giveaway is that it's actually a prefix of some other crates. The intent of this crate is to reimplement pieces of the Python standard library and CPython itself, so `ruff_python_stdlib` feels appropriate.	2023-03-06 13:43:22 +00:00
Jonathan Plasse	8828e12283	Bump dependencies and move more shared dependencies into workspace (#3340 )	2023-03-04 12:36:26 -05:00
Charlie Marsh	f5f09b489b	Introduce dedicated CST tokens for other operator kinds (#3267 )	2023-02-27 23:54:57 -05:00
Charlie Marsh	061495a9eb	Make BoolOp its own located token (#3265 )	2023-02-28 03:43:28 +00:00
Charlie Marsh	470e1c1754	Preserve comments on non-defaulted arguments (#3264 )	2023-02-27 23:41:40 +00:00
Charlie Marsh	16be691712	Enable more non-panicking formatter tests (#3262 )	2023-02-27 18:21:53 -05:00
Charlie Marsh	2261e194a0	Create dedicated `Body` nodes in the formatter CST (#3223 )	2023-02-27 22:55:05 +00:00
Charlie Marsh	1c75071136	Implement basic rendering of remaining AST nodes (#3233 )	2023-02-26 05:05:56 +00:00
Charlie Marsh	51bca19c1d	Add builders for common comment rendering (#3232 )	2023-02-26 04:16:24 +00:00
Jeong YunWon	84e96cdcd9	More enum work (#3212 )	2023-02-25 11:40:16 -05:00
Charlie Marsh	159422071e	Handle end-of-line comments on `excepthandler` and `alias` (#3196 )	2023-02-23 22:35:39 -05:00
Charlie Marsh	6eaacf96be	Introduce a new CST element for slice segments (#3195 )	2023-02-24 00:49:41 +00:00
Charlie Marsh	eb15371453	Make Locator available in AST-to-CST conversion pass (#3194 )	2023-02-23 19:43:03 -05:00
Charlie Marsh	bda2a0007a	Parenthesize numbers during attribute accesses (#3189 )	2023-02-23 14:57:23 -05:00
Charlie Marsh	32d165b7ad	Implement complex literal formatting (#3186 )	2023-02-23 19:09:33 +00:00
Charlie Marsh	ac79bf4ee9	Implement float literal formatting (#3184 )	2023-02-23 14:02:23 -05:00
Charlie Marsh	376eab3a53	Implement integer literal formatting (#3183 )	2023-02-23 18:31:56 +00:00
Charlie Marsh	08be7bd285	Add a TODO to string_literal (#3181 )	2023-02-23 12:46:20 -05:00
Charlie Marsh	1e7233a8eb	Add support for reformatting byte strings (#3176 )	2023-02-23 16:50:24 +00:00
Charlie Marsh	f967f344fc	Add support for basic `Constant::Str` formatting (#3173 ) This PR enables us to apply the proper quotation marks, including support for escapes. There are some significant TODOs, especially around implicit concatenations like: ```py ( "abc" "def" ) ``` Which are represented as a single AST node, which requires us to tokenize _within_ the formatter to identify all the individual string parts.	2023-02-23 16:23:10 +00:00
Charlie Marsh	095f005bf4	Move RustPython vendored and helper code into its own crate (#3171 )	2023-02-23 14:14:16 +00:00
Charlie Marsh	e5c1f95545	Check-in updated snapshot (#3161 )	2023-02-23 03:42:27 +00:00
Charlie Marsh	227ff62a4e	Don't touch tuple brackets after `in` (#3160 )	2023-02-23 03:10:24 +00:00
Charlie Marsh	d8e4902516	Un-modify `tupleassign` and `function2` tests (#3158 ) I manually changed these in #3080 and #3083 to get the tests passing (with notes around the deviations) -- but that's no longer necessary, now that we have proper testing that takes deviations into account.	2023-02-23 02:37:25 +00:00
Charlie Marsh	5fd827545b	Add a trailing newline to all .py.expect files (#3156 ) This just re-formats all the `.py.expect` files with Black, both to add a trailing newline and be doubly-certain that they're correctly formatted. I also ensured that we add a hard line break after each statement, and that we avoid including an extra newline in the generated Markdown (since the code should contain the exact expected newlines).	2023-02-23 02:29:27 +00:00
Charlie Marsh	2f9de335db	Upgrade RustPython to match new flattened exports (#3141 )	2023-02-22 19:36:13 +00:00
Charlie Marsh	1efa2e07ad	Avoid match statement misidentification in token rules (#3129 )	2023-02-22 15:44:45 +00:00
Micha Reiser	ffd8e958fc	chore: Upgrade Rust to 1.67.0 (#3125 )	2023-02-22 10:03:17 -05:00
Micha Reiser	ed33b75bad	test(ruff_python_formatter): Run all Black tests (#2993 ) This PR changes the testing infrastructure to run all black tests and: * Pass if Ruff and Black generate the same formatting * Fail and write a markdown snapshot that shows the input code, the differences between Black and Ruff, Ruffs output, and Blacks output This is achieved by introducing a new `fixture` macro (open to better name suggestions) that "duplicates" the attributed test for every file that matches the specified glob pattern. Creating a new test for each file over having a test that iterates over all files has the advantage that you can run a single test, and that test failures indicate which case is failing. The `fixture` macro also makes it straightforward to e.g. setup our own spec tests that test very specific formatting by creating a new folder and use insta to assert the formatted output.	2023-02-22 09:25:06 -05:00
Charlie Marsh	cdc4e86158	Add support for TryStar (#3089 )	2023-02-21 13:42:20 -05:00
Charlie Marsh	a6eb60cdd5	Enable `function2` test (#3083 )	2023-02-21 04:37:50 +00:00
Charlie Marsh	90c04b9cff	Enable `tupleassign` test (#3080 )	2023-02-21 00:42:23 +00:00
Charlie Marsh	b701cca779	Enable some already-passing Black tests (#3079 )	2023-02-21 00:10:35 +00:00
Charlie Marsh	ce8953442d	Add support for trailing colons in slice expressions (#3077 )	2023-02-20 23:24:32 +00:00
Charlie Marsh	6e02405bd6	Add `StmtKind::Try`; fix trailing newlines (#3074 )	2023-02-20 22:55:32 +00:00
Jeong YunWon	35606d7b05	clean up to fix nightly clippy warnings and dedents (#3057 )	2023-02-20 09:33:47 -05:00
Charlie Marsh	c297d46899	Remove unused `AsFormat` trait for `Option<T>` (#3041 ) We should re-add this, but it's currently unused and doesn't compile under 1.66.0. See: #3039.	2023-02-19 20:19:35 +00:00
Jonathan Plasse	b75663be6d	Add missing rust-version in crates (#3009 )	2023-02-19 15:07:17 +00:00
Charlie Marsh	180541a924	Unify comment terminology with that of `rome_formatter` (#2979 )	2023-02-17 03:02:25 +00:00
Charlie Marsh	6088a36cd3	Use `line_suffix` for end-of-line comments (#2975 )	2023-02-16 18:37:40 -05:00
Charlie Marsh	5157f584ab	Improve pow operator spacing (#2970 ) Ensure that we add spaces to expressions like `foo.bar() ** 2`.	2023-02-16 15:17:32 -05:00
Charlie Marsh	1c01ec21cb	Regenerate expected Black snapshots (#2968 )	2023-02-16 19:39:17 +00:00
Charlie Marsh	cb971d3a48	Respect self as positional-only argument in annotation rules (#2927 )	2023-02-15 15:25:17 +00:00
Charlie Marsh	57a5071b4e	Rename some methods on `Locator` (#2926 )	2023-02-15 10:21:49 -05:00
Charlie Marsh	ca49b00e55	Add initial formatter implementation (#2883 ) # Summary This PR contains the code for the autoformatter proof-of-concept. ## Crate structure The primary formatting hook is the `fmt` function in `crates/ruff_python_formatter/src/lib.rs`. The current formatter approach is outlined in `crates/ruff_python_formatter/src/lib.rs`, and is structured as follows: - Tokenize the code using the RustPython lexer. - In `crates/ruff_python_formatter/src/trivia.rs`, extract a variety of trivia tokens from the token stream. These include comments, trailing commas, and empty lines. - Generate the AST via the RustPython parser. - In `crates/ruff_python_formatter/src/cst.rs`, convert the AST to a CST structure. As of now, the CST is nearly identical to the AST, except that every node gets a `trivia` vector. But we might want to modify it further. - In `crates/ruff_python_formatter/src/attachment.rs`, attach each trivia token to the corresponding CST node. The logic for this is mostly in `decorate_trivia` and is ported almost directly from Prettier (given each token, find its preceding, following, and enclosing nodes, then attach the token to the appropriate node in a second pass). - In `crates/ruff_python_formatter/src/newlines.rs`, normalize newlines to match Black’s preferences. This involves traversing the CST and inserting or removing `TriviaToken` values as we go. - Call `format!` on the CST, which delegates to type-specific formatter implementations (e.g., `crates/ruff_python_formatter/src/format/stmt.rs` for `Stmt` nodes, and similar for `Expr` nodes; the others are trivial). Those type-specific implementations delegate to kind-specific functions (e.g., `format_func_def`). ## Testing and iteration The formatter is being developed against the Black test suite, which was copied over in-full to `crates/ruff_python_formatter/resources/test/fixtures/black`. The Black fixtures had to be modified to create `[insta](https://github.com/mitsuhiko/insta)`-compatible snapshots, which now exist in the repo. My approach thus far has been to try and improve coverage by tackling fixtures one-by-one. ## What works, and what doesn’t - Most nodes are supported at a basic level (though there are a few stragglers at time of writing, like `StmtKind::Try`). - Newlines are properly preserved in most cases. - Magic trailing commas are properly preserved in some (but not all) cases. - Trivial leading and trailing standalone comments mostly work (although maybe not at the end of a file). - Inline comments, and comments within expressions, often don’t work -- they work in a few cases, but it’s one-off right now. (We’re probably associating them with the “right” nodes more often than we are actually rendering them in the right place.) - We don’t properly normalize string quotes. (At present, we just repeat any constants verbatim.) - We’re mishandling a bunch of wrapping cases (if we treat Black as the reference implementation). Here are a few examples (demonstrating Black's stable behavior): ```py # In some cases, if the end expression is "self-closing" (functions, # lists, dictionaries, sets, subscript accesses, and any length-two # boolean operations that end in these elments), Black # will wrap like this... if some_expression and f( b, c, d, ): pass # ...whereas we do this: if ( some_expression and f( b, c, d, ) ): pass # If function arguments can fit on a single line, then Black will # format them like this, rather than exploding them vertically. if f( a, b, c, d, e, f, g, ... ): pass ``` - We don’t properly preserve parentheses in all cases. Black preserves parentheses in some but not all cases.	2023-02-15 04:06:35 +00:00

... 13 14 15 16 17

841 Commits