Python/ruff - ruff - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Tom Kuson	98920909c6	Complete documentation for `flake8-blind-except` and `flake8-raise` rules (#5143 ) ## Summary Completes the documentation for the `flake8-blind-except` and `flake8-raise` rules. Related to #2646. ## Test Plan `python scripts/check_docs_formatted.py`	2023-06-17 12:56:27 -04:00
Evan Rittenhouse	e1e1d2d341	Add Applicability to flynt (#5160 ) ## Summary Fixes some of #4184.	2023-06-17 12:05:43 -04:00
David Szotten	4b9b6829dc	format StmtBreak (#5158 ) ## Summary format `StmtBreak` trying to learn how to help out with the formatter. starting simple ## Test Plan new snapshot test	2023-06-17 10:31:29 +02:00
Charlie Marsh	d0ad1ed0af	Replace static `CallPath` vectors with `matches!` macros (#5148 ) ## Summary After #5140, I audited the codebase for similar patterns (defining a list of `CallPath` entities in a static vector, then looping over them to pattern-match). This PR migrates all other such cases to use `match` and `matches!` where possible. There are a few benefits to this: 1. It more clearly denotes the intended semantics (branches are exclusive). 2. The compiler can help deduplicate the patterns and detect unreachable branches. 3. Performance: in the benchmark below, the all-rules performance is increased by nearly 10%... ## Benchmarks I decided to benchmark against a large file in the Airflow repository with a lot of type annotations ([`views.py`](https://raw.githubusercontent.com/apache/airflow/f03f73100e8a7d6019249889de567cb00e71e457/airflow/www/views.py)): ``` linter/default-rules/airflow/views.py time: [10.871 ms 10.882 ms 10.894 ms] thrpt: [19.739 MiB/s 19.761 MiB/s 19.781 MiB/s] change: time: [-2.7182% -2.5687% -2.4204%] (p = 0.00 < 0.05) thrpt: [+2.4805% +2.6364% +2.7942%] Performance has improved. linter/all-rules/airflow/views.py time: [24.021 ms 24.038 ms 24.062 ms] thrpt: [8.9373 MiB/s 8.9461 MiB/s 8.9527 MiB/s] change: time: [-8.9537% -8.8516% -8.7527%] (p = 0.00 < 0.05) thrpt: [+9.5923% +9.7112% +9.8342%] Performance has improved. Found 12 outliers among 100 measurements (12.00%) 5 (5.00%) high mild 7 (7.00%) high severe ``` The impact is dramatic -- nearly a 10% improvement for `all-rules`.	2023-06-16 17:34:42 +00:00
Charlie Marsh	b3240dbfa2	Avoid propagating `BindingKind::Global` and `BindingKind::Nonlocal` (#5136 ) ## Summary This PR fixes a small quirk in the semantic model. Typically, when we see an import, like `import foo`, we create a `BindingKind::Importation` for it. However, if `foo` has been declared as a `global`, then we propagate the kind forward. So given: ```python global foo import foo ``` We'd create two bindings for `foo`, both with type `global`. This was originally borrowed from Pyflakes, and it exists to help avoid false-positives like: ```python def f(): global foo # Don't mark `foo` as "assigned but unused"! It's a global! foo = 1 ``` This PR removes that behavior, and instead tracks "Does this binding refer to a global?" as a flag. This is much cleaner, since it means we don't "lose" the identity of various bindings. As a very strange example of why this matters, consider: ```python def foo(): global Member from module import Member x: Member = 1 ``` `Member` is only used in a typing context, so we should flag it and say "move it to a `TYPE_CHECKING` block". However, when we go to analyze `from module import Member`, it has `BindingKind::Global`. So we don't even know that it's an import!	2023-06-16 11:06:59 -04:00
Charlie Marsh	fd1dfc3bfa	Add support for global and nonlocal symbol renames (#5134 ) ## Summary In #5074, we introduced an abstraction to support local symbol renames ("local" here refers to "within a module"). However, that abstraction didn't support `global` and `nonlocal` symbols. This PR extends it to those cases. Broadly, there are two considerations. First, we may be renaming a symbol in a scope in which it is declared `global` or `nonlocal`. For example, given: ```python x = 1 def foo(): global x ``` Then when renaming `x` in `foo`, we need to detect that it's `global` and instead perform the rename starting from the module scope. Second, when renaming a symbol, we need to determine the scopes in which it is declared `global` or `nonlocal`. This is effectively the inverse of the above: when renaming `x` in the module scope, we need to detect that we should _also_ rename `x` in `foo`. To support these cases, the renaming algorithm was adjusted as follows: - When we start a rename in a scope, determine whether the symbol is declared `global` or `nonlocal` by looking for a `global` or `nonlocal` binding. If it is, start the rename in the defining scope. (This requires storing the defining scope on the `nonlocal` binding, which is new.) - We then perform the rename in the defining scope. - We then check whether the symbol was declared as `global` or `nonlocal` in any scopes, and perform the rename in those scopes too. (Thankfully, this doesn't need to be done recursively.) Closes #5092. ## Test Plan Added some additional snapshot tests.	2023-06-16 14:35:10 +00:00
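The three steps listed in #5134 can be sketched on a toy scope model. This is only an illustration in Python, not Ruff's `renamer.rs`: the `Scope` class, the `rename` function, and the convention that the first scope in the list is the module scope are all assumptions made for the example, and only `global` (not `nonlocal`) is handled.

```python
from dataclasses import dataclass, field


@dataclass
class Scope:
    """Hypothetical scope: names bound here, plus names declared `global` here."""

    name: str
    bindings: set = field(default_factory=set)
    global_decls: set = field(default_factory=set)


def rename(scopes, start, old, new):
    """Rename `old` to `new`, returning the names of the scopes that were touched."""
    module = scopes[0]  # assumption: scopes[0] is the module scope
    # Step 1: if `old` is declared `global` in the starting scope,
    # begin the rename in the defining (module) scope instead.
    defining = module if old in start.global_decls else start
    touched = []
    # Step 2: perform the rename in the defining scope.
    if old in defining.bindings:
        defining.bindings.remove(old)
        defining.bindings.add(new)
        touched.append(defining.name)
    # Step 3: a module-level symbol must also be renamed in every scope
    # that declares it `global`. No recursion is needed.
    if defining is module:
        for scope in scopes[1:]:
            if old in scope.global_decls:
                scope.global_decls.remove(old)
                scope.global_decls.add(new)
                touched.append(scope.name)
    return touched


module = Scope("module", bindings={"x"})
foo = Scope("foo", global_decls={"x"})  # def foo(): global x
bar = Scope("bar", bindings={"x"})      # def bar(): x = 2 (an unrelated local)
touched = rename([module, foo, bar], foo, "x", "y")
```

Starting the rename in `foo` ends up touching the module scope and `foo`, while the unrelated local `x` in `bar` is left alone.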
Charlie Marsh	b9754bd5c5	Add autofix for `Set`-to-`AbstractSet` rewrite using reference tracking (#5074 ) ## Summary This PR enables autofix behavior for the `flake8-pyi` rule that asks you to alias `Set` to `AbstractSet` when importing `collections.abc.Set`. It's not the most important rule, but it's a good isolated test-case for local symbol renaming. The renaming algorithm is outlined in-detail in the `renamer.rs` module. But to demonstrate the behavior, here's the diff when running this fix over a complex file that exercises a few edge cases: ```diff --- a/foo.pyi +++ b/foo.pyi @@ -1,16 +1,16 @@ if True: - from collections.abc import Set + from collections.abc import Set as AbstractSet else: - Set = 1 + AbstractSet = 1 -x: Set = set() +x: AbstractSet = set() -x: Set +x: AbstractSet -del Set +del AbstractSet def f(): - print(Set) + print(AbstractSet) def Set(): pass ``` Making this work required resolving a bunch of edge cases in the semantic model that were causing us to "lose track" of references. For example, the above wasn't possible with our previous approach to handling deletions (#5071). Similarly, the `x: Set` "delayed annotation" tracking was enabled via #5070. And many of these edits would've failed if we hadn't changed `BindingKind` to always match the identifier range (#5090). So it's really the culmination of a bunch of changes over the course of the week. The main outstanding TODO is that this doesn't support `global` or `nonlocal` usages. I'm going to take a look at that tonight, but I'm comfortable merging this as-is. Closes #1106. Closes #5091.	2023-06-16 14:12:33 +00:00
Charlie Marsh	307f7a735c	Avoid allocations in lowercase comparisons (#5137 ) ## Summary I noticed that we have a few hot comparisons that involve calling `s.to_lowercase()`. We can avoid an allocation by comparing characters directly.	2023-06-16 08:57:43 -04:00
Charlie Marsh	3af9dfeb0a	Rewrite `suspicious_function_call` as a match statement (#5140 ) ## Summary @konstin mentioned that in profiling, this function accounted for a non-trivial amount of time (0.33% of total execution, the most of any rule). This PR attempts to rewrite it as a match statement for better performance over a looping comparison. ## Test Plan `cargo test`	2023-06-16 08:57:20 -04:00
Charlie Marsh	5526699535	Use const-singleton helpers in more rules (#5142 )	2023-06-16 04:28:35 +00:00
Charlie Marsh	fab2a4adf7	Use `matches!` for insecure hash rule (#5141 )	2023-06-16 04:18:32 +00:00
Charlie Marsh	13813dc1b1	Skip `DJ008` enforcement in stub files (#5139 ) Closes #5138.	2023-06-16 03:49:40 +00:00
Charlie Marsh	70c01257ca	Minor formatting changes to `Checker` (#5135 )	2023-06-15 22:42:21 -04:00
Evan Rittenhouse	26d19655db	Add Applicability to flake8_tidy_imports (#5131 ) ## Summary Fixes some of https://github.com/astral-sh/ruff/issues/4184	2023-06-15 18:09:00 -04:00
Charlie Marsh	1f856aa576	Don't treat straight imports of __future__ as `__future__` imports (#5128 ) ## Summary If you `import __future__`, it's not subject to the same rules as `from __future__ import feature` -- i.e., this is fine: ```python x = 1 import __future__ ``` It doesn't really make sense to treat these as `__future__` imports (though I can't imagine anyone ever does this anyway).	2023-06-15 20:53:02 +00:00
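The distinction drawn in #5128 can be checked against the Python compiler itself. A minimal sketch, using only the built-in `compile`; the variable name `future_error` is just for illustration:

```python
# A plain `import __future__` is an ordinary import and may follow other statements.
compile("x = 1\nimport __future__\n", "<ok>", "exec")

# A `from __future__ import ...` statement must come before other statements,
# so the compiler rejects this source.
try:
    compile("x = 1\nfrom __future__ import annotations\n", "<bad>", "exec")
except SyntaxError as err:
    future_error = err.msg
else:
    future_error = None
```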
Evan Rittenhouse	1e383483f7	Add Applicability to flake8_quotes fixes (#5130 ) ## Summary Fixes some of #4184	2023-06-15 16:50:54 -04:00
Evan Rittenhouse	89b328c6be	Add Applicability to flake8_logging_format fixes (#5129 ) ## Summary Fixes some of #4184	2023-06-15 16:50:19 -04:00
Evan Rittenhouse	6143065fc2	Add Applicability to flake8_comma fixes (#5127 ) ## Summary Fixes some of #4184	2023-06-15 16:49:54 -04:00
Charlie Marsh	107a295af4	Allow `async with` in `redefined-loop-name` (#5125 ) ## Summary Closes #5124.	2023-06-15 15:00:19 -04:00
Charlie Marsh	5ea3e42513	Always use identifier ranges to store bindings (#5110 ) ## Summary At present, when we store a binding, we include a `TextRange` alongside it. The `TextRange` _sometimes_ matches the exact range of the identifier to which the `Binding` is linked, but... not always. For example, given: ```python x = 1 ``` The binding we create _will_ use the range of `x`, because the left-hand side is an `Expr::Name`, which has a valid range on it. However, given: ```python try: pass except ValueError as e: pass ``` When we create a binding for `e`, we don't have a `TextRange`... The AST doesn't give us one. So we end up extracting it via lexing. This PR extends that pattern to the rest of the binding kinds, to ensure that whenever we create a binding, we always use the range of the bound name. This leads to better diagnostics in cases like pattern matching, whereby the diagnostic for "unused variable `x`" here used to include `x`, instead of just `x`: ```python def f(provided: int) -> int: match provided: case [_, x]: pass ``` This is _also_ required for symbol renames, since we track writes as bindings -- so we need to know the ranges of the bound symbols. By storing these bindings precisely, we can also remove the `binding.trimmed_range` abstraction -- since bindings already use the "trimmed range". To implement this behavior, I took some of our existing utilities (like the code we had for `except ValueError as e` above), migrated them from a full lexer to a zero-allocation lexer that _only_ identifies "identifiers", and moved the behavior into a trait, so we can now do `stmt.identifier(locator)` to get the range for the identifier. Honestly, we might end up discarding much of this if we decide to put ranges on all identifiers (https://github.com/astral-sh/RustPython-Parser/pull/8). 
But even if we do, this will _still_ be a good change, because the lexer introduced here is useful beyond names (e.g., we use it to find the `except` keyword in an exception handler, to find the `else` after a `for` loop, and so on). So, I'm fine committing this even if we end up changing our minds about the right approach. Closes #5090. ## Benchmarks No significant change, with one statistically significant improvement (-2.1654% on `linter/all-rules/large/dataset.py`): ``` linter/default-rules/numpy/globals.py time: [73.922 µs 73.955 µs 73.986 µs] thrpt: [39.882 MiB/s 39.898 MiB/s 39.916 MiB/s] change: time: [-0.5579% -0.4732% -0.3980%] (p = 0.00 < 0.05) thrpt: [+0.3996% +0.4755% +0.5611%] Change within noise threshold. Found 6 outliers among 100 measurements (6.00%) 4 (4.00%) low severe 1 (1.00%) low mild 1 (1.00%) high mild linter/default-rules/pydantic/types.py time: [1.4909 ms 1.4917 ms 1.4926 ms] thrpt: [17.087 MiB/s 17.096 MiB/s 17.106 MiB/s] change: time: [+0.2140% +0.2741% +0.3392%] (p = 0.00 < 0.05) thrpt: [-0.3380% -0.2734% -0.2136%] Change within noise threshold. Found 4 outliers among 100 measurements (4.00%) 3 (3.00%) high mild 1 (1.00%) high severe linter/default-rules/numpy/ctypeslib.py time: [688.97 µs 691.34 µs 694.15 µs] thrpt: [23.988 MiB/s 24.085 MiB/s 24.168 MiB/s] change: time: [-1.3282% -0.7298% -0.1466%] (p = 0.02 < 0.05) thrpt: [+0.1468% +0.7351% +1.3461%] Change within noise threshold. Found 15 outliers among 100 measurements (15.00%) 1 (1.00%) low mild 2 (2.00%) high mild 12 (12.00%) high severe linter/default-rules/large/dataset.py time: [3.3872 ms 3.4032 ms 3.4191 ms] thrpt: [11.899 MiB/s 11.954 MiB/s 12.011 MiB/s] change: time: [-0.6427% -0.2635% +0.0906%] (p = 0.17 > 0.05) thrpt: [-0.0905% +0.2642% +0.6469%] No change in performance detected. 
Found 20 outliers among 100 measurements (20.00%) 1 (1.00%) low severe 2 (2.00%) low mild 4 (4.00%) high mild 13 (13.00%) high severe linter/all-rules/numpy/globals.py time: [148.99 µs 149.21 µs 149.42 µs] thrpt: [19.748 MiB/s 19.776 MiB/s 19.805 MiB/s] change: time: [-0.7340% -0.5068% -0.2778%] (p = 0.00 < 0.05) thrpt: [+0.2785% +0.5094% +0.7395%] Change within noise threshold. Found 2 outliers among 100 measurements (2.00%) 1 (1.00%) low mild 1 (1.00%) high severe linter/all-rules/pydantic/types.py time: [3.0362 ms 3.0396 ms 3.0441 ms] thrpt: [8.3779 MiB/s 8.3903 MiB/s 8.3997 MiB/s] change: time: [-0.0957% +0.0618% +0.2125%] (p = 0.45 > 0.05) thrpt: [-0.2121% -0.0618% +0.0958%] No change in performance detected. Found 11 outliers among 100 measurements (11.00%) 1 (1.00%) low severe 3 (3.00%) low mild 5 (5.00%) high mild 2 (2.00%) high severe linter/all-rules/numpy/ctypeslib.py time: [1.6879 ms 1.6894 ms 1.6909 ms] thrpt: [9.8478 MiB/s 9.8562 MiB/s 9.8652 MiB/s] change: time: [-0.2279% -0.0888% +0.0436%] (p = 0.18 > 0.05) thrpt: [-0.0435% +0.0889% +0.2284%] No change in performance detected. Found 5 outliers among 100 measurements (5.00%) 4 (4.00%) low mild 1 (1.00%) high severe linter/all-rules/large/dataset.py time: [7.1520 ms 7.1586 ms 7.1654 ms] thrpt: [5.6777 MiB/s 5.6831 MiB/s 5.6883 MiB/s] change: time: [-2.5626% -2.1654% -1.7780%] (p = 0.00 < 0.05) thrpt: [+1.8102% +2.2133% +2.6300%] Performance has improved. Found 2 outliers among 100 measurements (2.00%) 1 (1.00%) low mild 1 (1.00%) high mild ```	2023-06-15 18:43:19 +00:00
konstin	66089e1a2e	Fix a number of formatter errors from the cpython repository (#5089 ) ## Summary This fixes a number of problems in the formatter that showed up with various files in the [cpython](https://github.com/python/cpython) repository. These problems surfaced as unstable formatting and invalid code. This is not the entirety of problems discovered through cpython, but a big enough chunk to separate it. Individual fixes are generally individual commits. They were discovered with #5055, which I update as I work through the output. ## Test Plan I added regression tests with links to cpython for each entry, except for the two stubs that also got comment stubs since they'll be implemented properly later.	2023-06-15 11:24:14 +00:00
Dhruv Manilawala	097823b56d	Ability to perform integration test on Jupyter notebooks (#5076 ) ## Summary Ability to perform integration test on Jupyter notebooks Part of #1218 ## Test Plan `cargo test`	2023-06-15 08:04:27 +05:30
Charlie Marsh	ed8113267c	Add autofix specification levels for a variety of rules (#5109 )	2023-06-14 22:03:37 -04:00
Charlie Marsh	c654280d84	Use direct links for all PEP 8 references (#5108 )	2023-06-14 21:12:23 -04:00
Charlie Marsh	99486b38f4	Disambiguate all Python documentation references (#5107 )	2023-06-15 00:47:00 +00:00
Charlie Marsh	716cab2f19	Run `rustfmt` on nightly to clean up erroneous comments (#5106 ) ## Summary This PR runs `rustfmt` with a few nightly options as a one-time fix to catch some malformatted comments. I ended up just running with: ```toml condense_wildcard_suffixes = true edition = "2021" max_width = 100 normalize_comments = true normalize_doc_attributes = true reorder_impl_items = true unstable_features = true use_field_init_shorthand = true ``` These all seem like reasonable things to fix, so I may as well do so while I'm here.	2023-06-15 00:19:05 +00:00
Charlie Marsh	9ab16fb417	Add `target-version` link to relevant rules (#5105 )	2023-06-15 00:12:32 +00:00
Charlie Marsh	458beccf14	Uniformly put `## Options` at the end of documentation (#5104 )	2023-06-15 00:04:51 +00:00
Tom Kuson	ccbc863960	Complete `pyupgrade` documentation (#5096 ) ## Summary Completes the documentation for the `pyupgrade` rules. Related to #2646. ## Test Plan `python scripts/check_docs_formatted.py`	2023-06-14 23:43:12 +00:00
Charlie Marsh	71b3130ff1	Remove manual `await` detection (#5103 ) We can just use `any_over_expr` instead.	2023-06-14 22:17:14 +00:00
Tom Kuson	08cd140ea6	Ignore `reimplemented-builtin` if in `async` context (#5101 ) ## Summary Checks if `checker` is in an `async` context. If yes, return early. Fixes #5098. ## Test Plan `cargo test`	2023-06-14 18:00:30 -04:00
Charlie Marsh	848f184b8c	Enable UTC-import for `datetime-utc-alias` fix (#5100 ) ## Summary Small update to leverage `get_or_import_symbol` to fix `UP017` in more cases (e.g., when we need to import `UTC`, or access it from an alias or something). ## Test Plan Check out the updated snapshot.	2023-06-14 21:13:36 +00:00
Charlie Marsh	56476dfd61	Use `matches!` for `CallPath` comparisons (#5099 ) ## Summary This PR consistently uses `matches!` for static `CallPath` comparisons. In some cases, we can significantly reduce the number of cases or checks. ## Test Plan `cargo test`	2023-06-14 17:06:34 -04:00
Charlie Marsh	bae183b823	Rename `semantic_model` and `model` usages to `semantic` (#5097 ) ## Summary As discussed in Discord, and similar to oxc, we're going to refer to this as `.semantic()` everywhere. While I was auditing usages of `model: &SemanticModel`, I also changed as many function signatures as I could find to consistently take the model as the _last_ argument, rather than the first.	2023-06-14 15:01:51 -04:00
Charlie Marsh	65dbfd2556	Improve names and documentation on scope API (#5095 ) ## Summary Just minor improvements to improve consistency of method names and availability.	2023-06-14 18:28:55 +00:00
Charlie Marsh	86ff1febea	Re-export `ruff_python_semantic` members (#5094 ) ## Summary This PR adds a more unified public API to `ruff_python_semantic`, so that we don't need to do deeply nested imports all over the place.	2023-06-14 18:23:38 +00:00
Charlie Marsh	a33bbe6335	Track "delayed" annotations in the semantic model (#5070 ) ## Summary This PR tackles a corner case that we'll need to support local symbol renaming. It relates to a nuance in how we want to handle annotations (i.e., `AnnAssign` statements with no value, like `x: int` in a function body). When we see a statement like: ```python x: int ``` We create a `BindingKind::Annotation` for `x`. This is a special `BindingKind` that the resolver isn't allowed to return. For example, given: ```python x: int print(x) ``` The second line will yield an `undefined-name` error. So why does this `BindingKind` exist at all? In Pyflakes, to support the `unused-annotation` lint: ```python def f(): x: int # unused-annotation ``` If we don't track `BindingKind::Annotation`, we can't lint for unused variables that are only "defined" via annotations. There are a few other wrinkles to `BindingKind::Annotation`. One is that, if a binding already exists in the scope, we actually just discard the `BindingKind`. So in this case: ```python x = 1 x: int ``` When we go to create the `BindingKind::Annotation` for the second statement, we notice that (1) we're creating an annotation but (2) the scope already has a binding for the name -- so we just drop the binding on the floor. This has the nice property that annotations aren't considered to "shadow" another binding, which is important in a bunch of places (e.g., if we have `import os; os: int`, we still consider `os` to be an import, as we should). But it also means that these "delayed" annotations are one of the few remaining references that we don't track anywhere in the semantic model. This PR adds explicit support for these via a new `delayed_annotations` attribute on the semantic model. These should be extremely rare, but we do need to track them if we want to support local symbol renaming. ### This isn't the right way to model this This isn't the right way to model this. 
Here's an alternative: - Remove `BindingKind::Annotation`, and treat annotations as their own, separate concept. - Instead of storing a map from name to `BindingId` on each `Scope`, store a map from name to... `SymbolId`. - Introduce a `Symbol` abstraction, where a symbol can point to a current binding, and a list of annotations, like: ```rust pub struct Symbol { binding: Option<BindingId>, annotations: Vec<AnnotationId> } ``` If we did this, we could appropriately model the semantics described above. When we go to resolve a binding, we ignore annotations (always). When we try to find unused variables, we look through the list of symbols, and have sufficient information to discriminate between annotations and bound variables. Etc. The main downside of this `Symbol`-based approach is that it's going to take a lot more work to implement, and it'll be less performant (we'll be storing more data per symbol, and our binding lookups will have an added layer of indirection).	2023-06-14 17:54:35 +00:00
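The runtime behaviour that motivates `BindingKind::Annotation` in #5070 (an annotation without a value declares a name but binds nothing) can be observed in plain Python. A small sketch; the names `namespace`, `bound_at_module_level`, and `function_read` are only for illustration:

```python
# At module level, `x: int` alone does not create a binding for `x`.
namespace = {}
exec("x: int\n", namespace)
bound_at_module_level = "x" in namespace

# Reading the name afterwards fails, which is what an `undefined-name`
# diagnostic predicts.
try:
    exec("x: int\nprint(x)\n", {})
except NameError:
    module_read = "NameError"
else:
    module_read = "ok"


# Inside a function, the annotation makes `x` a local that is never assigned.
def f():
    x: int
    return x


try:
    f()
except UnboundLocalError:
    function_read = "UnboundLocalError"
else:
    function_read = "ok"
```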
Charlie Marsh	c992cfa76e	Make some of `ruff_python_semantic` `pub(crate)` (#5093 )	2023-06-14 17:49:37 +00:00
konstin	916f0889f8	Add pyproject.toml to include option doc (#5080 ) Fixes an oversight where I didn't update this initially.	2023-06-14 15:55:12 +00:00
Charlie Marsh	732b0405d7	Remove `FixMode::None` (#5087 ) ## Summary We now _always_ generate fixes, so `FixMode::None` and `FixMode::Generate` are redundant. We can also remove the TODO around `--fix-dry-run`, since that's our default behavior. Closes #5081.	2023-06-14 11:17:09 -04:00
Thomas de Zeeuw	e7316c1cc6	Consider ignore-names in all pep8 naming rules (#5079 ) ## Summary This changes all remaining pep8 naming rules to consider the `ignore-names` argument. Closes #5050 ## Test Plan Added new tests.	2023-06-14 16:57:09 +02:00
Charlie Marsh	6f10aeebaa	Remove unused `Scope#delete` method (#5085 ) ## Summary This is now intentionally unused and is now made impossible (via this PR).	2023-06-14 14:15:14 +00:00
Charlie Marsh	c74ef77e85	Move binding accesses into `SemanticModel` method (#5084 )	2023-06-14 14:07:46 +00:00
Charlie Marsh	1e497162d1	Add a dedicated read result for unbound locals (#5083 ) ## Summary Small follow-up to #4888 to add a dedicated `ResolvedRead` case for unbound locals, mostly for clarity and documentation purposes (no behavior changes). ## Test Plan `cargo test`	2023-06-14 09:58:48 -04:00
Charlie Marsh	aa41ffcfde	Add `BindingKind` variants to represent deleted bindings (#5071 ) ## Summary Our current mechanism for handling deletions (e.g., `del x`) is to remove the symbol from the scope's `bindings` table. This "does the right thing", in that if we then reference a deleted symbol, we're able to determine that it's unbound -- but it causes a variety of problems, mostly in that it makes certain bindings and references unreachable after-the-fact. Consider: ```python x = 1 print(x) del x ``` If we analyze this code _after_ running the semantic model over the AST, we'll have no way of knowing that `x` was ever introduced in the scope, much less that it was bound to a value, read, and then deleted -- because we effectively erased `x` from the model entirely when we hit the deletion. In practice, this will make it impossible for us to support local symbol renames. It also means that certain rules that we want to move out of the model-building phase and into the "check dead scopes" phase wouldn't work today, since we'll have lost important information about the source code. This PR introduces two new `BindingKind` variants to model deletions: - `BindingKind::Deletion`, which represents `x = 1; del x`. - `BindingKind::UnboundException`, which represents: ```python try: 1 / 0 except Exception as e: pass ``` In the latter case, `e` gets unbound after the exception handler (assuming it's triggered), so we want to handle it similarly to a deletion. The main challenge here is auditing all of our existing `Binding` and `Scope` usages to understand whether they need to accommodate deletions or otherwise behave differently. If you look one commit back on this branch, you'll see that the code is littered with `NOTE(charlie)` comments that describe the reasoning behind changing (or not) each of those call sites. I've also augmented our test suite in preparation for this change over a few prior PRs. 
### Alternatives As an alternative, I considered introducing a flag to `BindingFlags`, like `BindingFlags::UNBOUND`, and setting that at the appropriate time. This turned out to be a much more difficult change, because we tend to match on `BindingKind` all over the place (e.g., we have a bunch of code blocks that only run when a `BindingKind` is `BindingKind::Importation`). As a result, introducing these new `BindingKind` variants requires only a few changes at the client sites. Adding a flag would've required a much wider-reaching change.	2023-06-14 09:27:24 -04:00
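The idea behind #5071, recording a deletion as a new binding rather than erasing the name, can be sketched with a toy model. This is an illustration only: `Kind`, `Binding`, and `Scope` below are hypothetical Python stand-ins, not Ruff's Rust types.

```python
from dataclasses import dataclass
from enum import Enum, auto


class Kind(Enum):
    ASSIGNMENT = auto()
    DELETION = auto()  # stands in for the idea of `BindingKind::Deletion`


@dataclass
class Binding:
    name: str
    kind: Kind


class Scope:
    def __init__(self):
        self.history = []  # every binding ever created, in order
        self.current = {}  # name -> index of its latest binding in `history`

    def bind(self, name, kind):
        self.history.append(Binding(name, kind))
        self.current[name] = len(self.history) - 1

    def is_bound(self, name):
        index = self.current.get(name)
        return index is not None and self.history[index].kind is not Kind.DELETION


scope = Scope()
scope.bind("x", Kind.ASSIGNMENT)  # x = 1
scope.bind("x", Kind.DELETION)    # del x
```

After the deletion, `x` resolves as unbound, yet the earlier assignment is still reachable through `scope.history`, which is the information a later rename or "dead scope" check would need.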
Charlie Marsh	fc6580592d	Use Expr::is_* methods at more call sites (#5075 )	2023-06-14 04:02:39 +00:00
Tom Kuson	4d9b0b925d	Add documentation to `flake8-executable` rules (#5063 ) ## Summary Completes the documentation for the `flake8-executable` rules. Related to #2646. ## Test Plan `python scripts/check_docs_formatted.py`	2023-06-14 01:31:06 +00:00
Charlie Marsh	0daeea1f42	Tweak exception-handler handling in AST visitor (#5069 )	2023-06-14 01:00:42 +00:00
Charlie Marsh	3f6584b74f	Fix erroneous kwarg reference (#5068 )	2023-06-14 00:01:52 +00:00
Charlie Marsh	c2fa568b46	Use dedicated structs for excepthandler variants (#5065 ) ## Summary Oversight from #5042.	2023-06-13 22:37:06 +00:00
Charlie Marsh	1895011ac2	Document some attributes on the semantic model (#5064 )	2023-06-13 20:45:24 +00:00
Charlie Marsh	364bd82aee	Don't treat annotations as resolved in forward references (#5060 ) ## Summary This behavior dates back to a Pyflakes commit (5fc37cbd), which was used to allow this test to pass: ```py from __future__ import annotations T: object def f(t: T): pass def g(t: 'T'): pass ``` But, I think this is an error. Mypy and Pyright don't accept it -- you can only use variables as type annotations if they're type aliases (i.e., annotated with `TypeAlias`), in which case, there has to be an assignment on the right-hand side (see: [PEP 613](https://peps.python.org/pep-0613/)).	2023-06-13 14:47:29 -04:00
Charlie Marsh	f9f08d6b03	Add a few more tests for deletion behaviors (#5058 )	2023-06-13 17:54:04 +00:00
Charlie Marsh	b0984a2868	Treat exception binding as explicit deletion (#5057 ) ## Summary This PR corrects a misunderstanding I had related to Python's handling of bound exceptions. Previously, I thought this code ran without error: ```py def f(): x = 1 try: 1 / 0 except Exception as x: pass print(x) ``` My understanding was that `except Exception as x` bound `x` within the `except` block, but then restored the `x = 1` binding after exiting the block. In practice, however, this raises an `UnboundLocalError`, because `x` becomes "unbound" after exiting the exception handler. It's similar to a `del` statement in this way. This PR removes our behavior to "restore" the previous binding. This could lead to faulty analysis in conditional blocks due to our lack of control flow analysis, but those same problems already exist for `del` statements.	2023-06-13 13:45:51 -04:00
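The behaviour described in #5057 is easy to confirm at runtime. A minimal sketch that wraps the example from the PR description so the error can be caught; the `outcome` variable is only for illustration:

```python
def f():
    x = 1
    try:
        1 / 0
    except Exception as x:
        pass
    # Python implicitly deletes `x` when the handler exits, so the earlier
    # `x = 1` binding is gone as well.
    return x


try:
    f()
except UnboundLocalError:
    outcome = "UnboundLocalError"
else:
    outcome = "returned"
```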
Charlie Marsh	a431dd0368	Respect all `__all__` definitions for docstring visibility (#5052 ) ## Summary We changed the semantics around `__all__` in #4885, but didn't update the docstring visibility code to match those changes.	2023-06-13 12:22:20 -04:00
Charlie Marsh	099a9152d1	Use `.is_unbound()` in flake8-errmsg fix (#5053 ) ## Summary Trying to bring some more consistency to these APIs as I look to change them to accommodate deletions.	2023-06-13 12:22:05 -04:00
Charlie Marsh	19f972a305	Use `Scope#has` in lieu of `Scope#get` (#5051 ) ## Summary These usages don't actually need the `BindingId`.	2023-06-13 15:59:53 +00:00
Thomas de Zeeuw	b0f89fa814	Support glob patterns in pep8_naming ignore-names (#5024 ) ## Summary Support glob patterns in pep8_naming ignore-names. Closes #2787 ## Test Plan Added new tests.	2023-06-13 17:37:13 +02:00
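As a rough illustration of what glob support in `ignore-names` means (#5024), the sketch below matches names against patterns with Python's `fnmatch`. This is an assumption-laden stand-in: the commit does not show Ruff's actual matcher, and both the `is_ignored` helper and the example patterns are hypothetical.

```python
from fnmatch import fnmatchcase


def is_ignored(name, ignore_names):
    """Return True if `name` matches any pattern (case-sensitive glob)."""
    return any(fnmatchcase(name, pattern) for pattern in ignore_names)


# Hypothetical configuration mixing an exact name with glob patterns.
ignore_names = ["setUp", "test_*", "visit_*"]

visit_ignored = is_ignored("visit_Name", ignore_names)
setup_ignored = is_ignored("setUp", ignore_names)
other_ignored = is_ignored("badName", ignore_names)
```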
Charlie Marsh	65312bad01	Remove unannotated attributes from RUF008 (#5049 ) ## Summary In a dataclass: ```py from dataclasses import dataclass @dataclass class X: class_var = {} x: int ``` `class_var` isn't actually a dataclass attribute, since it's unannotated. This PR removes such attributes from RUF008 (`mutable-dataclass-default`), but it does enforce them in RUF012 (`mutable-class-default`), since those should be annotated with `ClassVar` like any other mutable class attribute. Closes #5043.	2023-06-13 10:21:14 -04:00
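The claim in #5049 that an unannotated attribute is not a dataclass field can be verified with `dataclasses.fields`. A short sketch based on the class from the PR description:

```python
from dataclasses import dataclass, fields


@dataclass
class X:
    class_var = {}  # unannotated: a plain class attribute, not a field
    x: int


field_names = [f.name for f in fields(X)]

# The mutable default is shared by every instance, which is why it should
# still be flagged as a mutable class attribute.
a, b = X(1), X(2)
shared = a.class_var is b.class_var
```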
Aarni Koskela	7b4dde0c6c	Add JSON Lines (NDJSON) message serialization (#5048 ) ## Summary This adds `json-lines` (https://jsonlines.org/ or http://ndjson.org/) as an output format. I'm sure you already know, but * JSONL is more greppable (each record is a single line) than the pretty JSON * JSONL is faster to ingest piecewise (and/or in parallel) than JSON ## Test Plan Snapshot test in the new module :)	2023-06-13 14:15:55 +00:00
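To make the difference between pretty-printed JSON and JSON Lines from #5048 concrete, here is a small sketch using Python's `json` module. The record fields (`code`, `filename`, `row`) are made up for the example and are not taken from Ruff's actual output schema.

```python
import json

# Hypothetical diagnostics; the field names are assumptions for illustration.
diagnostics = [
    {"code": "F401", "filename": "a.py", "row": 1},
    {"code": "E501", "filename": "b.py", "row": 7},
]

# Pretty JSON: one document spread over many lines.
pretty = json.dumps(diagnostics, indent=2)

# JSON Lines: one complete JSON object per line.
json_lines = "\n".join(json.dumps(record) for record in diagnostics)

# Each line can be parsed on its own, so the output can be grepped or
# ingested piecewise without loading the whole document.
records = [json.loads(line) for line in json_lines.splitlines()]
```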
Thomas de Zeeuw	e1fd3965a2	Start with Upper case in error messages (#5045 ) ## Summary To be consistent with the format used by other errors. ## Test Plan N/A.	2023-06-13 13:14:45 +02:00
konstin	95ee6dcb3b	Add contributor docs to formatter (#5023 ) I've written down my condensed learnings from working on the formatter so that others can have an easier start working on it. This is a pure docs change.	2023-06-13 07:22:17 +00:00
Charlie Marsh	cc44349401	Use dedicated structs in `comparable.rs` (#5042 ) ## Summary Updating to match the updated AST structure, for consistency.	2023-06-13 03:57:34 +00:00
qdegraaf	a477720f4e	[`perflint`] Add `perflint` plugin, add first rule `PERF102` (#4821 ) ## Summary Adds boilerplate for implementing the [perflint](https://github.com/tonybaloney/perflint/) plugin, plus a first rule. ## Test Plan Fixture added for PER8102 ## Issue link Refers: https://github.com/charliermarsh/ruff/issues/4789	2023-06-13 01:54:44 +00:00
Charlie Marsh	be2fa6d217	Increase density of `Checker` arms (#5041 )	2023-06-13 01:08:23 +00:00
Charlie Marsh	cbd4c10fdd	Support 'reason' argument to `pytest.fail` (#5040 ) ## Summary Per the [API reference](https://docs.pytest.org/en/7.1.x/reference/reference.html#pytest.fail), `reason` was added in version 7, and is equivalent to `msg` (but preferred going forward). I also grepped for `msg` usages in `flake8_pytest_style`, but found no others (apart from those that reference `unittest` APIs.) Closes #3387.	2023-06-12 20:54:07 -04:00
Timofei Kukushkin	e2130707f5	Autofixer for ISC001 (#4853 ) ## Summary This PR adds autofixer for rule ISC001 in cases where both string literals are of the same kind and with same quotes (double / single). Fixes #4829 ## Test Plan I added testcases with different combinations of string literals.	2023-06-12 23:28:57 +00:00
Charlie Marsh	780336db0a	Include f-string prefixes in quote-stripping utilities (#5039 ) Mentioned here: https://github.com/astral-sh/ruff/pull/4853#discussion_r1217560348. Generated with this hacky script: https://gist.github.com/charliermarsh/8ecc4e55bc87d51dc27340402f33b348.	2023-06-12 18:25:47 -04:00
Charlie Marsh	7e37d8916c	Remove lexer dependency from identifier_range (#5036 ) ## Summary We run this quite a bit -- the new version is zero-allocation, though it's not quite as nice as the lexer we have in the formatter.	2023-06-12 22:06:03 +00:00
Charlie Marsh	ab11dd08df	Improve `TypedDict` conversion logic for shadowed builtins and dunder methods (#5038 ) ## Summary This PR (1) avoids flagging `TypedDict` and `NamedTuple` conversions when attributes are dunder methods, like `__dict__`, and (2) avoids flagging the `A003` shadowed-attribute rule for `TypedDict` classes at all, where it doesn't really apply (since those attributes are only accessed via subscripting anyway). Closes #5027.	2023-06-12 21:23:39 +00:00
Charlie Marsh	4080f36850	Handle decorators in class-parenthesis-modifying rules (#5034 ) ## Summary A few of our rules look at the parentheses that follow a class definition (e.g., `class Foo(object):`) and attempt to modify those parentheses. Neither of those rules were behaving properly in the presence of decorators, which were recently added to the statement range. ## Test Plan `cargo test` with a variety of new fixture tests.	2023-06-12 15:19:59 -04:00
Charlie Marsh	6d861743c8	Remove custom tests in `rules/ruff/mod.rs` (#5033 )	2023-06-12 18:54:04 +00:00
Charlie Marsh	54e103fc99	Add a rule to remove unnecessary parentheses in class definitions (#5032 ) Closes #2409.	2023-06-12 18:43:06 +00:00
Dhruv Manilawala	3470dee7d4	Add rule to disallow implicit optional with autofix (#4831 ) ## Summary Add rule to disallow implicit optional with autofix. Currently, I've added it under `RUF` category. ### Limitation Type aliases could result in false positive: ```python from typing import Optional StrOptional = Optional[str] def foo(arg: StrOptional = None): pass ``` ## Test Plan `cargo test` resolves: #1983 --------- Co-authored-by: Micha Reiser <micha@reiser.io> Co-authored-by: Charlie Marsh <charlie.r.marsh@gmail.com>	2023-06-12 18:12:10 +00:00
Dhruv Manilawala	cb4f086cbf	Add roundtrip support for Jupyter notebook (#5028 ) ## Summary Add roundtrip support for Jupyter notebook. 1. Read the notebook 2. Extract out the source code content 3. Use it to update the notebook itself (should be exactly the same [^1]) 4. Serialize into JSON and print it to stdout ## Test Plan `cargo run --all-features --bin ruff_dev --package ruff_dev -- round-trip <path/to/notebook.ipynb>` <details><summary>Example output:</summary> <p> ``` { "cells": [ { "cell_type": "markdown", "id": "f3c286e9-fa52-4440-816f-4449232f199a", "metadata": {}, "source": [ "# Ruff Test" ] }, { "cell_type": "markdown", "id": "a2b7bc6c-778a-4b07-86ae-dde5a2d9511e", "metadata": {}, "source": [ "Markdown block before the first import" ] }, { "cell_type": "code", "id": "5e3ef98e-224c-450a-80e6-be442ad50907", "metadata": { "tags": [] }, "source": "", "execution_count": 1, "outputs": [] }, { "cell_type": "code", "id": "6bced3f8-e0a4-450c-ae7c-f60ad5671ee9", "metadata": {}, "source": "import contextlib\n\nwith contextlib.suppress(ValueError):\n print()\n", "outputs": [] }, { "cell_type": "code", "id": "d7102cfd-5bb5-4f5b-a3b8-07a7b8cca34c", "metadata": {}, "source": "import random\n\nrandom.randint(10, 20)", "outputs": [] }, { "cell_type": "code", "id": "88471d1c-7429-4967-898f-b0088fcb4c53", "metadata": {}, "source": "foo = 1\nif foo < 2:\n msg = f\"Invalid foo: {foo}\"\n raise ValueError(msg)", "outputs": [] } ], "metadata": { "kernelspec": { "display_name": "Python (ruff-playground)", "name": "ruff-playground", "language": "python" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "pygments_lexer": "ipython3", "nbconvert_exporter": "python", "version": "3.11.3" } }, "nbformat": 4, "nbformat_minor": 5 } ``` </p> </details> [^1]: The type in JSON might be different (https://github.com/astral-sh/ruff/pull/4665#discussion_r1212663495) Part of #1218	2023-06-12 
23:27:45 +05:30
Charlie Marsh	a77d2df934	Split mutable-class-defaults rules into separate modules (#5031 )	2023-06-12 17:21:28 +00:00
Adam Pauls	638c18f007	Expand RUF008 to all classes, but to a new code (RUF012) (#4390 ) AFAIK, there is no reason to limit RUF008 to just dataclasses -- mutable defaults have the same problems for regular classes. Partially addresses https://github.com/charliermarsh/ruff/issues/4053 and broken out from https://github.com/charliermarsh/ruff/pull/4096. --------- Co-authored-by: Micha Reiser <micha@reiser.io> Co-authored-by: Charlie Marsh <charlie.r.marsh@gmail.com>	2023-06-12 16:54:27 +00:00
Addison Crump	70e6c212d9	Improve ruff_parse_simple to find UTF-8 violations (#5008 ) Improves the `ruff_parse_simple` fuzz harness by adding checks for parsed locations to ensure they all lie on UTF-8 character boundaries. This will allow for faster identification of issues like #5004. This also adds additional details for Apple M1 users and clarifies the importance of using `init-fuzzer.sh` (thanks for the feedback, @jasikpark 🙂).	2023-06-12 12:10:23 -04:00
Charlie Marsh	9db622afe1	Allow `Options`-to-`Settings` conversion to use `TryFrom` (#5025 ) ## Summary This avoids a bad `expect()` call in the `copyright` conversion. ## Test Plan `cargo test`	2023-06-12 15:31:50 +00:00
Thomas de Zeeuw	d3aa81a474	Suggest combining async with statements (#5022 ) ## Summary Previously the rule for SIM117 explicitly ignored `async with` statements as it would incorrectly suggest merging `async with` and regular `with` statements, as reported in issue #1902. This partially reverts the fix for that (commit `396be5edea`) by enabling the rule for `async with` statements again, but with a check ensuring that the statements are both of the same kind, i.e. both `async with` or both (just) `with` statements. Closes #3025 ## Test Plan Updated an existing test and added a new test case from #3025.	2023-06-12 16:33:18 +02:00
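As an illustration of the rewrite SIM117 suggests once both statements are `async with`, the following Python sketch compares the nested and combined forms. The `resource` context manager and the `events` list are made up for the example; the point is only that both forms enter and exit the managers in the same order.

```python
import asyncio
from contextlib import asynccontextmanager

events = []  # records enter/exit order for the demonstration


@asynccontextmanager
async def resource(name):
    """Hypothetical async resource that logs when it is entered and exited."""
    events.append(f"enter {name}")
    try:
        yield name
    finally:
        events.append(f"exit {name}")


async def nested():
    # The form SIM117 flags: two directly nested `async with` statements.
    async with resource("a") as a:
        async with resource("b") as b:
            return a + b


async def combined():
    # The suggested form: a single `async with` with both managers.
    async with resource("a") as a, resource("b") as b:
        return a + b


def run_and_trace(make_coroutine):
    """Run a coroutine function and return its result plus the event trace."""
    events.clear()
    result = asyncio.run(make_coroutine())
    return result, list(events)


print(run_and_trace(nested) == run_and_trace(combined))
```

Mixing `async with` and plain `with` cannot be merged this way, which is why the rule checks that both statements are of the same kind.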
Dhruv Manilawala	d8f5d2d767	Add support for auto-fix in Jupyter notebooks (#4665 ) ## Summary Add support for applying auto-fixes in Jupyter Notebook. ### Solution Cell offsets are the boundaries for each cell in the concatenated source code. They are represented using `TextSize`. They include the start and end offsets as well, thus creating a range for each cell. These offsets are updated using the `SourceMap` markers. ### SourceMap `SourceMap` contains markers constructed from each edit, which track the mapping from original source code positions to transformed positions. The following drawing might make it clear: ![SourceMap visualization](https://github.com/astral-sh/ruff/assets/67177269/3c94e591-70a7-4b57-bd32-0baa91cc7858) The center column, where the dotted lines are present, shows the markers included in the `SourceMap`. The `Notebook` looks at these markers and updates the cell offsets after each linter loop. If you look closely, the destination takes into account all of the markers before it. The index is constructed only when required as it's only used to render the diagnostics. So, a `OnceCell` is used for this purpose. The cell offsets, cell content and the index will be updated after each iteration of linting in the mentioned order. The order is important here as the content is updated as per the new offsets and the index is updated as per the new content. ## Limitations ### 1 Styling rules such as the ones in `pycodestyle` will not be applicable everywhere in Jupyter notebook, especially at the cell boundaries. Let's take an example where a rule suggests having 2 blank lines before a function and the cells contain the following code: ```python import something # --- def first(): pass def second(): pass ``` (Again, the comment is only to visualize cell boundaries.) In the concatenated source code, the 2 blank lines will be added, but they shouldn't actually be added when we look at it in terms of the Jupyter notebook. It's as if the function `first` is at the start of a file. 
`nbqa` solves this by recording newlines before and after running `autopep8`, then running the tool and restoring the newlines at the end (refer https://github.com/nbQA-dev/nbQA/pull/807). ## Test Plan Three commands were run in order with common flags (`--select=ALL --no-cache --isolated`) to isolate which stage the problem is occurring: 1. Only diagnostics 2. Fix with diff (`--fix --diff`) 3. Fix (`--fix`) ### https://github.com/facebookresearch/segment-anything ``` ------------------------------------------------------------------------------- Jupyter Notebooks 3 0 0 0 0 \|- Markdown 3 98 0 94 4 \|- Python 3 513 468 4 41 (Total) 611 468 98 45 ------------------------------------------------------------------------------- ``` ```console $ cargo run --all-features --bin ruff -- check --no-cache --isolated --select=ALL /path/to/segment-anything/*/.ipynb --fix ... Found 180 errors (89 fixed, 91 remaining). ``` ### https://github.com/openai/openai-cookbook ``` ------------------------------------------------------------------------------- Jupyter Notebooks 65 0 0 0 0 \|- Markdown 64 3475 12 2507 956 \|- Python 65 9700 7362 1101 1237 (Total) 13175 7374 3608 2193 =============================================================================== ``` ```console $ cargo run --all-features --bin ruff -- check --no-cache --isolated --select=ALL /path/to/openai-cookbook/*/.ipynb --fix error: Failed to parse /path/to/openai-cookbook/examples/vector_databases/Using_vector_databases_for_embeddings_search.ipynb:cell 4:29:18: unexpected token '-' ... Found 4227 errors (2165 fixed, 2062 remaining). 
``` ### https://github.com/tensorflow/docs ``` ------------------------------------------------------------------------------- Jupyter Notebooks 150 0 0 0 0 \|- Markdown 1 55 0 46 9 \|- Python 1 402 289 60 53 (Total) 457 289 106 62 ------------------------------------------------------------------------------- ``` ```console $ cargo run --all-features --bin ruff -- check --no-cache --isolated --select=ALL /path/to/tensorflow-docs/*/.ipynb --fix error: Failed to parse /path/to/tensorflow-docs/site/en/guide/extension_type.ipynb:cell 80:1:1: unexpected token Indent error: Failed to parse /path/to/tensorflow-docs/site/en/r1/tutorials/eager/custom_layers.ipynb:cell 20:1:1: unexpected token Indent error: Failed to parse /path/to/tensorflow-docs/site/en/guide/data.ipynb:cell 175:5:14: unindent does not match any outer indentation level error: Failed to parse /path/to/tensorflow-docs/site/en/r1/tutorials/representation/unicode.ipynb:cell 30:1:1: unexpected token Indent ... Found 12726 errors (5140 fixed, 7586 remaining). ``` ### https://github.com/tensorflow/models ``` ------------------------------------------------------------------------------- Jupyter Notebooks 46 0 0 0 0 \|- Markdown 1 11 0 6 5 \|- Python 1 328 249 19 60 (Total) 339 249 25 65 ------------------------------------------------------------------------------- ``` ```console $ cargo run --all-features --bin ruff -- check --no-cache --isolated --select=ALL /path/to/tensorflow-models/*/.ipynb --fix ... Found 4856 errors (2690 fixed, 2166 remaining). ``` resolves: #1218 fixes: #4556	2023-06-12 14:14:15 +00:00
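The cell-offset idea described in the commit above can be sketched in Python. This is a simplified illustration, not the `Notebook` implementation: the helper names are hypothetical, offsets are counted in characters rather than as `TextSize` byte offsets, and the sketch assumes cells are joined with a single newline.

```python
from bisect import bisect_right


def cell_offsets(cells):
    """Return boundaries of each cell in the concatenated source.

    The list has len(cells) + 1 entries: it includes the start (0) and the
    end offset, so consecutive entries form the range of one cell.
    """
    offsets = [0]
    for source in cells:
        # +1 accounts for the newline assumed to terminate each cell.
        offsets.append(offsets[-1] + len(source) + 1)
    return offsets


def cell_for_offset(offsets, position):
    """Map a position in the concatenated source back to a cell index."""
    return bisect_right(offsets, position) - 1


cells = ["import something", "def first():\n    pass"]
offsets = cell_offsets(cells)
print(offsets, cell_for_offset(offsets, offsets[1]))
```

After a fix changes the length of one cell, every boundary after the edit has to shift by the same amount, which is the bookkeeping the `SourceMap` markers provide in the real implementation.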
konstin	e586c27590	Format ExprTuple (#4963 ) This implements formatting ExprTuple, including magic trailing comma. I intentionally didn't change the settings mechanism but just added a dummy global const flag. Besides the snapshots, I added custom breaking/joining tests and a deeply nested test case. The diffs look better than previously, proper black compatibility depends on parentheses handling. --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-06-12 12:55:47 +00:00
Thomas de Zeeuw	8161757229	[flake8-pyi] Implement PYI044 (#5021 ) ## Summary This implements PYI044. This rule checks if `from __future__ import annotations` is used in stub files as it has no effect in stub files, since type checkers automatically treat stubs as having those semantics. Updates https://github.com/astral-sh/ruff/issues/848 ## Test Plan Added a test case and snapshots.	2023-06-12 13:20:16 +02:00
Charlie Marsh	6a5f317362	Use `use::*` for rule re-exports (#5018 )	2023-06-12 00:32:45 +00:00
Dhruv Manilawala	c3d1fa851e	Ignore pyproject.toml for adding noqa directives (#5013 ) ## Summary Ignore pyproject.toml file for adding noqa directives using `--add-noqa` ## Test Plan `cargo run --bin ruff -- check --add-noqa .` fixes: #5012	2023-06-11 20:21:24 -04:00
Charlie Marsh	eac3a0cc3d	Update CONTRIBUTING.md guide (#5017 )	2023-06-12 00:20:59 +00:00
Charlie Marsh	68b6d30c46	Use consistent `Cargo.toml` metadata in all crates (#5015 )	2023-06-12 00:02:40 +00:00
Ryan Yang	ab3c02342b	Implement copyright notice detection (#4701 ) ## Summary Add copyright notice detection to enforce the presence of copyright headers in Python files. Configurable settings include: the relevant regular expression, the author name, and the minimum file size, similar to [flake8-copyright](https://github.com/savoirfairelinux/flake8-copyright). Closes https://github.com/charliermarsh/ruff/issues/3579 --------- Signed-off-by: ryan <ryang@waabi.ai> Co-authored-by: Charlie Marsh <charlie.r.marsh@gmail.com>	2023-06-11 02:17:58 +00:00
Trevor Gross	9f7cc86a22	Add more details to E722 (bare-except) docs (#5007 ) ## Summary Note that catching a bare `Exception` is better than catching no specific exception. ## Test Plan Documentation only.	2023-06-10 18:42:43 -04:00
Charlie Marsh	445e1723ab	Use `Stmt::parse` in lieu of `Suite` unwraps (#5002 )	2023-06-10 04:55:31 +00:00
Charlie Marsh	42c8054268	Implement autofix for revised `RET504` rule (#4999 ) ## Summary This PR enables autofix for the revised `RET504` rule, by changing: ```py def f(): x = 1 return x ``` ...to: ```py def f(): return 1 ``` Closes #2263. Closes #2788.	2023-06-10 04:32:03 +00:00
Charlie Marsh	2d597bc1fb	Parenthesize expressions prior to lexing in F632 (#5001 )	2023-06-10 04:23:43 +00:00
Charlie Marsh	7275c16d98	Extend revised `RET504` implementation to `with` statements (#4998 ) ## Summary This PR extends the new `RET504` implementation to handle cases like: ```py def foo(): with open("foo.txt", "r") as f: x = f.read() return x ``` This was originally suggested in https://github.com/astral-sh/ruff/issues/2950#issuecomment-1433441503.	2023-06-10 04:15:35 +00:00
Charlie Marsh	02b8ce82af	Refactor `RET504` to only enforce assignment-then-return pattern (#4997 ) ## Summary The `RET504` rule, which looks for unnecessary assignments before return statements, is a frequent source of issues (#4173, #4236, #4242, #1606, #2950). Over time, we've tried to refine the logic to handle more cases. For example, we now avoid analyzing any functions that contain any function calls or attribute assignments, since those operations can contain side effects (and so we mark them as a "read" on all variables in the function -- we could do a better job with code graph analysis to handle this limitation, but that'd be a more involved change.) We also avoid flagging any variables that are the target of multiple assignments. Ultimately, though, I'm not happy with the implementation -- we just can't do sufficiently reliable analysis of arbitrary code flow given the limited logic herein, and the existing logic is very hard to reason about and maintain. This PR refocuses the rule to only catch cases of the form: ```py def f(): x = 1 return x ``` That is, we now only flag returns that are immediately preceded by an assignment to the returned variable. While this is more limiting, in some ways, it lets us flag more cases vis-a-vis the previous implementation, since we no longer "fully eject" when functions contain function calls and other effect-ful operations. Closes #4173. Closes #4236. Closes #4242.	2023-06-10 00:05:01 -04:00
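The narrowed pattern described above (a return immediately preceded by an assignment to the returned variable) can be sketched with Python's `ast` module. This is an illustration only, not the Rust implementation in Ruff: the function name is hypothetical, and the sketch inspects only `body` statement lists, so it does not cover the `with` extension from #4998.

```python
import ast


def assignments_before_return(source: str):
    """Find `x = ...` immediately followed by `return x`.

    Returns (variable name, line number of the return) pairs. Only `body`
    lists are scanned; `orelse` and `finalbody` blocks are ignored here.
    """
    hits = []
    for node in ast.walk(ast.parse(source)):
        body = getattr(node, "body", None)
        if not isinstance(body, list):
            continue  # e.g. Lambda/IfExp have an expression as `body`
        for previous, statement in zip(body, body[1:]):
            if (
                isinstance(previous, ast.Assign)
                and len(previous.targets) == 1
                and isinstance(previous.targets[0], ast.Name)
                and isinstance(statement, ast.Return)
                and isinstance(statement.value, ast.Name)
                and statement.value.id == previous.targets[0].id
            ):
                hits.append((previous.targets[0].id, statement.lineno))
    return hits


flagged = "def f():\n    x = 1\n    return x\n"
not_flagged = "def g():\n    x = 1\n    print(x)\n    return x\n"
print(assignments_before_return(flagged), assignments_before_return(not_flagged))
```

Because only adjacent statements are compared, a function call elsewhere in the function no longer forces the check to give up, which matches the trade-off described in the commit message.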
Charlie Marsh	5abb8ec0dc	Use Python whitespace utilities in `ruff_textwrap` (#4996 ) ## Summary This change was intended to be included in #4994, but was somehow dropped.	2023-06-10 02:32:42 +00:00
Charlie Marsh	f401050878	Introduce `PythonWhitespace` to confine trim operations to Python whitespace (#4994 ) ## Summary We use `.trim()` and friends in a bunch of places, to strip whitespace from source code. However, not all Unicode whitespace characters are considered "whitespace" in Python, which only supports the standard space, tab, and form-feed characters. This PR audits our usages of `.trim()`, `.trim_start()`, `.trim_end()`, and `char::is_whitespace`, and replaces them as appropriate with new `.trim_whitespace()` analogues, powered by a `PythonWhitespace` trait. In general, the only place that should continue to use `.trim()` is content within docstrings, which don't need to adhere to Python's semantic definitions of whitespace.  Closes #4991.	2023-06-09 21:44:50 -04:00
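The distinction drawn above can be shown with a small Python analogue. The Rust trait itself is not reproduced here; `trim_python_whitespace` is a hypothetical stand-in that strips only the space, tab, and form-feed characters named in the commit message, whereas `str.strip()` with no argument also removes other Unicode whitespace such as a no-break space.

```python
# The three characters the commit message lists as Python whitespace.
PYTHON_WHITESPACE = " \t\x0c"


def trim_python_whitespace(text: str) -> str:
    """Strip only space, tab, and form feed from both ends of `text`."""
    return text.strip(PYTHON_WHITESPACE)


sample = "\u00a0x \t"  # starts with a no-break space (U+00A0)

# Generic Unicode-aware trimming removes the no-break space too.
generic = sample.strip()

# The restricted trim keeps it, because U+00A0 is not in the set above.
restricted = trim_python_whitespace(sample)

print(repr(generic), repr(restricted))
```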
Charlie Marsh	c1ac50093c	Use super visibility in helpers (#4995 )	2023-06-10 01:23:13 +00:00
Charlie Marsh	1d756dc3a7	Move Python whitespace utilities into new `ruff_python_whitespace` crate (#4993 ) ## Summary `ruff_newlines` becomes `ruff_python_whitespace`, and includes the existing "universal newline" handlers alongside the Python whitespace-specific utilities.	2023-06-10 00:59:57 +00:00
Charlie Marsh	e86f12a1ec	Rename some methods on `SemanticModel` (#4990 )	2023-06-09 19:36:59 +00:00
Charlie Marsh	5c502a3320	Add documentation for `BindingKind` variants (#4989 )	2023-06-09 18:32:50 +00:00
Micha Reiser	901bcb6f21	Fix line numbers in source frames (#4984 )	2023-06-09 17:21:18 +02:00
Micha Reiser	111e1f93ca	perf(formatter): Skip bodies without comments (#4978 )	2023-06-09 11:33:57 +02:00
Micha Reiser	68d52da43b	Track formatted comments (#4979 )	2023-06-09 09:09:45 +00:00
Micha Reiser	646ab64850	Fix binary expression formatting with leading comments (#4964 )	2023-06-09 09:02:50 +00:00
Micha Reiser	1accbeffd6	Format `if` statements (#4961 )	2023-06-09 10:55:14 +02:00
Charlie Marsh	16d1e63a5e	Respect 'is not' operators split across newlines (#4977 )	2023-06-09 05:07:45 +00:00
Charlie Marsh	d647105e97	Support concatenated string key removals (#4976 )	2023-06-09 04:56:35 +00:00
Davide Canton	63fdcea29e	Handled dict and set inside f-string (#4249 ) (#4563 )	2023-06-09 04:53:13 +00:00
qdegraaf	2bb32ee943	[`flake8-slots`] Add plugin, add `SLOT000`, `SLOT001` and `SLOT002` (#4909 )	2023-06-09 04:14:16 +00:00
rodjunger	ee1f094834	[`ruff`] Add a rule for static keys in dict comprehensions (#4929 )	2023-06-09 02:06:34 +00:00
Tom Kuson	efd8f3bdab	Complete `flake8-simplify` documentation (#4930 )	2023-06-09 02:02:41 +00:00
Charlie Marsh	293889a352	Support concatenated literals in format-literals (#4974 )	2023-06-09 01:29:19 +00:00
Tom Kuson	2c19000e4a	Add Pylint rule `comparison-with-itself` (`R0124`) (#4957 )	2023-06-09 00:57:50 +00:00
Charlie Marsh	aba073a791	Upgrade explicit-type-conversion rule (`RUF010`) to remove unnecessary `str` calls (#4971 )	2023-06-08 20:02:57 +00:00
Charlie Marsh	d042eddccc	Remove `unwrap` from none-comparison rule (#4969 )	2023-06-08 18:21:56 +00:00
Charlie Marsh	775d247731	Allow private accesses within special dunder methods (#4968 )	2023-06-08 17:36:49 +00:00
Charlie Marsh	58d08219e8	Allow re-assignments to `__all__` (#4967 )	2023-06-08 17:19:56 +00:00
Charlie Marsh	902c4e7d77	Make SIM118 a suggested fix (#4966 )	2023-06-08 17:02:42 +00:00
Micha Reiser	68969240c5	Format Function definitions (#4951 )	2023-06-08 16:07:33 +00:00
Dhruv Manilawala	07cc4bcb0f	Update links to point to Astral org (#4949 )	2023-06-08 11:43:40 -04:00
Micha Reiser	9c3fb23ace	Simple lexer for formatter (#4922 )	2023-06-08 17:37:39 +02:00
konstin	467df23e65	Implement StmtReturn (#4960 ) * Implement StmtPass This implements StmtPass as `pass`. The snapshot diff is small because pass mainly occurs in bodies and function (#4951) and if/for bodies. * Implement StmtReturn This implements StmtReturn as `return` or `return {value}`. The snapshot diff is small because return occurs in functions (#4951)	2023-06-08 16:29:39 +02:00
konstin	c8442e91ce	Implement StmtPass (#4959 ) This implements StmtPass as `pass`. The snapshot diff is small because pass mainly occurs in bodies and function (#4951) and if/for bodies.	2023-06-08 16:29:27 +02:00
Micha Reiser	6bef347a8e	Trailing own line comments before func or class (#4921 )	2023-06-08 12:50:25 +00:00
Micha Reiser	c1cc6f3be1	Add basic Constant formatting (#4954 )	2023-06-08 11:42:44 +00:00
Micha Reiser	83cf6d6e2f	Implement Binary expression without `best_fitting` (#4952 )	2023-06-08 12:45:03 +02:00
konstin	23abad0bd5	A basic StmtAssign formatter and better dummies for expressions (#4938 ) * A basic StmtAssign formatter and better dummies for expressions The goal of this PR was formatting StmtAssign since many nodes in the black tests (and in python in general) are after an assignment. This caused unstable formatting: the spacing around the power operator depends on the types of the two involved expressions, but each expression was formatted as a dummy string and re-parsed as an ExprName, so in the second round the different rules of ExprName were applied, causing unstable formatting. This PR does not necessarily bring us closer to black's style, but it unlocks a good porting of black's test suite and is a basis for implementing the Expr nodes. * fmt * Review	2023-06-08 12:20:25 +02:00
konstin	651d89794c	Use phf for confusables to reduce llvm lines (#4926 ) * Use phf for confusables to reduce llvm lines ## Summary This replaces FxHashMap for the confusables with a perfect hash map from the [phf crate](https://github.com/rust-phf/rust-phf) to reduce the generated llvm instructions. A perfect hash function is one that doesn't have any collisions. We can build one because we know all keys at compile time. This improves hashmap efficiency, even though this is likely not noticeable in our case (unless someone has a large non-English crate to test on). The original hashmap contained a lot of duplicates, which I had to remove when phf_map complained; I did so by sorting the keys. The important part is that it reduces the llvm instructions generated (#3808, `RUSTFLAGS="-Csymbol-mangling-version=v0" cargo llvm-lines -p ruff --lib | head -20`): ``` Lines Copies Function name ----- ------ ------------- 1740502 38973 (TOTAL) 27423 (1.6%, 1.6%) 1 (0.0%, 0.0%) ruff[cef4c65d96248843]::rules::ruff::rules::confusables::CONFUSABLES::{closure#0} 10193 (0.6%, 2.2%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::codes::RuleCodePrefix>::iter 8107 (0.5%, 2.6%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::codes::Rule>::noqa_code 7345 (0.4%, 3.0%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::checkers::ast::Checker as ruff_python_ast[3778b140caf21545]::visitor::Visitor>::visit_stmt 6412 (0.4%, 3.4%) 1 (0.0%, 0.0%) <<ruff[cef4c65d96248843]::settings::options::Options as serde[d89b1b632568f5a3]::de::Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]::de::Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]::de::spanned::SpannedDeserializer<toml_edit[7e3a6c5e67260672]::de::value::ValueDeserializer>> 6412 (0.4%, 3.8%) 1 (0.0%, 0.0%) <<ruff[cef4c65d96248843]::settings::options::Options as serde[d89b1b632568f5a3]::de::Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]::de::Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]::de::table::TableMapAccess> 6409 (0.4%, 4.2%) 1 (0.0%, 0.0%) 
<<ruff[cef4c65d96248843]::settings::options::Options as serde[d89b1b632568f5a3]::de::Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]::de::Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]::de::datetime::DatetimeDeserializer> 5696 (0.3%, 4.5%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::checkers::ast::Checker as ruff_python_ast[3778b140caf21545]::visitor::Visitor>::visit_expr 4448 (0.3%, 4.7%) 1 (0.0%, 0.0%) ruff[cef4c65d96248843]::flake8_to_ruff::converter::convert 3702 (0.2%, 4.9%) 1 (0.0%, 0.0%) <&ruff[cef4c65d96248843]::registry::Linter as core[da82827a87f140f9]::iter::traits::collect::IntoIterator>::into_iter 3349 (0.2%, 5.1%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::registry::Linter>::code_for_rule 3132 (0.2%, 5.3%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::codes::Rule as core[da82827a87f140f9]::fmt::Debug>::fmt 3130 (0.2%, 5.5%) 1 (0.0%, 0.0%) <&str as core[da82827a87f140f9]::convert::From<&ruff[cef4c65d96248843]::codes::Rule>>::from 3130 (0.2%, 5.7%) 1 (0.0%, 0.0%) <&str as core[da82827a87f140f9]::convert::From<ruff[cef4c65d96248843]::codes::Rule>>::from 3130 (0.2%, 5.9%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::codes::Rule as core[da82827a87f140f9]::convert::AsRef<str>>::as_ref 3128 (0.2%, 6.0%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::codes::RuleIter>::get 2669 (0.2%, 6.2%) 1 (0.0%, 0.0%) <<ruff[cef4c65d96248843]::settings::options::Options as serde[d89b1b632568f5a3]::de::Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]::de::Visitor>::visit_seq::<toml_edit[7e3a6c5e67260672]::de::array::ArraySeqAccess> ``` After: ``` Lines Copies Function name ----- ------ ------------- 1710487 38900 (TOTAL) 10193 (0.6%, 0.6%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::codes::RuleCodePrefix>::iter 8107 (0.5%, 1.1%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::codes::Rule>::noqa_code 7345 (0.4%, 1.5%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::checkers::ast::Checker as ruff_python_ast[5588cd60041c8605]::visitor::Visitor>::visit_stmt 6412 (0.4%, 1.9%) 1 (0.0%, 0.0%) 
<<ruff[52408f46d2058296]::settings::options::Options as serde[d89b1b632568f5a3]::de::Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]::de::Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]::de::spanned::SpannedDeserializer<toml_edit[7e3a6c5e67260672]::de::value::ValueDeserializer>> 6412 (0.4%, 2.2%) 1 (0.0%, 0.0%) <<ruff[52408f46d2058296]::settings::options::Options as serde[d89b1b632568f5a3]::de::Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]::de::Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]::de::table::TableMapAccess> 6409 (0.4%, 2.6%) 1 (0.0%, 0.0%) <<ruff[52408f46d2058296]::settings::options::Options as serde[d89b1b632568f5a3]::de::Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]::de::Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]::de::datetime::DatetimeDeserializer> 5696 (0.3%, 3.0%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::checkers::ast::Checker as ruff_python_ast[5588cd60041c8605]::visitor::Visitor>::visit_expr 4448 (0.3%, 3.2%) 1 (0.0%, 0.0%) ruff[52408f46d2058296]::flake8_to_ruff::converter::convert 3702 (0.2%, 3.4%) 1 (0.0%, 0.0%) <&ruff[52408f46d2058296]::registry::Linter as core[da82827a87f140f9]::iter::traits::collect::IntoIterator>::into_iter 3349 (0.2%, 3.6%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::registry::Linter>::code_for_rule 3132 (0.2%, 3.8%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::codes::Rule as core[da82827a87f140f9]::fmt::Debug>::fmt 3130 (0.2%, 4.0%) 1 (0.0%, 0.0%) <&str as core[da82827a87f140f9]::convert::From<&ruff[52408f46d2058296]::codes::Rule>>::from 3130 (0.2%, 4.2%) 1 (0.0%, 0.0%) <&str as core[da82827a87f140f9]::convert::From<ruff[52408f46d2058296]::codes::Rule>>::from 3130 (0.2%, 4.4%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::codes::Rule as core[da82827a87f140f9]::convert::AsRef<str>>::as_ref 3128 (0.2%, 4.5%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::codes::RuleIter>::get 2669 (0.2%, 4.7%) 1 (0.0%, 0.0%) <<ruff[52408f46d2058296]::settings::options::Options as 
serde[d89b1b632568f5a3]::de::Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]::de::Visitor>::visit_seq::<toml_edit[7e3a6c5e67260672]::de::array::ArraySeqAccess> 2659 (0.2%, 4.9%) 1 (0.0%, 0.0%) <&ruff[52408f46d2058296]::codes::Pylint as core[da82827a87f140f9]::iter::traits::collect::IntoIterator>::into_iter ``` I'd assume this has a positive effect both on compile time and on runtime, but I don't know the actual effect on compile times and can't really measure it. ## Test plan Check CI for any performance regressions. This should fix #3808 if we merge it. * clippy * Update update_ambiguous_characters.py	2023-06-08 08:13:20 +02:00
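The statement that a perfect hash function can be built because all keys are known ahead of time can be illustrated with a brute-force seed search in Python. This is a toy sketch and not the algorithm the phf crate uses: the helper names are hypothetical, the hash is a salted SHA-256, and the sample keys are placeholders rather than Ruff's confusables table.

```python
import hashlib


def _slot(key: str, seed: int, size: int) -> int:
    """Map a key to a table slot using a seed-dependent hash."""
    digest = hashlib.sha256(f"{seed}:{key}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % size


def build_perfect_hash(mapping: dict, max_seed: int = 100_000):
    """Search for a seed under which every key lands in its own slot.

    Works only because the full key set is known up front. Returns the
    seed and a table of (key, value) pairs with exactly one key per slot.
    """
    keys = list(mapping)
    size = len(keys)
    for seed in range(max_seed):
        if len({_slot(key, seed, size) for key in keys}) == size:
            table = [None] * size
            for key in keys:
                table[_slot(key, seed, size)] = (key, mapping[key])
            return seed, table
    raise ValueError("no collision-free seed found within max_seed")


def lookup(key: str, seed: int, table: list):
    """Single-probe lookup: compute the slot, then compare the stored key."""
    stored_key, value = table[_slot(key, seed, len(table))]
    return value if stored_key == key else None


# Placeholder data for the demonstration.
sample = {"alpha": "a", "beta": "b", "gamma": "g", "delta": "d", "epsilon": "e"}
seed, table = build_perfect_hash(sample)
print(lookup("gamma", seed, table), lookup("omega", seed, table))
```

Since a Python `dict` cannot hold duplicate keys, the duplicate-key problem mentioned in the commit message does not arise in this sketch; a list of pairs with repeated keys could never be placed without collisions.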
Micha Reiser	39a1f3980f	Upgrade RustPython (#4900 )	2023-06-08 05:53:14 +00:00
Charlie Marsh	4b78141f6b	Generate one fix per statement for flake8-type-checking rules (#4915 )	2023-06-07 22:22:35 -04:00
Charlie Marsh	5235977abc	Bump version to 0.0.272 (#4948 )	2023-06-08 02:17:29 +00:00
kyoto7250	01d3d4bbd2	ignore if using infinite iterators in `B905` (#4914 )	2023-06-08 02:12:50 +00:00
Charlie Marsh	ac4a4da50e	Handle implicit string concatenations in conversion-flag rewrites (#4947 )	2023-06-08 02:04:35 +00:00
Charlie Marsh	a6d269f263	Apply `dict.get` fix before ternary rewrite (#4944 )	2023-06-07 22:33:40 +00:00
Charlie Marsh	f17282d615	Skip class scopes when resolving nonlocal references (#4943 )	2023-06-07 22:25:36 +00:00
Dhruv Manilawala	6950c93934	Make `C413` fix as suggested for `reversed` call (#4891 )	2023-06-07 18:23:19 -04:00
Charlie Marsh	ae75b303f0	Avoid attributing runtime references to module-level imports (#4942 )	2023-06-07 21:56:03 +00:00
Charlie Marsh	20240fc3d9	Move flake8-fixme rules to FIX prefix (#4917 )	2023-06-07 21:14:49 +00:00
Micha Reiser	bcf745c5ba	Replace verbatim text with `NOT_YET_IMPLEMENTED` (#4904 ) ## Summary This PR replaces the `verbatim_text` builder with a `not_yet_implemented` builder that emits `NOT_YET_IMPLEMENTED_<NodeKind>` for not yet implemented nodes. The motivation for this change is that partially formatting compound statements can result in incorrectly indented code, which is a syntax error: ```python def func_no_args(): a; b; c if True: raise RuntimeError if False: ... for i in range(10): print(i) continue ``` Gets reformatted to ```python def func_no_args(): a; b; c if True: raise RuntimeError if False: ... for i in range(10): print(i) continue ``` because our formatter does not yet support `for` statements and just inserts the text from the source. ## Downsides Using an identifier will not work in all situations. For example, an identifier is invalid in an `Arguments` position. That's why I kept `verbatim_text` around and e.g. use it in the `Arguments` formatting logic where incorrect indentations are impossible (to my knowledge). Meaning, we can opt in to `verbatim_text` when we want to iterate quickly on nodes for which we don't want to provide a full implementation yet and where using an identifier would be invalid. ## Upsides Running this on main uncovered stability issues with the newline handling that were previously "hidden" because of the verbatim formatting. I guess that's an upside :) ## Test Plan None?	2023-06-07 14:57:25 +02:00
Addison Crump	2f125f4019	Create fuzzers for testing correctness of parsing, linting and fixing (#4822 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-06-07 14:57:07 +02:00
Micha Reiser	6ab3fc60f4	Correctly handle newlines after/before comments (#4895 ) ## Summary This PR fixes the removal of empty lines between a leading comment and the previous statement: ```python a = 20 # leading comment b = 10 ``` Ruff removed the empty line between `a` and `b` because: * The leading comments formatting does not preserve leading newlines (to avoid adding new lines at the top of a body) * The `JoinNodesBuilder` counted the lines before `b`, which is 1 -> Doesn't insert a new line This is fixed by changing the `JoinNodesBuilder` to instead count the lines after the last node. This correctly gives 1, and the `# leading comment` will insert the empty lines between any other leading comment or the node. ## Test Plan I added a new test for empty lines.	2023-06-07 14:49:43 +02:00
Charlie Marsh	ec609f5c3b	Clarify requires-python inference requirements (#4918 )	2023-06-07 04:18:56 +00:00
Justin Prieto	b9060ea2bd	[`flake8-pyi`] Implement PYI050 (#4884 )	2023-06-07 01:56:53 +00:00
Charlie Marsh	b56a799417	Add some more test coverage for `del` statements (#4913 )	2023-06-06 21:40:23 -04:00
Charlie Marsh	780d153ae8	Replace one-off locals property with `ScopeFlags` (#4912 )	2023-06-06 21:22:21 -04:00
Tom Kuson	7cc205b5d6	Change `iteration-over-set` to flag set literals only (#4907 )	2023-06-06 21:06:46 +00:00
Charlie Marsh	2a6d7cd71c	Avoid no-op fix for nested with expressions (#4906 )	2023-06-06 20:15:21 +00:00
Charlie Marsh	2b5fb70482	Bump version to 0.0.271 (#4890 )	2023-06-06 15:11:48 -04:00
Charlie Marsh	8c048b463c	Track symbol deletions separately from bindings (#4888 )	2023-06-06 18:49:36 +00:00
Micha Reiser	19abee086b	Introduce `AnyFunctionDefinition` Node (#4898 )	2023-06-06 20:37:46 +02:00

1408 Commits