Python/ruff - ruff - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Micha Reiser	8665a1a19d	Pass `FormatContext` to `NeedsParentheses` <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary I started working on this because I assumed that I would need access to options inside of `NeedsParantheses` but it then turned out that I won't. Anyway, it kind of felt nice to pass fewer arguments. So I'm gonna put this out here to get your feedback if you prefer this over passing individual fiels. Oh, I sneeked in another change. I renamed `context.contents` to `source`. `contents` is too generic and doesn't tell you anything. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan It compiles	2023-07-11 14:28:50 +02:00
Micha Reiser	9a8ba58b4c	Remove `mode` from `BestFitting` <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR removes the `mode` field from `BestFitting` because it is no longer used (we now use `conditional_group` and `fits_expanded). <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan `cargo test` <!-- How was it tested? -->	2023-07-11 14:19:26 +02:00
Micha Reiser	715250a179	Prefer expanding parenthesized expressions before operands <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR implements Black's behavior where it first splits off parenthesized expressions before splitting before operands to avoid unnecessary parentheses: ```python # We want if a + [ b, c ]: pass # Rather than if ( a + [b, c] ): pass ``` This is implemented by using the new IR elements introduced in #5596. * We give the group wrapping the optional parentheses an ID (`parentheses_id`) * We use `conditional_group` for the lower priority groups (all non-parenthesized expressions) with the condition that the `parentheses_id` group breaks (we want to split before operands only if the parentheses are necessary) * We use `fits_expanded` to wrap all other parenthesized expressions (lists, dicts, sets), to prevent that expanding e.g. a list expands the `parentheses_id` group. We gate the `fits_expand` to only apply if the `parentheses_id` group fits (because we prefer `a\n+[b, c]` over expanding `[b, c]` if the whole expression gets parenthesized). We limit using `fits_expanded` and `conditional_group` only to expressions that themselves are not in parentheses (checking the conditions isn't free) ## Test Plan It increases the Jaccard index for Django from 0.915 to 0.917 ## Incompatibilites There are two incompatibilities left that I'm aware of (there may be more, I didn't go through all snapshot differences). ### Long string literals I commented on the regression. The issue is that a very long string (or any content without a split point) may not fit when only breaking the right side. The formatter than inserts the optional parentheses. But this is kind of useless because the overlong string will still not fit, because there are no new split points. I think we should ignore this incompatibility for now ### Expressions on statement level I don't fully understand the logic behind this yet, but black doesn't break before the operators for the following example even though the expression exceeds the configured line width ```python aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa < bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb > ccccccccccccccccccccccccccccc == ddddddddddddddddddddd ``` But it would if the expression is used inside of a condition. What I understand so far is that Black doesn't insert optional parentheses on the expression statement level (and a few other places) and, therefore, only breaks after opening parentheses. I propose to keep this deviation for now to avoid overlong-lines and use the compatibility report to make a decision if we should implement the same behavior.	2023-07-11 14:07:39 +02:00
Micha Reiser	d30e9125eb	Extend formatter IR to support Black's expression formatting (#5596 )	2023-07-11 11:20:04 +00:00
konsti	212fd86bf0	Switch from jaccard index to similarity index (#5679 ) ## Summary The similarity index, the fraction of unchanged lines, is easier to understand than the jaccard index, the fraction between intersection and union. ## Test Plan I ran this on django and git a 0.945 index, meaning 5.5% of lines are currently reformatted when compared to black	2023-07-11 13:03:44 +02:00
David Szotten	4b58a9c092	formatter: tidy: list_comp is an expression, not a statement (#5677 )	2023-07-11 08:00:10 +00:00
konsti	b7794f855b	Format StmtAugAssign (#5655 ) ## Summary Format statements such as `tree_depth += 1`. This is a statement that does not allow any line breaks, the only thing to be mindful of is to parenthesize the assigned expression Jaccard index on django: 0.915 -> 0.918 ## Test Plan black tests, and two new tests, a basic one and one that ensures that the child gets parentheses. I ran the django stability check.	2023-07-11 09:06:23 +02:00
Chris Pryer	15c7b6bcf7	Format `delete` statement (#5169 )	2023-07-11 08:36:26 +02:00
David Szotten	1782fb8c30	format ExprListComp (#5600 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-07-11 06:35:51 +00:00
Micha Reiser	987111f5fb	Format `ExpressionStarred` nodes (#5654 )	2023-07-11 06:08:08 +00:00
Charlie Marsh	9f486fa841	[`flake8-bugbear`] Implement `re-sub-positional-args` (`B034`) (#5669 ) ## Summary Needed to do some coding to end the day. Closes #5665.	2023-07-11 03:52:55 +00:00
Charlie Marsh	4dee49d6fa	Run nightly Clippy over the Ruff repo (#5670 ) ## Summary This is the result of running `cargo +nightly clippy --workspace --all-targets --all-features -- -D warnings` and fixing all violations. Just wanted to see if there were any interesting new checks on nightly 👀	2023-07-10 23:44:38 -04:00
Louis Dispa	e7e2f44440	Format `raise` statement (#5595 ) ## Summary This PR implements the formatting of `raise` statements. I haven't looked at the black implementation, this is inspired from from the `return` statements formatting. ## Test Plan The black differences with insta. I also compared manually some edge cases with very long string and call chaining and it seems to do the same formatting as black. There is one issue: ```python # input raise OsError( "aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa" ) from a.aaaaa(aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa).a(aaaa) # black raise OsError( "aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa" ) from a.aaaaa( aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa ).a( aaaa ) # ruff raise OsError( "aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa" ) from a.aaaaa( aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa ).a(aaaa) ``` But I'm not sure this diff is the raise formatting implementation. --------- Co-authored-by: Louis Dispa <ldispa@deezer.com>	2023-07-10 21:23:49 +02:00
monosans	14f2158e5d	[`flake8-self`] Ignore `_name_` and `_value_` (#5663 ) ## Summary `Enum._name_` and `Enum._value_` are so named to prevent conflicts. See <https://docs.python.org/3/library/enum.html#supported-sunder-names>. ## Test Plan Tests for `ignore-names` already exist.	2023-07-10 14:52:59 -04:00
Tom Kuson	b8a6ce43a2	Properly ignore bivariate types in `type-name-incorrect-variance` (#5660 ) ## Summary #5658 didn't actually ignore bivariate types in some all cases (sorry about that). This PR fixes that and adds bivariate types to the test fixture. ## Test Plan `cargo test`	2023-07-10 14:19:17 -04:00
Tom Kuson	5ab9538573	Improve `type-name-incorrect-variance` message (#5658 ) ## Summary Change the `type-name-incorrect-variance` diagnostic message to include the detected variance and a name change recommendation. For example, ``` `TypeVar` name "T_co" does not reflect its contravariance; consider renaming it to "T_contra" ``` Related to #5651. ## Test Plan `cargo test`	2023-07-10 13:33:37 -04:00
Zanie	d19839fe0f	Add support for `Union` declarations without `\|` to PYI016 (#5598 ) Previously, PYI016 only supported reporting violations for unions defined with `\|`. Now, union declarations with `typing.Union` are supported.	2023-07-10 17:11:54 +00:00
Charlie Marsh	120e9d37f1	Audit some `SemanticModel#is_builtin` usages (#5659 ) ## Summary Non-behavior-changing refactors to delay some `.is_builtin` calls in a few older rules. Cheaper pre-conditions should always be checked first.	2023-07-10 13:10:08 -04:00
Evan Rittenhouse	28fe2d334a	Implement `UnnecessaryListAllocationForFirstElement` (#5549 ) ## Summary Fixes #5503. Ready for final review as the `mkdocs` issue involving SSH keys is fixed. Note that this will only throw on a `Name` - it will be refactorable once we have a type-checker. This means that this is the only sort of input that will throw. ```python x = range(10) list(x)[0] ``` I thought it'd be confusing if we supported direct function results. Consider this example, assuming we support direct results: ```python # throws list(range(10))[0] def createRange(bound): return range(bound) # "why doesn't this throw, but a direct `range(10)` call does?" list(createRange(10))[0] ``` If it's necessary, I can go through the list of built-ins and find those which produce iterables, then add them to the throwing list. ## Test Plan Added a new fixture, then ran `cargo t`	2023-07-10 16:32:41 +00:00
Tom Kuson	3562d809b2	[`pylint`] Implement Pylint `typevar-name-incorrect-variance` (`C0105`) (#5651 ) ## Summary Implement Pylint `typevar-name-incorrect-variance` (`C0105`) as `type-name-incorrect-variance` (`PLC0105`). Includes documentation. Related to #970. The Pylint implementation checks only `TypeVar`, but this PR checks `ParamSpec` as well. ## Test Plan Added test fixture. `cargo test`	2023-07-10 12:28:44 -04:00
Tom Kuson	4cac75bc27	Add documentation to `pandas-vet` rules (#5629 ) ## Summary Completes all the documentation for the `pandas-vet` rules, except for `pandas-use-of-dot-read-table` as I am unclear of the rule's motivation (see #5628). Related to #2646. ## Test Plan `python scripts/check_docs_formatted.py && mkdocs serve`	2023-07-10 15:45:36 +00:00
Charlie Marsh	ed872145fe	Always allow PEP 585 and PEP 604 rewrites in stub files (#5653 ) Closes https://github.com/astral-sh/ruff/issues/5640.	2023-07-10 14:51:38 +00:00
Charlie Marsh	35b04c2fab	Skip flake8-future-annotations checks in stub files (#5652 ) Closes https://github.com/astral-sh/ruff/issues/5649.	2023-07-10 10:49:17 -04:00
Evan Rittenhouse	ae4a7ef0ed	Make TRY301 trigger only if a `raise` throws a caught exception (#5455 ) ## Summary Fixes #5246. We generate a hash set of all exception IDs caught by the `try` statement, then check that the inner `raise` actually raises a caught exception. ## Test Plan Added a new test, `cargo t`.	2023-07-10 10:00:43 -04:00
konsti	cab3a507bc	Fix find_only_token_in_range with expression parentheses (#5645 ) ## Summary Fix an oversight in `find_only_token_in_range` where the following code would panic due do the closing and opening parentheses being in the range we scan: ```python d1 = [ ("a") if # 1 ("b") else # 2 ("c") ] ``` Closing and opening parentheses respectively are now correctly skipped. ## Test Plan I added a regression test	2023-07-10 15:55:19 +02:00
Harutaka Kawamura	82317ba1fd	Support autofix for some multiline `str.format` calls (#5638 ) ## Summary Fixes #5531 ## Test Plan New test cases	2023-07-10 09:49:13 -04:00
Aarni Koskela	24bcbb85a1	Rework upstream categories so we can `all_rules()` (#5591 ) ## Summary This PR reworks the `upstream_categories` mechanism that is only used for documentation purposes to make it easier to generate docs using `all_rules()`. The new implementation also relies on "tribal knowledge" about rule codes, so it's not the best implementation, but gets us forward. Another option would be to change the rule-defining proc macros to allow configuring an optional `RuleCategory`, but that seems more heavy-handed and possibly unnecessary in the long run... Draft since this builds on #5439. cc @charliermarsh :)	2023-07-10 09:41:26 -04:00
Micha Reiser	089a671adb	Fix Black compatible snapshot deletion (#5646 )	2023-07-10 15:00:18 +02:00
konsti	bd8f65814c	Format named expressions (walrus operator) (#5642 ) ## Summary Format named expressions (walrus operator) such a `value := f()`. Unlike tuples, named expression parentheses are not part of the range even when mandatory, so mapping optional parentheses to always gives us decent formatting without implementing all [PEP 572](https://peps.python.org/pep-0572/) rules on when we need parentheses where other expressions wouldn't. We might want to revisit this decision later and implement special cases, but for now this gives us what we need. ## Test Plan black fixtures, i added some fixtures and checked django and cpython for stability. Closes #5613	2023-07-10 12:32:15 +00:00
David Szotten	1e894f328c	formatter: multi char tokens in SimpleTokenizer (#5610 )	2023-07-10 09:00:59 +01:00
Charlie Marsh	c9d7c0d7d5	Add a link to the nursery; tweak icons (#5637 ) ## Summary We now always render the icons, but very faintly if inactive, and always right-align. This ensures consistent alignment as you scroll down the page: <img width="1792" alt="Screen Shot 2023-07-09 at 10 45 50 PM" src="https://github.com/astral-sh/ruff/assets/1309177/da47ac0e-d646-49e1-bbe1-9f43adf94bb4">	2023-07-10 03:09:08 +00:00
Charlie Marsh	27011448ea	Fix typo in complex-if-statement-in-stub message (#5635 )	2023-07-10 02:35:34 +00:00
Aarni Koskela	b4d6b7c230	docs: show nursery icon for nursery rules (#5439 ) ## Summary This changes the docs to show a nursery icon (🌅) for rules in the nursery. It currently doesn't do that for the rules that are in sub-categories (Pylint, Pycodestyle) because there is no `all_rules()` for the `RuleCodePrefix` that's returned by `UpstreamCategory` iteration (and as mentioned on Discord, I think `UpstreamCategory` maybe shouldn't be a thing). (That would be enabled by #5591.) ## Test Plan Generated docs to see new icons (with the caveat above).	2023-07-09 22:24:57 -04:00
Charlie Marsh	fa1341b0db	Improve PERF203 example in docs (#5634 ) Closes #5624.	2023-07-10 02:24:46 +00:00
Charlie Marsh	401d172e47	Use a simple match statement for case-insensitive noqa lookup (#5633 ) ## Summary It turns out that just doing this match directly without `AhoCorasick` is much faster, like 2x (and removes one dependency, though we likely already rely on this transitively).	2023-07-09 22:15:23 -04:00
Dhruv Manilawala	6a4b216362	Avoid `PERF401` if conditional depends on list var (#5603 ) ## Summary Avoid `PERF401` if conditional depends on list var ## Test Plan `cargo test` fixes: #5581	2023-07-09 15:53:27 -04:00
Tom Kuson	ac2e374a5a	Add `tkinter` import convention (#5626 ) ## Summary Adds `import tkinter as tk` to the list of default import conventions. Closes #5620. ## Test Plan Added `tkinter` to test fixture. `cargo test`	2023-07-09 16:26:31 +05:30
Charlie Marsh	38fa305f35	Refactor isort directive skips to use iterators (#5623 ) ## Summary We're doing some unsafe accesses to advance these iterators. It's easier to model these as actual iterators to ensure safety everywhere. Also added some additional test cases. Closes #5621.	2023-07-08 19:05:44 +00:00
Charlie Marsh	456273a92e	Support individual codes on `# flake8: noqa` directives (#5618 ) ## Summary We now treat `# flake8: noqa: F401` as turning off F401 for the entire file. (Flake8 treats this as turning off _all rules_ for the entire file). This deviates from Flake8, but I think it's a much more user-friendly deviation than what I introduced in #5571. See https://github.com/astral-sh/ruff/issues/5617 for an explanation. Closes https://github.com/astral-sh/ruff/issues/5617.	2023-07-08 16:51:37 +00:00
Charlie Marsh	507961f27d	Emit warnings for invalid `# noqa` directives (#5571 ) ## Summary This PR adds a `ParseError` type to the `noqa` parsing system to enable us to render useful warnings instead of silently failing when parsing `noqa` codes. For example, given `foo.py`: ```python # ruff: noqa: x # ruff: noqa foo # flake8: noqa: F401 import os # noqa: foo-bar ``` We would now output: ```console warning: Invalid `# noqa` directive on line 2: expected a comma-separated list of codes (e.g., `# noqa: F401, F841`). warning: Invalid `# noqa` directive on line 4: expected `:` followed by a comma-separated list of codes (e.g., `# noqa: F401, F841`). warning: Invalid `# noqa` directive on line 6: Flake8's blanket exemption does not support exempting specific codes. To exempt specific codes, use, e.g., `# ruff: noqa: F401, F841` instead. warning: Invalid `# noqa` directive on line 7: expected a comma-separated list of codes (e.g., `# noqa: F401, F841`). ``` There's one important behavior change here too. Right now, with Flake8, if you do `# flake8: noqa: F401`, Flake8 treats that as equivalent to `# flake8: noqa` -- it turns off _all_ diagnostics in the file, not just `F401`. Historically, we respected this... but, I think it's confusing. So we now raise a warning, and don't respect it at all. This will lead to errors in some projects, but I'd argue that right now, those directives are almost certainly behaving in an unintended way for users anyway. Closes https://github.com/astral-sh/ruff/issues/3339.	2023-07-08 16:37:55 +00:00
Charlie Marsh	a1c559eaa4	Only run pyproject.toml lint rules when enabled (#5578 ) ## Summary I was testing some changes on Airflow, and I realized that we _always_ run the `pyproject.toml` validation rules, even if they're not enabled. This PR gates them behind the appropriate enablement flags. ## Test Plan - Ran: `cargo run -p ruff_cli -- check ../airflow -n`. Verified that no RUF200 violations were raised. - Run: `cargo run -p ruff_cli -- check ../airflow -n --select RUF200`. Verified that two RUF200 violations were raised.	2023-07-08 11:05:05 -04:00
konsti	d0dae7e576	Fix CI by downgrading to cargo insta 1.29.0 (#5589 ) Since the (implicit) update to cargo-insta 1.30, CI would pass even when the tests failed. This downgrades to cargo insta 1.29.0 and CI fails again when it should (which i can't show here, because CI needs to pass to merge this PR). I've improved the unreferenced snapshot handling in the process See https://github.com/mitsuhiko/insta/issues/392	2023-07-08 14:54:49 +00:00
Dimitri Papadopoulos Orfanos	efe7c393d1	Fix typos found by codespell (#5607 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary Fix typos found by [codespell](https://github.com/codespell-project/codespell). I have left out `memoize` for now (see #5606). <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan CI tests. <!-- How was it tested? -->	2023-07-08 12:33:18 +02:00
konsti	0b9af031fb	Format ExprIfExp (ternary operator) (#5597 ) ## Summary Format `ExprIfExp`, also known as the ternary operator or inline `if`. It can look like ```python a1 = 1 if True else 2 ``` but also ```python b1 = ( # We return "a" ... "a" # that's our True value # ... if this condition matches ... if True # that's our test # ... otherwise we return "b§ else "b" # that's our False value ) ``` This also fixes a visitor order bug. The jaccard index on django goes from 0.911 to 0.915. ## Test Plan I added fixtures without and with comments in strange places.	2023-07-07 19:11:52 +00:00
konsti	0f9d7283e7	Add format-dev contributor docs (#5594 ) ## Summary This adds markdown-level docs for #5492 ## Test Plan n/a --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-07-07 16:52:13 +00:00
Zanie	bb7303f867	Implement PYI030: Unnecessary literal union (#5570 ) Implements PYI030 as part of https://github.com/astral-sh/ruff/issues/848 > Union expressions should never have more than one Literal member, as Literal[1] \| Literal[2] is semantically identical to Literal[1, 2]. Note we differ slightly from the flake8-pyi implementation: - We detect cases where there are parentheses or nested unions - We detect cases with mixed `Union` and `\|` syntax - We use the same error message for all violations; flake8-pyi has two different messages - We retain the user's quoting style when displaying string literals; flake8-pyi uses single quotes - We warn on duplicates of the same literal `Literal[1] \| Literal[1]`	2023-07-07 16:43:10 +00:00
konsti	60d318ddcf	Check formatter stability on CI (#5446 ) Check formatter stability on CI using CPython. This should be merged into the ecosystem checks, but i think this is a good start.	2023-07-07 18:28:36 +02:00
Charlie Marsh	5640c310bb	Move file-level rule exemption to lexer-based approach (#5567 ) ## Summary In addition to `# noqa` codes, we also support file-level exemptions, which look like: - `# flake8: noqa` (ignore all rules in the file, for compatibility) - `# ruff: noqa` (all rules in the file) - `# ruff: noqa: F401` (ignore `F401` in the file, Flake8 doesn't support this) This PR moves that logic to something that looks a lot more like our `# noqa` parser. Performance is actually quite a bit _worse_ than the previous approach (lexing `# flake8: noqa` goes from 2ns to 11ns; lexing `# ruff: noqa: F401, F841` is about the same`; lexing `# type: ignore # noqa: E501` fgoes from 4ns to 6ns), but the numbers are very small so it's... maybe worth it? The primary benefit here is that we now properly support flexible whitespace, like: `#flake8:noqa`. Previously, we required exact string matching, and we also didn't support all case-insensitive variants of `noqa`.	2023-07-07 15:41:20 +00:00
Peter Attia	aaab9f1597	Bugfix: Remove version numbers from pypi links (#5579 ) ## Summary There are two pypi links in the documentation that link to specific version numbers of other packages. Removing these versioned links allows users to immediately view the latest version of the package and maintains consistency with the other links. ## Test Plan N/A	2023-07-07 09:35:50 -04:00
konsti	b22e6c3d38	Extend ruff_dev formatter script to compute statistics and format a project (#5492 ) ## Summary This extends the `ruff_dev` formatter script util. Instead of only doing stability checks, you can now choose different compatible options on the CLI and get statistics. * It adds an option the formats all files that ruff would check to allow looking at an entire black-formatted repository with `git diff` * It computes the [Jaccard index](https://en.wikipedia.org/wiki/Jaccard_index) as a measure of deviation between input and output, which is useful as single number metric for assessing our current deviations from black. * It adds progress bars to both the single projects as well as the multi-project mode. * It adds an option to write the multi-project output to a file Sample usage: ``` $ cargo run --bin ruff_dev -- format-dev --stability-check crates/ruff/resources/test/cpython $ cargo run --bin ruff_dev -- format-dev --stability-check /home/konsti/projects/django Syntax error in /home/konsti/projects/django/tests/test_runner_apps/tagged/tests_syntax_error.py: source contains syntax errors (parser error): BaseError { error: UnrecognizedToken(Name { name: "syntax_error" }, None), offset: 131, source_path: "<filename>" } Found 0 stability errors in 2755 files (jaccard index 0.911) in 9.75s $ cargo run --bin ruff_dev -- format-dev --write /home/konsti/projects/django ``` Options: ``` Several utils related to the formatter which can be run on one or more repositories. The selected set of files in a repository is the same as for `ruff check`. * Check formatter stability: Format a repository twice and ensure that it looks that the first and second formatting look the same. * Format: Format the files in a repository to be able to check them with `git diff` * Statistics: The subcommand the Jaccard index between the (assumed to be black formatted) input and the ruff formatted output Usage: ruff_dev format-dev [OPTIONS] [FILES]... Arguments: [FILES]... Like `ruff check`'s files. See `--multi-project` if you want to format an ecosystem checkout Options: --stability-check Check stability We want to ensure that once formatted content stays the same when formatted again, which is known as formatter stability or formatter idempotency, and that the formatter prints syntactically valid code. As our test cases cover only a limited amount of code, this allows checking entire repositories. --write Format the files. Without this flag, the python files are not modified --format <FORMAT> Control the verbosity of the output [default: default] Possible values: - minimal: Filenames only - default: Filenames and reduced diff - full: Full diff and invalid code -x, --exit-first-error Print only the first error and exit, `-x` is same as pytest --multi-project Checks each project inside a directory, useful e.g. if you want to check all of the ecosystem checkouts --error-file <ERROR_FILE> Write all errors to this file in addition to stdout. Only used in multi-project mode ``` ## Test Plan I ran this on django (2755 files, jaccard index 0.911) and discovered a magic trailing comma problem and that we really needed to implement import formatting. I ran the script on cpython to identify https://github.com/astral-sh/ruff/pull/5558.	2023-07-07 11:30:12 +00:00
Micha Reiser	40ddc1604c	Introduce `parenthesized` helper (#5565 )	2023-07-07 11:28:25 +02:00
Charlie Marsh	bf4b96c5de	Differentiate between runtime and typing-time annotations (#5575 ) ## Summary In Python, the annotations on `x` and `y` here have very different treatment: ```python def foo(x: int): y: int ``` The `int` in `x: int` is a runtime-required annotation, because `x` gets added to the function's `__annotations__`. You'll notice, for example, that this fails: ```python from typing import TYPE_CHECKING if TYPE_CHECKING: from foo import Bar def f(x: Bar): ... ``` Because `Bar` is required to be available at runtime, not just at typing time. Meanwhile, this succeeds: ```python from typing import TYPE_CHECKING if TYPE_CHECKING: from foo import Bar def f(): x: Bar = 1 f() ``` (Both cases are fine if you use `from __future__ import annotations`.) Historically, we've tracked those annotations that are _not_ runtime-required via the semantic model's `ANNOTATION` flag. But annotations that _are_ runtime-required have been treated as "type definitions" that aren't annotations. This causes problems for the flake8-future-annotations rules, which try to detect whether adding `from __future__ import annotations` would _allow_ you to rewrite a type annotation. We need to know whether we're in _any_ type annotation, runtime-required or not, since adding `from __future__ import annotations` will convert any runtime-required annotation to a typing-only annotation. This PR adds separate state to track these runtime-required annotations. The changes in the test fixtures are correct -- these were false negatives before. Closes https://github.com/astral-sh/ruff/issues/5574.	2023-07-07 00:21:44 -04:00
Charlie Marsh	b11492e940	Fix remaining Copyright rule references (#5577 )	2023-07-07 02:49:19 +00:00
Tom Kuson	5908b39102	Support globbing in `isort` options (#5473 ) ## Summary Support glob patterns in `isort` options. Closes #5420. ## Test Plan Added test. `cargo test`	2023-07-06 20:37:41 -04:00
konsti	5e5a96ca28	Fix formatter `StmtTry` test (#5568 ) For some reason this didn't turn up on CI before CC @michareiser this is the fix for the error you had	2023-07-06 18:23:53 +00:00
Tom Kuson	3650aaa8b3	Add documentation to the `S1XX` rules (#5479 ) ## Summary Add documentation to the `S1XX` rules (the `flake8-bandit` ['misc tests'](https://bandit.readthedocs.io/en/latest/plugins/index.html#plugin-id-groupings) rule group). ## Test Plan `python scripts/check_docs_formatted.py && mkdocs serve`	2023-07-06 17:46:16 +00:00
Charlie Marsh	cc822082a7	Refactor `noqa` directive parsing away from regex-based implementation (#5554 ) ## Summary I'll write up a more detailed description tomorrow, but in short, this PR removes our regex-based implementation in favor of "manual" parsing. I tried a couple different implementations. In the benchmarks below: - `Directive/Regex` is our implementation on `main`. - `Directive/Find` just uses `text.find("noqa")`, which is insufficient, since it doesn't cover case-insensitive variants like `NOQA`, and doesn't handle multiple `noqa` matches in a single like, like ` # Here's a noqa comment # noqa: F401`. But it's kind of a baseline. - `Directive/Memchr` uses three `memchr` iterative finders (one for `noqa`, `NOQA`, and `NoQA`). - `Directive/AhoCorasick` is roughly the variant checked-in here. The raw results: ``` Directive/Regex/# noqa: F401 time: [273.69 ns 274.71 ns 276.03 ns] change: [+1.4467% +1.8979% +2.4243%] (p = 0.00 < 0.05) Performance has regressed. Found 15 outliers among 100 measurements (15.00%) 3 (3.00%) low mild 8 (8.00%) high mild 4 (4.00%) high severe Directive/Find/# noqa: F401 time: [66.972 ns 67.048 ns 67.132 ns] change: [+2.8292% +2.9377% +3.0540%] (p = 0.00 < 0.05) Performance has regressed. Found 15 outliers among 100 measurements (15.00%) 1 (1.00%) low severe 3 (3.00%) low mild 8 (8.00%) high mild 3 (3.00%) high severe Directive/AhoCorasick/# noqa: F401 time: [76.922 ns 77.189 ns 77.536 ns] change: [+0.4265% +0.6862% +0.9871%] (p = 0.00 < 0.05) Change within noise threshold. Found 8 outliers among 100 measurements (8.00%) 1 (1.00%) low mild 3 (3.00%) high mild 4 (4.00%) high severe Directive/Memchr/# noqa: F401 time: [62.627 ns 62.654 ns 62.679 ns] change: [-0.1780% -0.0887% -0.0120%] (p = 0.03 < 0.05) Change within noise threshold. Found 11 outliers among 100 measurements (11.00%) 1 (1.00%) low severe 5 (5.00%) low mild 3 (3.00%) high mild 2 (2.00%) high severe Directive/Regex/# noqa: F401, F841 time: [321.83 ns 322.39 ns 322.93 ns] change: [+8602.4% +8623.5% +8644.5%] (p = 0.00 < 0.05) Performance has regressed. Found 5 outliers among 100 measurements (5.00%) 1 (1.00%) low severe 2 (2.00%) low mild 1 (1.00%) high mild 1 (1.00%) high severe Directive/Find/# noqa: F401, F841 time: [78.618 ns 78.758 ns 78.896 ns] change: [+1.6909% +1.8771% +2.0628%] (p = 0.00 < 0.05) Performance has regressed. Found 3 outliers among 100 measurements (3.00%) 3 (3.00%) high mild Directive/AhoCorasick/# noqa: F401, F841 time: [87.739 ns 88.057 ns 88.468 ns] change: [+0.1843% +0.4685% +0.7854%] (p = 0.00 < 0.05) Change within noise threshold. Found 11 outliers among 100 measurements (11.00%) 5 (5.00%) low mild 3 (3.00%) high mild 3 (3.00%) high severe Directive/Memchr/# noqa: F401, F841 time: [80.674 ns 80.774 ns 80.860 ns] change: [-0.7343% -0.5633% -0.4031%] (p = 0.00 < 0.05) Change within noise threshold. Found 14 outliers among 100 measurements (14.00%) 4 (4.00%) low severe 9 (9.00%) low mild 1 (1.00%) high mild Directive/Regex/# noqa time: [194.86 ns 195.93 ns 196.97 ns] change: [+11973% +12039% +12103%] (p = 0.00 < 0.05) Performance has regressed. Found 6 outliers among 100 measurements (6.00%) 5 (5.00%) low mild 1 (1.00%) high mild Directive/Find/# noqa time: [25.327 ns 25.354 ns 25.383 ns] change: [+3.8524% +4.0267% +4.1845%] (p = 0.00 < 0.05) Performance has regressed. Found 9 outliers among 100 measurements (9.00%) 6 (6.00%) high mild 3 (3.00%) high severe Directive/AhoCorasick/# noqa time: [34.267 ns 34.368 ns 34.481 ns] change: [+0.5646% +0.8505% +1.1281%] (p = 0.00 < 0.05) Change within noise threshold. Found 5 outliers among 100 measurements (5.00%) 5 (5.00%) high mild Directive/Memchr/# noqa time: [21.770 ns 21.818 ns 21.874 ns] change: [-0.0990% +0.1464% +0.4046%] (p = 0.26 > 0.05) No change in performance detected. Found 10 outliers among 100 measurements (10.00%) 4 (4.00%) low mild 4 (4.00%) high mild 2 (2.00%) high severe Directive/Regex/# type: ignore # noqa: E501 time: [278.76 ns 279.69 ns 280.72 ns] change: [+7449.4% +7469.8% +7490.5%] (p = 0.00 < 0.05) Performance has regressed. Found 3 outliers among 100 measurements (3.00%) 1 (1.00%) low mild 1 (1.00%) high mild 1 (1.00%) high severe Directive/Find/# type: ignore # noqa: E501 time: [67.791 ns 67.976 ns 68.184 ns] change: [+2.8321% +3.1735% +3.5418%] (p = 0.00 < 0.05) Performance has regressed. Found 6 outliers among 100 measurements (6.00%) 5 (5.00%) high mild 1 (1.00%) high severe Directive/AhoCorasick/# type: ignore # noqa: E501 time: [75.908 ns 76.055 ns 76.210 ns] change: [+0.9269% +1.1427% +1.3955%] (p = 0.00 < 0.05) Change within noise threshold. Found 1 outliers among 100 measurements (1.00%) 1 (1.00%) high severe Directive/Memchr/# type: ignore # noqa: E501 time: [72.549 ns 72.723 ns 72.957 ns] change: [+1.5881% +1.9660% +2.3974%] (p = 0.00 < 0.05) Performance has regressed. Found 15 outliers among 100 measurements (15.00%) 10 (10.00%) high mild 5 (5.00%) high severe Directive/Regex/# type: ignore # nosec time: [66.967 ns 67.075 ns 67.207 ns] change: [+1713.0% +1715.8% +1718.9%] (p = 0.00 < 0.05) Performance has regressed. Found 10 outliers among 100 measurements (10.00%) 1 (1.00%) low severe 3 (3.00%) low mild 2 (2.00%) high mild 4 (4.00%) high severe Directive/Find/# type: ignore # nosec time: [18.505 ns 18.548 ns 18.597 ns] change: [+1.3520% +1.6976% +2.0333%] (p = 0.00 < 0.05) Performance has regressed. Found 4 outliers among 100 measurements (4.00%) 4 (4.00%) high mild Directive/AhoCorasick/# type: ignore # nosec time: [16.162 ns 16.206 ns 16.252 ns] change: [+1.2919% +1.5587% +1.8430%] (p = 0.00 < 0.05) Performance has regressed. Found 4 outliers among 100 measurements (4.00%) 3 (3.00%) high mild 1 (1.00%) high severe Directive/Memchr/# type: ignore # nosec time: [39.192 ns 39.233 ns 39.276 ns] change: [+0.5164% +0.7456% +0.9790%] (p = 0.00 < 0.05) Change within noise threshold. Found 13 outliers among 100 measurements (13.00%) 2 (2.00%) low severe 4 (4.00%) low mild 3 (3.00%) high mild 4 (4.00%) high severe Directive/Regex/# some very long comment that # is interspersed with characters but # no directive time: [81.460 ns 81.578 ns 81.703 ns] change: [+2093.3% +2098.8% +2104.2%] (p = 0.00 < 0.05) Performance has regressed. Found 4 outliers among 100 measurements (4.00%) 2 (2.00%) low mild 2 (2.00%) high mild Directive/Find/# some very long comment that # is interspersed with characters but # no directive time: [26.284 ns 26.331 ns 26.387 ns] change: [+0.7554% +1.1027% +1.3832%] (p = 0.00 < 0.05) Change within noise threshold. Found 6 outliers among 100 measurements (6.00%) 5 (5.00%) high mild 1 (1.00%) high severe Directive/AhoCorasick/# some very long comment that # is interspersed with characters but # no direc... time: [28.643 ns 28.714 ns 28.787 ns] change: [+1.3774% +1.6780% +2.0028%] (p = 0.00 < 0.05) Performance has regressed. Found 2 outliers among 100 measurements (2.00%) 2 (2.00%) high mild Directive/Memchr/# some very long comment that # is interspersed with characters but # no directive time: [55.766 ns 55.831 ns 55.897 ns] change: [+1.5802% +1.7476% +1.9021%] (p = 0.00 < 0.05) Performance has regressed. Found 2 outliers among 100 measurements (2.00%) 2 (2.00%) low mild ``` While memchr is faster than aho-corasick in some of the common cases (like `# noqa: F401`), the latter is way, way faster when there _isn't_ a match (like 2x faster -- see the last two cases). Since most comments _aren't_ `noqa` comments, this felt like the right tradeoff. Note that all implementations are significantly faster than the regex version. (I know I originally reported a 10x speedup, but I ended up improving the regex version a bit in some prior PRs, so it got unintentionally faster via some refactors.) There's also one behavior change in here, which is that we now allow variable spaces, e.g., `#noqa` or `# noqa`. Previously, we required exactly one space. This thus closes #5177.	2023-07-06 16:03:10 +00:00
Charlie Marsh	9713ee4b80	Remove `ParsedFileExemption::None` (#5555 ) ## Summary This is more aligned with the other enums in this module. Should've been changed in a previous refactor, just an oversight.	2023-07-06 11:15:46 -04:00
konsti	8184235f93	Try statements have a body: Fix formatter instability (#5558 ) ## Summary The following code was previously leading to unstable formatting: ```python try: try: pass finally: print(1) # issue7208 except A: pass ``` The comment would be formatted as a trailing comment of `try` which is unstable as an end-of-line comment gets two extra whitespaces. This was originally found in `99b00efd5e/Lib/getpass.py (L68-L91)` ## Test Plan I added a regression test	2023-07-06 16:07:47 +02:00
Charlie Marsh	bf02c77fd7	Replace stat mapping with match statement (#5548 )	2023-07-05 23:42:21 +00:00
Charlie Marsh	ba7041b6bf	Remove Directive's dependency on Locator (#5547 ) ## Summary It's a bit simpler to let the API just take the text itself, plus an offset (to make the returned `TextRange` absolute, rather than relative).	2023-07-05 23:33:57 +00:00
Charlie Marsh	5dff3195d4	Refactor tokens-based rules to take an `&mut Vec<Diagnostic>` (#5525 )	2023-07-05 19:21:42 -04:00
Charlie Marsh	23363cafd1	Move `Directive` fields behind accessor methods (#5546 )	2023-07-05 23:13:41 +00:00
Charlie Marsh	e4596ebc35	Remove leading and trailing space length from `Directive` (#5545 ) ## Summary We only need this in one place (when removing the directive), and it simplifies a lot of details to just compute it there.	2023-07-05 23:03:06 +00:00
Charlie Marsh	c9e02c52a8	Add separate configuration for MkDocs Insiders plugins (#5544 ) ## Summary This PR adds a separate configuration file to enable us to turn on [Insiders-only plugins](https://squidfunk.github.io/mkdocs-material/insiders/getting-started/#built-in-plugins). I've turned on the `typeset` plugin which ensures that the settings on the left-hand navigation pane render as code: <img width="1792" alt="Screen Shot 2023-07-05 at 6 27 20 PM" src="https://github.com/astral-sh/ruff/assets/1309177/c93676dd-bb48-417a-9d3b-528bf001e9b7">	2023-07-05 18:40:21 -04:00
Charlie Marsh	d097b49371	Remove `Directive::None` variant (#5543 ) ## Summary This is creating some weird, impossible states. Make impossible states unrepresentable!	2023-07-05 22:22:21 +00:00
Charlie Marsh	cdb9fda3b8	Add debug-based snapshot tests for noqa directive parsing (#5535 ) ## Summary Better tests, helpful for future refactors.	2023-07-05 21:49:07 +00:00
Charlie Marsh	a0c0b74b6d	Use structs for noqa `Directive` variants (#5533 ) ## Summary No behavioral changes, just clearer (IMO) and with better documentation.	2023-07-05 21:37:32 +00:00
qdegraaf	6f548d9872	`[isort]` Add `--case-sensitive` flag (#5539 ) ## Summary Adds a `--case-sensitive` setting/flag to isort (default: `false`) which, when set to `true` sorts imports case sensitively instead of case insensitively. Tests and Docs can be improved, can do that if the general idea of the implementation is in order. First `isort` edit so any and all feedback is welcomed even more than usual. ## Test Plan Added a fixture with an assortment of imports in various cases. ## Issue links Closes: https://github.com/astral-sh/ruff/issues/5514	2023-07-05 16:10:53 -04:00
Charlie Marsh	5a74a8e5a1	Avoid syntax errors when rewriting str(dict) in f-strings (#5538 ) Closes https://github.com/astral-sh/ruff/issues/5530.	2023-07-05 19:22:22 +00:00
Charlie Marsh	c5bfd1e877	Allow descriptor instantiations in dataclass fields (#5537 ) ## Summary Per the Python documentation, dataclasses are allowed to instantiate descriptors, like so: ```python class IntConversionDescriptor: def __init__(self, *, default): self._default = default def __set_name__(self, owner, name): self._name = "_" + name def __get__(self, obj, type): if obj is None: return self._default return getattr(obj, self._name, self._default) def __set__(self, obj, value): setattr(obj, self._name, int(value)) @dataclass class InventoryItem: quantity_on_hand: IntConversionDescriptor = IntConversionDescriptor(default=100) ``` Closes https://github.com/astral-sh/ruff/issues/4451.	2023-07-05 15:19:24 -04:00
Charlie Marsh	9e1039f823	Enable attribute lookups via semantic model (#5536 ) ## Summary This PR enables us to resolve attribute accesses within files, at least for static and class methods. For example, we can now detect that this is a function access (and avoid a false-positive): ```python class Class: @staticmethod def error(): return ValueError("Something") # OK raise Class.error() ``` Closes #5487. Closes #5416.	2023-07-05 15:19:14 -04:00
Tom Kuson	9478454b96	[`pylint`] Implement Pylint `typevar-double-variance` (`C0131`) (#5517 ) ## Summary Implement Pylint `typevar-double-variance` (`C0131`) as `type-bivariance` (`PLC0131`). Includes documentation. Related to #970. Renamed the rule to be more clear (it's not immediately obvious what 'double' means, IMO). The Pylint implementation checks only `TypeVar`, but this PR checks `ParamSpec` as well. ## Test Plan Added tests. `cargo test`	2023-07-05 14:53:41 -04:00
Charlie Marsh	9a8e5f7877	Run `cargo update` (#5534 ) ```console ❯ cargo update Updating crates.io index Updating git repository `https://github.com/charliermarsh/LibCST` Updating git repository `https://github.com/astral-sh/RustPython-Parser.git` Updating git repository `https://github.com/youknowone/unicode_names2.git` Updating bitflags v2.3.2 -> v2.3.3 Updating bstr v1.5.0 -> v1.6.0 Updating clap v4.3.8 -> v4.3.11 Updating clap_builder v4.3.8 -> v4.3.11 Updating clap_complete v4.3.1 -> v4.3.2 Updating colored v2.0.0 -> v2.0.4 Removing hermit-abi v0.2.6 Removing hermit-abi v0.3.1 Adding hermit-abi v0.3.2 Updating is-terminal v0.4.7 -> v0.4.8 Updating itoa v1.0.6 -> v1.0.8 Adding linux-raw-sys v0.4.3 Updating num_cpus v1.15.0 -> v1.16.0 Updating paste v1.0.12 -> v1.0.13 Updating pin-project-lite v0.2.9 -> v0.2.10 Updating quote v1.0.28 -> v1.0.29 Updating regex v1.8.4 -> v1.9.0 Updating regex-automata v0.1.10 -> v0.3.0 Updating regex-syntax v0.7.2 -> v0.7.3 Removing rustix v0.37.20 Adding rustix v0.37.23 Adding rustix v0.38.3 Updating rustversion v1.0.12 -> v1.0.13 Updating ryu v1.0.13 -> v1.0.14 Updating serde v1.0.164 -> v1.0.166 Updating serde_derive v1.0.164 -> v1.0.166 Updating serde_json v1.0.99 -> v1.0.100 Updating syn v2.0.22 -> v2.0.23 Updating thiserror v1.0.40 -> v1.0.41 Updating thiserror-impl v1.0.40 -> v1.0.41 Updating unicode-ident v1.0.9 -> v1.0.10 Updating uuid v1.3.4 -> v1.4.0 Updating windows-targets v0.48.0 -> v0.48.1 ```	2023-07-05 12:34:15 -04:00
Dhruv Manilawala	6fd71e6f53	Avoid triggering DTZ001-006 when using `.astimezone()` (#5524 ) ## Summary Avoid triggering DTZ001-006 when using `.astimezone()` ## Test Plan Added test cases to call `.astimezone()` on DTZ001-006 fixes: #5516	2023-07-05 00:18:59 -04:00
Charlie Marsh	dd60a3865c	Avoid triggering `unnecessary-map` (`C417`) for late-bound lambdas (#5520 ) Closes https://github.com/astral-sh/ruff/issues/5502.	2023-07-04 22:11:29 -04:00
Charlie Marsh	26a268a3ec	Refactor the `unnecessary-map` (`C417`) implementation (#5518 ) ## Summary No behavioral changes. Just refactors + adding a test for a false positive, which I'll fix in a downstream PR.	2023-07-04 20:25:54 -04:00
Charlie Marsh	324455f580	Bump version to 0.0.277 (#5515 )	2023-07-04 17:31:32 -04:00
Charlie Marsh	da1c320bfa	Add .ipynb_checkpoints, .pyenv, .pytest_cache, and .vscode to default excludes (#5513 ) ## Summary VS Code extensions are [recommended](https://code.visualstudio.com/docs/python/settings-reference#_linting-settings) to exclude `.vscode` and `site-packages`. Black also now omits `.vscode`, `.pytest_cache`, and `.ipynb_checkpoints` by default. Omitting `.pyenv` is similar to omitting virtual environments, but really only matters in the context of VS Code (see: https://github.com/astral-sh/ruff/discussions/5509). Closes: #5510.	2023-07-04 20:25:16 +00:00
Charlie Marsh	485d997d35	Tweak prefix match to use .all_rules() (#5512 ) ## Summary No behavior change, but I think this is a little cleaner.	2023-07-04 20:02:57 +00:00
Aarni Koskela	d7214e77e6	Add `ruff rule --all` subcommand (with JSON output) (#5059 ) ## Summary This adds a `ruff rule --all` switch that prints out a human-readable Markdown or a machine-readable JSON document of the lint rules known to Ruff. I needed a machine-readable document of the rules [for a project](https://github.com/astral-sh/ruff/discussions/5078), and figured it could be useful for other people – or tooling! – to be able to interrogate Ruff about its arcane knowledge. The JSON output is an array of the same objects printed by `ruff rule --format=json`. ## Test Plan I ran `ruff rule --all --format=json`. I think more might be needed, but maybe a snapshot test is overkill?	2023-07-04 19:45:38 +00:00
Charlie Marsh	952c623102	Avoid returning first-match for rule prefixes (#5511 ) Closes #5495, but there's a TODO here to improve this further. The current `from_code` implementation feels really indirect.	2023-07-04 19:23:05 +00:00
Tom Kuson	0e67757edb	[`pylint`] Implement Pylint `typevar-name-mismatch` (`C0132`) (#5501 ) ## Summary Implement Pylint `typevar-name-mismatch` (`C0132`) as `type-param-name-mismatch` (`PLC0132`). Includes documentation. Related to #970. The Pylint implementation checks only `TypeVar`, but this PR checks `TypeVarTuple`, `ParamSpec`, and `NewType` as well. This seems to better represent the Pylint rule's [intended behaviour](https://github.com/pylint-dev/pylint/issues/5224). Full disclosure: I am not a fan of the translated name and think it should probably be different. ## Test Plan `cargo test`	2023-07-04 18:49:43 +00:00
Charlie Marsh	c395e44bd7	Avoid PERF rules for iteration-dependent assignments (#5508 ) ## Summary We need to avoid raising "rewrite as a comprehension" violations in cases like: ```python d = defaultdict(list) for i in [1, 2, 3]: d[i].append(i**2) ``` Closes https://github.com/astral-sh/ruff/issues/5494. Closes https://github.com/astral-sh/ruff/issues/5500.	2023-07-04 18:21:05 +00:00
Charlie Marsh	75da72bd7f	Update documentation to list double-quote preference first (#5507 ) Closes https://github.com/astral-sh/ruff/issues/5496.	2023-07-04 18:06:01 +00:00
Charlie Marsh	521e6de2c8	Fix eval detection for suspicious-eval-usage (#5506 ) Closes https://github.com/astral-sh/ruff/issues/5505.	2023-07-04 18:01:29 +00:00
Thomas de Zeeuw	0b963ddcfa	Add unreachable code rule (#5384 ) Co-authored-by: Thomas de Zeeuw <thomas@astral.sh> Co-authored-by: Micha Reiser <micha@reiser.io>	2023-07-04 14:27:23 +00:00
konsti	937de121f3	check-formatter-stability: Remove newlines and add `--error-file` (#5491 ) ## Summary This makes the output of `check-formatter-stability` more concise by removing extraneous newlines. It also adds a `--error-file` option to that script that allows creating a file with just the errors (without the status messages) to share with others. ## Test Plan I ran it over CPython and looked at the output. I then added the `--error-file` option and looked at the contents of the file	2023-07-04 07:54:35 +00:00
konsti	787e2fd49d	Format import statements (#5493 ) ## Summary Format import statements in all their variants. Specifically, this implemented formatting `StmtImport`, `StmtImportFrom` and `Alias`. ## Test Plan I added some custom snapshots, even though this has been covered well by black's tests.	2023-07-04 07:07:20 +00:00
Aarni Koskela	6acc316d19	Turn Linters', etc. implicit `into_iter()`s into explicit `rules()` (#5436 ) ## Summary As discussed on ~IRC~ Discord, this will make it easier for e.g. the docs generation stuff to get all rules for a linter (using `all_rules()`) instead of just non-nursery ones, and it also makes it more Explicit Is Better Than Implicit to iterate over linter rules. Grepping for `Item = Rule` reveals some remaining implicit `IntoIterator`s that I didn't feel were necessarily in scope for this (and honestly, iterating over a `RuleSet` makes sense).	2023-07-03 19:35:16 -04:00
konsti	a647f31600	Don't add a magic trailing comma for a single entry (#5463 ) ## Summary If a comma separated list has only one entry, black will respect the magic trailing comma, but it will not add a new one. The following code will remain as is: ```python b1 = [ aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa ] b2 = [ aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa, ] b3 = [ aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa, aksjdhflsakhdflkjsadlfajkslhfdkjsaldajlahflashdfljahlfksajlhfajfjfsaahflakjslhdfkjalhdskjfa ] ``` ## Test Plan This was first discovered in `7eeadc82c2/django/contrib/admin/checks.py (L674-L681)`, which i've minimized into a call test. I've added tests for the three cases (one entry + no comma, one entry + comma, more than one entry) to the list tests. The diffs from the black tests get smaller.	2023-07-03 21:48:44 +02:00
Charlie Marsh	3992c47c00	Bump version to 0.0.276 (#5488 )	2023-07-03 18:02:49 +00:00
Charlie Marsh	8de5a3d29d	Allow `Final` assignments in stubs (#5490 ) ## Summary This fixes one incompatibility with `flake8-pyi`, and gives us a clean pass on `typeshed`.	2023-07-03 17:57:49 +00:00
Charlie Marsh	ed1dd09d02	Refine some `perflint` rules (#5484 ) ## Summary Removing some false positives based on running over `zulip`. `PERF401` now also detects cases like: ```py original = list(range(10000)) filtered = [] for i in original: filtered.append(i * i) ``` Previously, these were caught by the list-copy rule, but these too need comprehensions.	2023-07-03 13:53:17 -04:00
Charlie Marsh	ca497fabbd	Remove some `diagnostics.extend` calls (#5483 ) ## Summary It's more efficient (and more idiomatic for us) to pass in the `Checker` directly.	2023-07-03 16:47:23 +00:00
Charlie Marsh	00fbbe4223	Remove some additional manual iterator matches (#5482 ) ## Summary I've done a few of these PRs, I thought I'd caught them all, but missed this pattern.	2023-07-03 16:29:59 +00:00
Charlie Marsh	dadad0e9ed	Remove some allocations in argument detection (#5481 ) ## Summary Drive-by PR to remove some allocations around argument name matching.	2023-07-03 12:21:26 -04:00
Charlie Marsh	d2450c25ab	Audit `remove_argument` usages to use end-of-function (#5480 ) ## Summary This PR applies the fix in #5478 to a variety of other call-sites, and fixes some other range hygienic stuff in the rules that were modified.	2023-07-03 12:21:01 -04:00
Harutaka Kawamura	1e4b88969c	Fix `unnecessary-encode-utf8` to fix `encode` on parenthesized strings correctly (#5478 ) ## Summary Fixes #5477 ## Test Plan New test cases.	2023-07-03 10:11:09 -04:00
Louis Dispa	dc072537e5	Fix python_formatter generate.py with rust path (#5475 ) ## Summary This PR fix an issue with the `generate.py` file of the python formatter. Since https://github.com/astral-sh/ruff/pull/5369 the [node.rs file](`f51dc20497/crates/ruff_python_ast/src/node.rs`) used to generate the types now has `ast::` in the enum. ```rust pub enum AnyNode { ModModule(ModModule), ModInteractive(ModInteractive), ModExpression(ModExpression), ModFunctionType(ModFunctionType), ... ``` And now: ```rust pub enum AnyNode { ModModule(ast::ModModule), ModInteractive(ast::ModInteractive), ModExpression(ast::ModExpression), ModFunctionType(ast::ModFunctionType), ... ``` The python script was not parsing rust paths. This PR adds the possibility to have it. ## Test Plan This was tested locally. ### Script output Before ``` ['ast::ModModule),', 'ast::ModInteractive),', 'ast::ModExpression),', 'ast::ModFunctionType),', 'ast::StmtFunctionDef),', 'ast::StmtAsyncFunctionDef),', 'ast::StmtClassDef),', 'ast::StmtReturn),', 'ast::StmtDelete),', 'ast::StmtAssign),', 'ast::StmtAugAssign),', 'ast::StmtAnnAssign),', 'ast::StmtFor),', 'ast::StmtAsyncFor),', 'ast::StmtWhile),', 'ast::StmtIf),', 'ast::StmtWith),', 'ast::StmtAsyncWith),', 'ast::StmtMatch),', 'ast::StmtRaise),', 'ast::StmtTry),', 'ast::StmtTryStar),', 'ast::StmtAssert),', 'ast::StmtImport),', 'ast::StmtImportFrom),', 'ast::StmtGlobal),', 'ast::StmtNonlocal),', 'ast::StmtExpr),', 'ast::StmtPass),', 'ast::StmtBreak),', 'ast::StmtContinue),', 'ast::ExprBoolOp),', 'ast::ExprNamedExpr),', 'ast::ExprBinOp),', 'ast::ExprUnaryOp),', 'ast::ExprLambda),', 'ast::ExprIfExp),', 'ast::ExprDict),', 'ast::ExprSet),', 'ast::ExprListComp),', 'ast::ExprSetComp),', 'ast::ExprDictComp),', 'ast::ExprGeneratorExp),', 'ast::ExprAwait),', 'ast::ExprYield),', 'ast::ExprYieldFrom),', 'ast::ExprCompare),', 'ast::ExprCall),', 'ast::ExprFormattedValue),', 'ast::ExprJoinedStr),', 'ast::ExprConstant),', 'ast::ExprAttribute),', 'ast::ExprSubscript),', 'ast::ExprStarred),', 'ast::ExprName),', 'ast::ExprList),', 'ast::ExprTuple),', 'ast::ExprSlice),', 'ast::ExceptHandlerExceptHandler),', 'ast::PatternMatchValue),', 'ast::PatternMatchSingleton),', 'ast::PatternMatchSequence),', 'ast::PatternMatchMapping),', 'ast::PatternMatchClass),', 'ast::PatternMatchStar),', 'ast::PatternMatchAs),', 'ast::PatternMatchOr),', 'ast::TypeIgnoreTypeIgnore),', 'Comprehension),', 'Arguments),', 'Arg),', 'ArgWithDefault),', 'Keyword),', 'Alias),', 'WithItem),', 'MatchCase),', 'Decorator),'] error: unexpected closing delimiter: `)` --> <stdin>:3:55 \| 2 \| use ruff_formatter::{write, Buffer, FormatResult}; \| - this opening brace... - ...matches this closing brace 3 \| use rustpython_parser::ast::ast::ModModule),; \| ^ unexpected closing delimiter Traceback (most recent call last): File "/Users/ldispa/Documents/perso/ruff/crates/ruff_python_formatter/generate.py", line 100, in <module> node_path.write_text(rustfmt(code)) ^^^^^^^^^^^^^ File "/Users/ldispa/Documents/perso/ruff/crates/ruff_python_formatter/generate.py", line 12, in rustfmt return check_output(["rustfmt", "--emit=stdout"], input=code, text=True) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/opt/homebrew/Cellar/python@3.11/3.11.4_1/Frameworks/Python.framework/Versions/3.11/lib/python3.11/subprocess.py", line 466, in check_output return run(*popenargs, stdout=PIPE, timeout=timeout, check=True, ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/opt/homebrew/Cellar/python@3.11/3.11.4_1/Frameworks/Python.framework/Versions/3.11/lib/python3.11/subprocess.py", line 571, in run raise CalledProcessError(retcode, process.args, subprocess.CalledProcessError: Command '['rustfmt', '--emit=stdout']' returned non-zero exit status 1. ``` After: ``` ['ModModule', 'ModInteractive', 'ModExpression', 'ModFunctionType', 'StmtFunctionDef', 'StmtAsyncFunctionDef', 'StmtClassDef', 'StmtReturn', 'StmtDelete', 'StmtAssign', 'StmtAugAssign', 'StmtAnnAssign', 'StmtFor', 'StmtAsyncFor', 'StmtWhile', 'StmtIf', 'StmtWith', 'StmtAsyncWith', 'StmtMatch', 'StmtRaise', 'StmtTry', 'StmtTryStar', 'StmtAssert', 'StmtImport', 'StmtImportFrom', 'StmtGlobal', 'StmtNonlocal', 'StmtExpr', 'StmtPass', 'StmtBreak', 'StmtContinue', 'ExprBoolOp', 'ExprNamedExpr', 'ExprBinOp', 'ExprUnaryOp', 'ExprLambda', 'ExprIfExp', 'ExprDict', 'ExprSet', 'ExprListComp', 'ExprSetComp', 'ExprDictComp', 'ExprGeneratorExp', 'ExprAwait', 'ExprYield', 'ExprYieldFrom', 'ExprCompare', 'ExprCall', 'ExprFormattedValue', 'ExprJoinedStr', 'ExprConstant', 'ExprAttribute', 'ExprSubscript', 'ExprStarred', 'ExprName', 'ExprList', 'ExprTuple', 'ExprSlice', 'ExceptHandlerExceptHandler', 'PatternMatchValue', 'PatternMatchSingleton', 'PatternMatchSequence', 'PatternMatchMapping', 'PatternMatchClass', 'PatternMatchStar', 'PatternMatchAs', 'PatternMatchOr', 'TypeIgnoreTypeIgnore', 'Comprehension', 'Arguments', 'Arg', 'ArgWithDefault', 'Keyword', 'Alias', 'WithItem', 'MatchCase', 'Decorator'] ```	2023-07-03 16:07:57 +02:00
konsti	7ac9e0252e	Document Checking formatter stability and panics (#5415 ) This adds the documentation, but ideally we should add the CI first	2023-07-03 11:22:19 +02:00
konsti	ca6ff72404	Change generator formatting dummy to include NOT_YET_IMPLEMENTED (#5464 ) ## Summary Change generator formatting dummy to include `NOT_YET_IMPLEMENTED`. This makes it easier to correctly identify them as dummies ## Test Plan This is a dummy change	2023-07-03 09:11:14 +02:00
Charlie Marsh	94ac2c4e1b	Reorganize some `flake8-pyi` rules (#5472 )	2023-07-03 04:39:22 +00:00
qdegraaf	93b2bd7184	[`perflint`] Add `PERF401` and `PERF402` rules (#5298 ) ## Summary Adds `PERF401` and `PERF402` mirroring `W8401` and `W8402` from https://github.com/tonybaloney/perflint Implementation is not super smart but should be at parity with upstream implementation judging by: `c07391c176/perflint/comprehension_checker.py (L42-L73)` It essentially checks: - If the body of a for-loop is just one statement - If that statement is an `if` and the if-statement contains a call to `append()` we flag `PERF401` and suggest a list comprehension - If that statement is a plain call to `append()` or `insert()` we flag `PERF402` and suggest `list()` or `list.copy()` I've set the violation to only flag the first append call in a long `if-else` statement for `PERF401`. Happy to change this to some other location or make it multiple violations if that makes more sense. ## Test Plan Fixtures were added with the relevant scenarios for both rules ## Issue Links Refers: https://github.com/astral-sh/ruff/issues/4789	2023-07-03 04:03:09 +00:00
Justin Prieto	0bff4ed4d3	[`flake8-pyi`] Implement PYI002, PYI003, PYI004, PYI005 (#5457 ) ## Summary Implements flake8-pyi checks 002, 003, 004, 005. The logic is a bit complex, as you can see in the [original code](`57921813c1/pyi.py (L1403C18-L1403C18)`). ref: #848 ## Test Plan Updated snapshot tests. Ran flake8 to double check lints, and ran ruff with all PYI lints enabled to check for incorrect overlapping lint errors.	2023-07-02 23:52:16 -04:00
Anders Kaseorg	df13e69c3c	Format let-else with rustfmt nightly (#5461 ) Support for `let…else` formatting was just merged to nightly (rust-lang/rust#113225). Rerun `cargo fmt` with Rust nightly 2023-07-02 to pick this up. Followup to #939. Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2023-07-03 02:13:35 +00:00
Charlie Marsh	c8b9a46e2b	[`pyupgrade`] Restore the `keep-runtime-typing` setting (#5470 ) ## Summary This PR reverts #4427. See the included documentation for a detailed explanation. Closes #5434.	2023-07-03 02:11:31 +00:00
Charlie Marsh	6cc04d64e4	[`flake8-django`] Skip duplicate violations in `DJ012` (#5469 ) ## Summary This PR reduces the noise from `DJ012` by emitting a single violation when you have multiple consecutive violations of the same "type". For example, given: ```py class MultipleConsecutiveFields(models.Model): """Model that contains multiple out-of-order field definitions in a row.""" class Meta: verbose_name = "test" first_name = models.CharField(max_length=32) last_name = models.CharField(max_length=32) ``` It's convenient to only error on `first_name`, and not `last_name`, since we're really flagging that the _section_ is out-of-order. Closes #5465.	2023-07-02 21:09:49 -04:00
Charlie Marsh	d0b2fffb87	[`numpy`] Add `numpy-deprecated-function` (NPY003) (#5468 ) ## Summary Closes #5456.	2023-07-02 20:50:14 -04:00
Charlie Marsh	b32d1e8d78	Detect consecutive, non-newline-delimited NumPy sections (#5467 ) ## Summary Given a docstring like: ```py def f(a: int, b: int) -> int: """Showcase function. Parameters ---------- a : int _description_ b : int _description_ Returns ------- int _description """ ``` We were failing to identify `Returns` as a section, because the previous line was neither empty nor ended with punctuation. This was causing a false negative, where by we weren't flagging a missing line before `Returns`. So, the very reason for the rule (no blank line) was causing us to fail to catch it. Note that, we did have a test case for this, which was working properly: ```py def f() -> int: """Showcase function. Parameters ---------- Returns ------- """ ``` ...because the line before `Returns` "ends in a punctuation mark" (`-`). Closes #5442.	2023-07-02 20:29:45 -04:00
Charlie Marsh	af7051b976	Include BaseException in B017 rule (#5466 ) Closes #5462.	2023-07-02 20:18:33 -04:00
Micha Reiser	f0ec9ecd67	Show `BestFitting` mode if it isn't `FirstLine` (#5452 )	2023-06-30 09:49:00 +00:00
Micha Reiser	f9129e435a	Normalize '\r' in string literals to '\n' <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR normalizes line endings inside of strings to `\n` as required by the printer. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan I added a new test using `\r\n` and ran the ecosystem check. There are no remaining end of line panics. https://gist.github.com/MichaReiser/8f36b1391ca7b48475b3a4f592d74ff4 <!-- How was it tested? -->	2023-06-30 10:13:23 +02:00
Micha Reiser	dc65007fe9	Use rayon to parallelize the stability check <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR uses rayon to parallelize the stability check by scheduling each project as its own task. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan I ran the ecosystem check. It now makes use of all cores (except at the end, there are some large projects). ## Performance The check now completes in minutes where it took about 30 minutes before. <!-- How was it tested? -->	2023-06-30 10:05:25 +02:00
Micha Reiser	9c2a75284b	Preserve parentheses around left side of binary expression <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR fixes an issue where the binary expression formatting removed parentheses around the left hand side of an expression. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan I added a new regression test and re-ran the ecosystem check. It brings down the `check-formatter-stability` output from a 3.4MB file down to 900KB. <!-- How was it tested? -->	2023-06-30 09:52:14 +02:00
Micha Reiser	ae25638b0b	Update Black tests (#5438 )	2023-06-30 06:32:50 +00:00
Micha Reiser	955e9ef821	Fix invalid syntax for binary expression in unary op (#5370 )	2023-06-29 08:09:26 +02:00
Micha Reiser	38189ed913	Fix invalid printer IR error (#5422 )	2023-06-29 08:09:13 +02:00
David Szotten	ca5e10b5ea	format StmtTryStar (#5418 )	2023-06-29 08:07:33 +02:00
Charlie Marsh	a973019358	Rewrite a variety of `.contains()` calls as `matches!` statements (#5432 ) ## Summary These have the potential to be much more efficient, as we've seen in the past.	2023-06-28 22:42:27 -04:00
Charlie Marsh	aa887d5a1d	Use "manual" fixability for E731 in shadowed context (#5430 ) ## Summary This PR makes E731 a "manual" fix in one other context: when the lambda is shadowing another variable in the scope. Function declarations (with shadowing) cause issues for type checkers, and so rewriting an annotation, e.g., in branches of an `if` statement can lead to failures. Closes https://github.com/astral-sh/ruff/issues/5421.	2023-06-28 22:00:06 -04:00
Charlie Marsh	72f7f11bac	Use `matches!` for reserved attribute lookup (#5431 )	2023-06-29 01:52:11 +00:00
Tom Kuson	5aa2a90e17	Add documentation to `flake8-logging-format` rules (#5417 ) ## Summary Completes the documentation for the `flake8-logging-format` rules. Related to #2646. I included both the `flake8-logging-format` recommendation to use the `extra` keyword and the Pylint recommendation to pass format values as parameters so that formatting is done lazily, as #970 suggests the Pylint logging rules are covered by this ruleset. Using lazy formatting via parameters is probably more common than avoiding formatting entirely in favour of the `extra` argument, regardless. ## Test Plan `python scripts/check_docs_formatted.py`	2023-06-29 01:30:11 +00:00
Charlie Marsh	0e89c94947	Run shadowed-variable analyses in deferred handlers (#5181 ) ## Summary This PR extracts a bunch of complex logic from `add_binding`, instead running the the shadowing rules in the deferred handler, thereby decoupling the binding phase (during which we build up the semantic model) from the analysis phase, and generally making `add_binding` much more focused. This was made possible by improving the semantic model to better handle deletions -- previously, we'd "lose track" of bindings if they were deleted, which made this kind of refactor impossible. ## Test Plan We have good automated coverage for this, but I want to benchmark it separately.	2023-06-29 00:08:18 +00:00
Charlie Marsh	c5e20505f8	Remove an unsafe access in the resolver (#5428 )	2023-06-28 19:08:10 +00:00
Charlie Marsh	69c4b7fa11	Add dedicated `struct` for implicit imports (#5427 ) ## Summary This was some feedback on a prior PR that I decided to act on separately.	2023-06-28 18:55:43 +00:00
Charlie Marsh	0e12eb3071	Add a snapshot test for native module resolution (#5423 )	2023-06-28 18:16:39 +00:00
Charlie Marsh	864f50a3a4	Remove all `unwrap` calls from the resolver (#5426 )	2023-06-28 18:06:17 +00:00
Charlie Marsh	4d90a5a9bc	Move resolver tests out to top-level (#5424 ) ## Summary These are really tests for the entire crate.	2023-06-28 13:25:37 -04:00
Charlie Marsh	1d2d015bc5	Make standard input detection robust to invalid arguments (#5393 ) ## Summary This PR fixes a silent failure that manifested itself in https://github.com/astral-sh/ruff-vscode/issues/238. In short, if the user provided invalid arguments to Ruff in the VS Code extension (like `"ruff.args": ["a"]`), then we generated something like the following command: ```console /path/to/ruff --force-exclude --no-cache --no-fix --format json - --fix a --stdin-filename /path/to/file.py ``` Since this contains both `-` and `a` as the "input files", Ruff would treat this as if we're linting the files names `-` and `a`, rather than linting standard input. This PR modifies out standard input detection to force standard input when `--stdin-filename` is present, or at least one file is `-`. (We then warn and ignore the others.)	2023-06-28 14:52:23 +00:00
Charlie Marsh	ea7bb199bc	Fill-in missing implementation for `is_native_module_file_name` (#5410 ) ## Summary This was just an oversight -- the last remaining `todo!()` that I never filled in. We clearly don't have any test coverage for it yet, but this mimics the Pyright implementation.	2023-06-28 14:50:54 +00:00
Charlie Marsh	979049b2a6	Make lib iteration platform-specific (#5406 )	2023-06-28 13:52:20 +00:00
Charlie Marsh	6587fb844a	Add snapshot tests for resolver (#5404 ) ## Summary This PR adds some snapshot tests for the resolver based on executing resolutions within a "mock" of the Airflow repo (that is: a folder that contains a subset of the repo's files, but all empty, and with an only-partially-complete virtual environment). It's intended to act as a lightweight integration test, to enable us to test resolutions on a "real" project without adding a dependency on Airflow itself.	2023-06-28 13:38:51 +00:00
Dhruv Manilawala	a68a86e18b	fixup! Consider Jupyter index for code frames (`--show-source`) (#5402 ) (#5414 )	2023-06-28 10:25:05 +00:00
Christian Clauss	b42d76494c	types.rs: fnmatch url should point to current Python docs (#5413 ) Like #5412	2023-06-28 15:54:13 +05:30
David Szotten	c7adb9117f	format StmtAsyncWith (#5376 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-06-28 10:21:44 +00:00
David Szotten	1979103ec0	Format `StmtTry` (#5222 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-06-28 10:02:15 +00:00
Christian Clauss	9e2fd0c620	ruff rule SLOT uses URL to current Python docs (#5412 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary <!-- What's the purpose of the change? What does it do, and why? --> Currently the URL at the bottom of the `ruff rule SLOT00x` output points to Python 3.7 docs. Given that Python 3.7 is now end-of-life (as of yesterday), let's instead point users to the current Python docs. ## Test Plan <!-- How was it tested? -->	2023-06-28 09:48:52 +00:00
Charlie Marsh	366edc5a3f	Fix string annotation in docs (#5411 )	2023-06-28 03:29:56 +00:00
Dhruv Manilawala	2aecaf5060	Consider Jupyter index for code frames (`--show-source`) (#5402 ) ## Summary Consider Jupyter index for code frames (`--show-source`). This solves two problems as mentioned in the linked issue: > Omit any contents from adjoining cells If the Jupyter index is present, we'll use that to check if the surrounding lines belong to the same cell as the content line. If not, we'll skip that line until we either reach the one which does or we reach the content line. > code frame line number If the Jupyter index is present, we'll use that to get the actual start line in corresponding to the computed start index. ## Test Plan `cargo run --bin ruff -- check --no-cache --isolated --select=ALL --show-source /path/to/notebook.ipynb` fixes: #5395	2023-06-28 08:54:51 +05:30
Marti Raudsepp	2c99b268c6	Exclude docstrings from PYI053 (#5405 ) ## Summary The `Y053` rule of `flake8-pyi` ignores docstrings, it only triggers on other string literals. The separate `Y021/PYI021` rule exists to disallow docstrings. ## Test Plan Added some `# OK` test cases to `PYI053.py(i)` files.	2023-06-28 00:19:20 +00:00
Charlie Marsh	56f73de0cb	Misc. clean-up for import resolver (#5401 ) ## Summary Renaming functions, adding documentation, refactoring the test infrastructure a bit.	2023-06-27 19:27:12 +00:00
Tom Kuson	a0a93a636f	Implement Pylint `single-string-used-for-slots` (`C0205`) as `single-string-slots` (`PLC0205`) (#5399 ) ## Summary Implement Pylint rule `single-string-used-for-slots` (`C0205`) as `single-string-slots` (`PLC0205`). This rule checks for single strings being assigned to `__slots__`. For example ```python class Foo: __slots__: str = "bar" def __init__(self, bar: str) -> None: self.bar = bar ``` should be ```python class Foo: __slots__: tuple[str, ...] = ("bar",) def __init__(self, bar: str) -> None: self.bar = bar ``` Related to #970. Includes documentation. ## Test Plan `cargo test`	2023-06-27 18:33:58 +00:00
Tom Kuson	035f8993f4	Complete documentation for `pydocstyle` rules (#5387 ) ## Summary Completes the documentation for the `pydocstyle` ruleset. Related to #2646. ## Test Plan `python scripts/check_docs_formatted.py`	2023-06-27 18:12:21 +00:00
Charlie Marsh	032b967b05	Enable --watch for Jupyter notebooks (#5394 ) ## Summary The list of extensions that support watching is hard-coded (unfortunately); this PR adds `.ipynb` to the list.	2023-06-27 12:53:47 -04:00
Dhruv Manilawala	962479d943	Replace same length equal line with dash line in D407 (#5383 ) ## Summary Replace same length equal line with dash line in D407 Do we want to update the message and autofix title to reflect this change? ## Test Plan Added test cases for: - Equal line length == dash line length - Equal line length != dash line length fixes: #5378	2023-06-27 16:50:20 +00:00
Evan Rittenhouse	ff0d0ab7a0	Add applicability to pydocstyle (#5390 )	2023-06-27 12:40:19 -04:00
Evan Rittenhouse	0585e14d3b	Add applicability to flake8_pytest_style (#5389 )	2023-06-27 12:39:56 -04:00
Charlie Marsh	1ed227a1e0	Port Pyright's import resolver to Rust (#5381 ) ## Summary This PR contains the first step towards enabling robust first-party, third-party, and standard library import resolution in Ruff (including support for `typeshed`, stub files, native modules, etc.) by porting Pyright's import resolver to Rust. The strategy taken here was to start with a more-or-less direct port of the Pyright's TypeScript resolver. The code is intentionally similar, and the test suite is effectively a superset of Pyright's test suite for its own resolver. Due to the nature of the port, the code is very, very non-idiomatic for Rust. The code is also entirely unused outside of the test suite, and no effort has been made to integrate it with the rest of the codebase. Future work will include: - Refactoring the code (now that it works) to match Rust and Ruff idioms. - Further testing, in practice, to ensure that the resolver can resolve imports in a complex project, when provided with a virtual environment path. - Caching, to minimize filesystem lookups and redundant resolutions. - Integration into Ruff itself (use Ruff's existing settings, find rules that can make use of robust resolution, etc.)	2023-06-27 16:15:07 +00:00
Charlie Marsh	502e15585d	Ignore unpacking in `iteration-over-set` (#5392 ) Closes #5386.	2023-06-27 15:33:42 +00:00
konstin	520f4f33c3	Fix ruff_dev repeat by removing short argument (#5388 ) ruff_dev repeat recently broke (i think with the cargo update?): > thread 'main' panicked at 'Command repeat: Short option names must be unique for each argument, but '-n' is in use by both 'no_cache' and 'repeat'' This fixes this by removing the short argument.	2023-06-27 13:29:20 +00:00
konstin	7f6cb9dfb5	Format call expressions (without call chaining) (#5341 ) ## Summary This formats call expressions with magic trailing comma and parentheses behaviour but without call chaining ## Test Plan Lots of new test fixtures, including some that don't work yet	2023-06-27 09:29:40 +00:00
David Szotten	50a7769d69	magic trailing comma for ExprList (#5365 )	2023-06-26 21:59:01 +02:00
Evan Rittenhouse	190bed124f	[`perflint`] Implement `try-except-in-loop` (`PERF203`) (#5166 ) ## Summary Implements PERF203 from #4789, which throws if a `try/except` block is inside of a loop. Not sure if we want to extend the diagnostic to the `except` as well, but I thought that that may get a little messy. We may also want to just throw on the word `try` - open to suggestions though. ## Test Plan `cargo test`	2023-06-26 17:34:37 +00:00
Charlie Marsh	d53b986fd4	Fix autofix capabilities in playground (#5375 ) ## Summary These had just bitrotted over time -- we were no longer passing along the row-and-column indices, etc. ## Test Plan ![Screen Shot 2023-06-26 at 12 03 41 PM](https://github.com/astral-sh/ruff/assets/1309177/6791330d-010b-45d3-91ef-531d4745193f)	2023-06-26 16:40:28 +00:00
Charlie Marsh	8a1bb7a5af	Fix version number in playground (#5372 ) ## Summary `v0.0.275` in the top-right was showing `v0.0.0` at all times. ## Test Plan ![Screen Shot 2023-06-26 at 11 31 16 AM](https://github.com/astral-sh/ruff/assets/1309177/e6cd0e19-6a5f-4b46-a060-54f492524737)	2023-06-26 15:56:12 +00:00
Dhruv Manilawala	2fc38d81e6	Experimental release for Jupyter notebook integration (#5363 ) ## Summary Experimental release for Jupyter Notebook integration. Currently, this requires a user to explicitly opt-in using the [include](https://beta.ruff.rs/docs/settings/#include) configuration: ```toml [tool.ruff] include = [".py", ".pyi", "*/pyproject.toml", ".ipynb"] ``` Or, a user can pass in the file directly: ```sh ruff check path/to/notebook.ipynb ``` For known limitations, please refer #5188 ## Test Plan Following command should work without the `--all-features` flag: ```sh cargo dev round-trip /path/to/notebook.ipynb ``` Following command should work with the above config file along with `select = ["ALL"]`: ```sh cargo run --bin ruff -- check --no-cache --config=../test-repos/openai-cookbook/pyproject.toml --fix ../test-repos/openai-cookbook/ ``` Passing the Jupyter notebook directly: ```sh cargo run --bin ruff -- check --no-cache --isolated --select=ALL --fix ../test-repos/openai-cookbook/examples/Classification_using_embeddings.ipynb ```	2023-06-26 21:22:42 +05:30
Charlie Marsh	fa1b85b3da	Remove prelude from `ruff_python_ast` (#5369 ) ## Summary Per @MichaReiser, this is causing more confusion than it is helpful.	2023-06-26 11:43:49 -04:00
Tom Kuson	baa7264ca4	Add documentation for `flake8-2020` (#5366 ) ## Summary Completes the documentation for the `flake8-2020` ruleset. Related to #2646 . ## Test Plan `python scripts/check_docs_formatted.py`	2023-06-26 15:24:42 +00:00
Tom Kuson	fde3f09370	Add documentation missing docstring rules (`D1XX`) (#5330 ) ## Summary Add documentation to the `D1XX` rules that flag missing docstrings. The examples are quite long and docstrings practices vary a lot between projects, so I thought it would be best that the documentation for these rules be their own PR separate to the other `pydocstyle` rules. Related to #2646. ## Test Plan `python scripts/check_docs_formatted.py`	2023-06-26 14:44:46 +00:00
David Szotten	d00559e42a	format StmtWith (#5350 )	2023-06-26 15:09:06 +01:00
Micha Reiser	49cabca3e7	Format implicit string continuation (#5328 )	2023-06-26 12:41:47 +00:00
Micha Reiser	313711aaf9	Prefer the configured quote style <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR extends the string formatting to respect the configured quote style. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan Extended the string test with new cases and set it up to run twice: Once with the `quote_style: Doube`, and once with `quote_style: Single` single and double quotes. <!-- How was it tested? -->	2023-06-26 14:24:25 +02:00
Micha Reiser	f18a1f70de	Add tests for skip magic trailing comma <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR adds tests that verify that the magic trailing comma is not respected if disabled in the formatter options. Our test setup now allows to create a `<fixture-name>.options.json` file that contains an array of configurations that should be tested. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan It's all about tests :) <!-- How was it tested? -->	2023-06-26 14:15:55 +02:00
Micha Reiser	dd0d1afb66	Create `PyFormatOptions` <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR adds a new `PyFormatOptions` struct that stores the python formatter options. The new options aren't used yet, with the exception of magical trailing commas and the options passed to the printer. I'll follow up with more PRs that use the new options (e.g. `QuoteStyle`). <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan `cargo test` I'll follow up with a new PR that adds support for overriding the options in our fixture tests.	2023-06-26 14:02:17 +02:00
konstin	a52cd47c7f	Fix attribute chain own line comments (#5340 ) ## Motation Previously, ```python x = ( a1 .a2 # a . # b # c a3 ) ``` got formatted as ```python x = a1.a2 # a . # b # c a3 ``` which is invalid syntax. This fixes that. ## Summary This implements a basic form of attribute chaining (<https://black.readthedocs.io/en/stable/the_black_code_style/current_style.html#call-chains>) by checking if any inner attribute access contains an own line comment, and if this is the case, adds parentheses around the outermost attribute access while disabling parentheses for all inner attribute expressions. We want to replace this with an implementation that uses recursion or a stack while formatting instead of in `needs_parentheses` and also includes calls rather sooner than later, but i'm fixing this now because i'm uncomfortable with having known invalid syntax generation in the formatter. ## Test Plan I added new fixtures.	2023-06-26 09:13:07 +00:00
Micha Reiser	8879927b9a	Use `insta::glob` instead of `fixture` macro (#5364 )	2023-06-26 08:46:18 +00:00
Charlie Marsh	dce6a046b0	Add tests for escape-sequence-in-docstring (#5362 ) ## Summary Looks like I added a regression in #5360. This PR fixes it and adds dedicated tests to avoid it in the future.	2023-06-25 22:42:12 -04:00
Charlie Marsh	18c73c1f9b	Improve backslash-detection rule for docstrings (#5360 )	2023-06-26 01:58:20 +00:00
Charlie Marsh	19c221a2d2	Use matches for `os-error-alias` (#5361 )	2023-06-26 01:57:52 +00:00
Tom Kuson	fd0c3faa70	Add documentation to rules that check docstring quotes (`D3XX`) (#5351 ) ## Summary Add documentation to the `D3XX` rules that check for issues with docstring quotes. Related to #2646. ## Test Plan `python scripts/check_docs_formatted.py`	2023-06-25 22:34:03 +00:00
Charlie Marsh	1fe4073b56	Update the `invalid-escape-sequence` rule (#5359 ) Just a couple small tweaks based on reading the rule with fresh eyes and new best-practices.	2023-06-25 22:20:31 +00:00
Charlie Marsh	1ef4eee089	Add space when migrating to raw string (#5358 ) ## Summary We had to do this for f-strings too -- if we add a prefix to `"foo"` in `return"foo"`, we also need to add a leading space.	2023-06-25 18:10:08 -04:00
Shantanu	0ce38b650e	Change W605 autofix to use raw strings if possible (#5352 ) Fixes #5061.	2023-06-25 17:35:07 -04:00
Evan Rittenhouse	e0a507e48e	Add Applicability to flake8_simplify (#5348 )	2023-06-23 22:54:43 +00:00
Dhruv Manilawala	adf5cb5ff7	Ignore type aliases for RUF013 (#5344 ) ## Summary Ignore type aliases for RUF013 to avoid flagging false positives: ```python from typing import Optional MaybeInt = Optional[int] def f(arg: MaybeInt = None): pass ``` But, at the expense of having false negatives: ```python Text = str \| bytes def f(arg: Text = None): pass ``` ## Test Plan `cargo test` fixes: #5295	2023-06-23 22:51:09 +00:00
Micha Reiser	d3d69a031e	Add `JoinCommaSeparatedBuilder` (#5342 )	2023-06-23 22:03:05 +01:00
Micha Reiser	6ba9d5d5a4	Upgrade RustPython (#5334 )	2023-06-23 20:39:47 +00:00
Charlie Marsh	f45d1c2b84	Remove HashMap and HashSet for known-standard-library detection (#5345 ) ## Summary This is a lot more concise and probably much more performant (with fewer instructions).	2023-06-23 19:59:03 +00:00
konstin	4b65446de6	Refactor magic trailing comma (#5339 ) ## Summary This is small refactoring to reuse the code that detects the magic trailing comma across functions. I make this change now to avoid copying code in a later PR. @MichaReiser is planning on making a larger refactoring later that integrates with the join nodes builder ## Test Plan No functional changes. The magic trailing comma behaviour is checked by the fixtures.	2023-06-23 18:53:55 +02:00
James Berry	f85eb709e2	Visit AugAssign target after value (#5325 ) ## Summary When visiting AugAssign in evaluation order, the AugAssign `target` should be visited after it's `value`. Based on my testing, the pseudo code for `a += b` is effectively: ```python tmp = a a = tmp.__iadd__(b) ``` That is, an ideal traversal order would look something like this: 1. load a 2. b 3. op 4. store a But, there is only a single AST node which captures `a` in the statement `a += b`, so it cannot be traversed both before and after the traversal of `b` and the `op`. Nonetheless, I think traversing `a` after `b` and the `op` makes the most sense for a number of reasons: 1. All the other assignment expressions traverse their `value`s before their `target`s. Having `AugAssign` traverse in the same order would be more consistent. 2. Within the AST, the `ctx` of the `target` for an `AugAssign` is `Store` (though technically this is a `Load` and `Store` operation, the AST only indicates it as a `Store`). Since the the store portion of the `AugAssign` occurs last, I think it makes sense to traverse the `target` last as well. The effect of this is marginal, but it may have an impact on the behavior of #5271.	2023-06-23 09:54:54 -04:00
Thomas de Zeeuw	1c638264b2	Keep track of when files are last seen in the cache (#5214 ) ## Summary And remove cached files that we haven't seen for a certain period of time, currently 30 days. For the last seen timestamp we actually use an `u64`, it's smaller on disk than `SystemTime` (which size is OS dependent) and fits in an `AtomicU64` which we can use to update it without locks. ## Test Plan Added a new unit test, run by `cargo test`.	2023-06-23 15:40:35 +02:00
Micha Reiser	2dfa6ff58d	Fix unstable set comprehension formatting (#5327 )	2023-06-23 11:50:24 +02:00
konstin	930f03de98	Don't mistake a following if for an elif (#5296 ) In the following code, the comment used to get wrongly associated with the `if False` since it looked like an elif. This fixes it by checking the indentation and adding a regression test ```python if True: pass else: # Comment if False: pass pass ``` Originally found in `1570b94a02/gradio/external.py (L478)`	2023-06-23 10:07:28 +02:00
Micha Reiser	c52aa8f065	Basic string formatting <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR implements formatting for non-f-string Strings that do not use implicit concatenation. Docstring formatting is out of the scope of this PR. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan I added a few tests for simple string literals. ## Performance Ouch. This is hitting performance somewhat hard. This is probably because we now iterate each string a couple of times: 1. To detect if it is an implicit string continuation 2. To detect if the string contains any new lines 3. To detect the preferred quote 4. To normalize the string Edit: I integrated the detection of newlines into the preferred quote detection so that we only iterate the string three time. We can probably do better by merging the implicit string continuation with the quote detection and new line detection by iterating till the end of the string part and returning the offset. We then use our simple tokenizer to skip over any comments or whitespace until we find the first non trivia token. From there we keep continue doing this in a loop until we reach the end o the string. I'll leave this improvement for later.	2023-06-23 09:46:05 +02:00
Micha Reiser	3e12bdff45	Format Compare Op <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR adds basic formatting for compare operations. The implementation currently breaks diffeently when nesting binary like expressions. I haven't yet figured out what Black's logic is in that case but I think that this by itself is already an improvement worth merging. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan I added a few new tests <!-- How was it tested? -->	2023-06-23 09:35:29 +02:00
James Berry	2142bf6141	Fix annotation and format spec visitors (#5324 ) ## Summary The `Visitor` and `preorder::Visitor` traits provide some convenience functions, `visit_annotation` and `visit_format_spec`, for handling annotation and format spec expressions respectively. Both of these functions accept an `&Expr` and have a default implementation which delegates to `walk_expr`. The problem with this approach is that any custom handling done in `visit_expr` will be skipped for annotations and format specs. Instead, to capture any custom logic implemented in `visit_expr`, both of these function's default implementations should delegate to `visit_expr` instead of `walk_expr`. ## Example Consider the below `Visitor` implementation: ```rust impl<'a> Visitor<'a> for Example<'a> { fn visit_expr(&mut self, expr: &'a Expr) { match expr { Expr::Name(ExprName { id, .. }) => println!("Visiting {:?}", id), _ => walk_expr(self, expr), } } } ``` Run on the following Python snippet: ```python a: b ``` I would expect such a visitor to print the following: ``` Visiting b Visiting a ``` But it instead prints the following: ``` Visiting a ``` Our custom `visit_expr` handler is not invoked for the annotation. ## Test Plan Tests added in #5271 caught this behavior.	2023-06-23 03:55:42 +00:00
Tom Kuson	1cf307c34c	Fix `collection-literal-concatenation` documentation (#5320 ) ## Summary Move `collection-literal-concatenation` markdown documentation to the correct place. Fixes error in #5262. ## Test Plan `python scripts/check_docs_formatted.py`	2023-06-22 18:37:54 -04:00
Charlie Marsh	7819b95d7f	Avoid syntax errors when removing f-string prefixes (#5319 ) Closes https://github.com/astral-sh/ruff/issues/5281. Closes https://github.com/astral-sh/ruff/issues/4827.	2023-06-22 17:21:09 -04:00
Lukas Mayrhofer	4a81cfc51a	Allow `@Author` format for "Missing Author" rule in `flake8-todos` (#4903 ) ## Summary The TD-002 rule "Missing Author" was updated to allow another format using "@". This reflects the current 0.3.0 version of flake8-todos.	2023-06-22 20:53:58 +00:00
qdegraaf	38e618cd18	[`perflint`] Add `PERF101` with autofix (#5121 ) ## Summary Adds PERF101 which checks for unnecessary casts to `list` in for loops. NOTE: Is not fully equal to its upstream implementation as this implementation does not flag based on type annotations (i.e.): ```python def foo(x: List[str]): for y in list(x): ... ``` With the current set-up it's quite hard to get the annotation from a function arg from its binding. Problem is best considered broader than this implementation. ## Test Plan Added fixture. ## Issue links Refers: https://github.com/astral-sh/ruff/issues/4789 --------- Co-authored-by: Charlie Marsh <charlie.r.marsh@gmail.com>	2023-06-22 20:44:26 +00:00
Charlie Marsh	8bc7378002	Add `PythonVersion::Py312` (#5316 ) Closes #5310.	2023-06-22 20:01:07 +00:00
Charlie Marsh	cdbd0bd5cd	Respect `abc` decorators when classifying function types (#5315 ) Closes #5307.	2023-06-22 19:52:36 +00:00
Charlie Marsh	5f88ff8a96	Allow `__slots__` assignments in `mutable-class-default` (#5314 ) Closes #5309.	2023-06-22 19:40:54 +00:00
Charlie Marsh	1c2be54b4a	Support `pydantic.BaseSettings` in `mutable-class-default` (#5312 ) Closes #5308.	2023-06-22 19:27:05 +00:00
konstin	03694ef649	More stability checker options (#5299 ) ## Summary This contains three changes: * repos in `check_ecosystem.py` are stored as `org:name` instead of `org/name` to create a flat directory layout * `check_ecosystem.py` performs a maximum of 50 parallel jobs at the same time to avoid consuming to much RAM * `check-formatter-stability` gets a new option `--multi-project` so it's possible to do `cargo run --bin ruff_dev -- check-formatter-stability --multi-project target/checkouts` With these three changes it becomes easy to check the formatter stability over a larger number of repositories. This is part of the integration of integrating formatter regressions checks into the ecosystem checks. ## Test Plan ```shell python scripts/check_ecosystem.py --checkouts target/checkouts --projects github_search.jsonl -v $(which true) $(which true) cargo run --bin ruff_dev -- check-formatter-stability --multi-project target/checkouts ```	2023-06-22 15:48:11 +00:00
Tom Kuson	eaa10ad2d9	Fix `deprecated-import` false positives (#5291 ) ## Summary Remove recommendations to replace `typing_extensions.dataclass_transform` and `typing_extensions.SupportsIndex` with their `typing` library counterparts. Closes #5112. ## Test Plan Added extra checks to the test fixture. `cargo test`	2023-06-22 15:34:44 +00:00
Evan Rittenhouse	84259f5440	Add Applicability to pycodestyle (#5282 )	2023-06-22 11:25:20 -04:00
konstin	d407165aa7	Fix formatter panic with comment after parenthesized dict value (#5293 ) ## Summary This snippet used to panic because it expected to see a comma or something similar after the `2` but met the closing parentheses that is not part of the range and panicked ```python a = { 1: (2), # comment 3: True, } ``` Originally found in `636a717ef0/testing/marionette/client/marionette_driver/geckoinstance.py (L109)` This snippet is also the test plan.	2023-06-22 16:52:48 +02:00
Micha Reiser	f7e1cf4b51	Format `class` definitions (#5289 )	2023-06-22 09:09:43 +00:00
konstin	7d4f8e59da	Improve FormatExprCall dummy (#5290 ) This solves an instability when formatting cpython. It also introduces another one, but i think it's still a worthwhile change for now. There's no proper testing since this is just a dummy.	2023-06-22 10:59:30 +02:00
Charlie Marsh	1c0a3a467f	Bump version to 0.0.275 (#5276 )	2023-06-21 21:53:37 -04:00
Charlie Marsh	6b8b318d6b	Use `mod tests` consistently (#5278 ) As per the Rust documentation.	2023-06-22 01:50:28 +00:00
Charlie Marsh	c0c59b82ec	Use 'Checks for uses' consistently (#5279 )	2023-06-22 01:44:52 +00:00
Charlie Marsh	ac146e11f0	Allow `typing.Final` for `mutable-class-default annotations` (`RUF012`) (#5274 ) ## Summary See: https://github.com/astral-sh/ruff/issues/5243.	2023-06-22 00:24:53 +00:00
Charlie Marsh	1229600e1d	Ignore Pydantic classes when evaluating `mutable-class-default` (`RUF012`) (#5273 ) Closes https://github.com/astral-sh/ruff/issues/5272.	2023-06-21 23:59:44 +00:00
Micha Reiser	ccf34aae8c	Format Attribute Expression (#5259 )	2023-06-21 21:33:53 +00:00
Tom Kuson	341b12d918	Complete documentation for Ruff-specific rules (#5262 ) ## Summary Completes the documentation for the Ruff-specific ruleset. Related to #2646. ## Test Plan `python scripts/check_docs_formatted.py`	2023-06-21 21:30:44 +00:00
Micha Reiser	3d7411bfaf	Use trait for labels instead of `TypeId` (#5270 )	2023-06-21 22:26:09 +01:00
David Szotten	1eccbbb60e	Format StmtFor (#5163 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary format StmtFor still trying to learn how to help out with the formatter. trying something slightly more advanced than [break](#5158) mostly copied form StmtWhile ## Test Plan snapshots	2023-06-21 23:00:31 +02:00
Charlie Marsh	e71f044f0d	Avoid including nursery rules in linter-level selectors (#5268 ) ## Summary Ensures that `--select PL` and `--select PLC` don't include `PLC1901`. Previously, `--select PL` _did_, because it's a "linter-level selector" (`--select PLC` is viewed as selecting the `C` prefix from `PL`), and we were missing this filtering path.	2023-06-21 20:11:40 +00:00
James Berry	f194572be8	Remove visit_arg_with_default (#5265 ) ## Summary This is a follow up to #5221. Turns out it was easy to restructure the visitor to get the right order, I'm just dumb 🤷‍♂️ I've removed `visit_arg_with_default` entirely from the `Visitor`, although it still exists as part of `preorder::Visitor`.	2023-06-21 16:00:24 -04:00
Charlie Marsh	62e2c46f98	Move `compare-to-empty-string` to nursery (#5264 ) ## Summary This rule has too many false positives. It has parity with the Pylint version, but the Pylint version is part of an [extension](https://pylint.readthedocs.io/en/stable/user_guide/messages/convention/compare-to-empty-string.html), and so requires explicit opt-in. I'm moving this rule to the nursery to require explicit opt-in, as with Pylint. Closes #4282.	2023-06-21 19:47:02 +00:00
konstin	9419d3f9c8	Special `ExprTuple` formatting option for `for`-loops (#5175 ) ## Motivation While black keeps parentheses nearly everywhere, the notable exception is in the body of for loops: ```python for (a, b) in x: pass ``` becomes ```python for a, b in x: pass ``` This currently blocks #5163, which this PR should unblock. ## Solution This changes the `ExprTuple` formatting option to include one additional option that removes the parentheses when not using magic trailing comma and not breaking. It is supposed to be used through ```rust #[derive(Debug)] struct ExprTupleWithoutParentheses<'a>(&'a Expr); impl Format<PyFormatContext<'_>> for ExprTupleWithoutParentheses<'_> { fn fmt(&self, f: &mut Formatter<PyFormatContext<'_>>) -> FormatResult<()> { match self.0 { Expr::Tuple(expr_tuple) => expr_tuple .format() .with_options(TupleParentheses::StripInsideForLoop) .fmt(f), other => other.format().with_options(Parenthesize::IfBreaks).fmt(f), } } } ``` ## Testing The for loop formatting isn't merged due to missing this (and i didn't want to create more git weirdness across two people), but I've confirmed that when applying this to while loops instead of for loops, then ```rust write!( f, [ text("while"), space(), ExprTupleWithoutParentheses(test.as_ref()), text(":"), trailing_comments(trailing_condition_comments), block_indent(&body.format()) ] )?; ``` makes ```python while (a, b): pass while ( ajssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssa, b, ): pass while (a,b,): pass ``` formatted as ```python while a, b: pass while ( ajssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssa, b, ): pass while ( a, b, ): pass ```	2023-06-21 21:17:47 +02:00
James Berry	9b5fb8f38f	Fix AST visitor traversal order (#5221 ) ## Summary According to the AST visitor documentation, the AST visitor "visits all nodes in the AST recursively in evaluation-order". However, the current traversal fails to meet this specification in a few places. ### Function traversal ```python order = [] @(order.append("decorator") or (lambda x: x)) def f( posonly: order.append("posonly annotation") = order.append("posonly default"), /, arg: order.append("arg annotation") = order.append("arg default"), args: order.append("vararg annotation"), kwarg: order.append("kwarg annotation") = order.append("kwarg default"), *kwargs: order.append("kwarg annotation") ) -> order.append("return annotation"): pass print(order) ``` Executing the above snippet using CPython 3.10.6 prints the following result (formatted for readability): ```python [ 'decorator', 'posonly default', 'arg default', 'kwarg default', 'arg annotation', 'posonly annotation', 'vararg annotation', 'kwarg annotation', 'kwarg annotation', 'return annotation', ] ``` Here we can see that decorators are evaluated first, followed by argument defaults, and annotations are last. The current traversal of a function's AST does not align with this order. ### Annotated assignment traversal ```python order = [] x: order.append("annotation") = order.append("expression") print(order) ``` Executing the above snippet using CPython 3.10.6 prints the following result: ```python ['expression', 'annotation'] ``` Here we can see that an annotated assignments annotation gets evaluated after the assignment's expression. The current traversal of an annotated assignment's AST does not align with this order. ## Why? I'm slowly working on #3946 and porting over some of the logic and tests from ssort. ssort is very sensitive to AST traversal order, so ensuring the utmost correctness here is important. ## Test Plan There doesn't seem to be existing tests for the AST visitor, so I didn't bother adding tests for these very subtle changes. However, this behavior will be captured in the tests for the PR which addresses #3946.	2023-06-21 14:40:58 -04:00
konstin	d7c7484618	Format function argument separator comments (#5211 ) ## Summary This is a complete rewrite of the handling of `/` and `*` comment handling in function signatures. The key problem is that slash and star don't have a note. We now parse out the positions of slash and star and their respective preceding and following note. I've left code comments for each possible case of function signature structure and comment placement ## Test Plan I extended the function statement fixtures with cases that i found. If you have more weird edge cases your input would be appreciated.	2023-06-21 17:56:47 +00:00
konstin	bc63cc9b3c	Fix remaining CPython formatter errors except for function argument separator comments (#5210 ) ## Summary This fixes two problems discovered when trying to format the cpython repo with `cargo run --bin ruff_dev -- check-formatter-stability projects/cpython`: The first is to ignore try/except trailing comments for now since they lead to unstable formatting on the dummy. The second is to avoid dropping trailing if comments through placement: This changes the placement to keep a comment trailing an if-elif or if-elif-else to keep the comment a trailing comment on the entire if. Previously the last comment would have been lost. ```python if "first if": pass elif "first elif": pass ``` The last remaining problem in cpython so far is function signature argument separator comment placement which is its own PR on top of this. ## Test Plan I added test fixtures of minimized examples with links back to the original cpython location	2023-06-21 19:45:53 +02:00
Charlie Marsh	bf1a94ee54	Initialize caches for packages and standalone files (#5237 ) ## Summary While fixing https://github.com/astral-sh/ruff/pull/5233, I noticed that in FastAPI, 343 out of 823 files weren't hitting the cache. It turns out these are standalone files in the documentation that lack a "package root". Later, when looking up the cache entries, we fallback to the package directory. This PR ensures that we initialize the cache for both kinds of files: those that are in a package, and those that aren't. The total size of the FastAPI cache for me is now 388K. I also suspect that this approach is much faster than as initially written, since before, we were probably initializing one cache per _directory_. ## Test Plan Ran `cargo run -p ruff_cli -- check ../fastapi --verbose`; verified that, on second execution, there were no "Checking" entries in the logs.	2023-06-21 17:29:09 +00:00
Dhruv Manilawala	c792c10eaa	Add support for nested quoted annotations in RUF013 (#5254 ) ## Summary This is a follow up on #5235 to add support for nested quoted annotations for RUF013. ## Test Plan `cargo test`	2023-06-21 17:25:27 +00:00
Evan Rittenhouse	f9ffb3d50d	Add Applicability to pylint (#5251 )	2023-06-21 17:22:01 +00:00
Evan Rittenhouse	2b76d88bd3	Add Applicability to pandas_vet (#5252 )	2023-06-21 17:12:47 +00:00
Evan Rittenhouse	41ef17b007	Add Applicability to pyflakes (#5253 )	2023-06-21 17:04:55 +00:00
Charlie Marsh	0aa21277c6	Improve documentation for overlong-line rules (#5260 ) Closes https://github.com/astral-sh/ruff/issues/5248.	2023-06-21 17:02:20 +00:00
Charlie Marsh	ecf61d49fa	Restore existing bindings when unbinding caught exceptions (#5256 ) ## Summary In the latest release, we made some improvements to the semantic model, but our modifications to exception-unbinding are causing some false-positives. For example: ```py try: v = 3 except ImportError as v: print(v) else: print(v) ``` In the latest release, we started unbinding `v` after the `except` handler. (We used to restore the existing binding, the `v = 3`, but this was quite complicated.) Because we don't have full branch analysis, we can't then know that `v` is still bound in the `else` branch. The solution here modifies `resolve_read` to skip-lookup when hitting unbound exceptions. So when store the "unbind" for `except ImportError as v`, we save the binding that it shadowed `v = 3`, and skip to that. Closes #5249. Closes #5250.	2023-06-21 12:53:58 -04:00
Micha Reiser	e47aa468d5	Format Identifier (#5255 )	2023-06-21 17:35:37 +02:00
konstin	6155fd647d	Format Slice Expressions (#5047 ) This formats slice expressions and subscript expressions. Spaces around the colons follows the same rules as black (https://black.readthedocs.io/en/stable/the_black_code_style/current_style.html#slices): ```python e00 = "e"[:] e01 = "e"[:1] e02 = "e"[: a()] e10 = "e"[1:] e11 = "e"[1:1] e12 = "e"[1 : a()] e20 = "e"[a() :] e21 = "e"[a() : 1] e22 = "e"[a() : a()] e200 = "e"[a() : :] e201 = "e"[a() :: 1] e202 = "e"[a() :: a()] e210 = "e"[a() : 1 :] ``` Comment placement is different due to our very different infrastructure. If we have explicit bounds (e.g. `x[1:2]`) all comments get assigned as leading or trailing to the bound expression. If a bound is missing `[:]`, comments get marked as dangling and placed in the same section as they were originally in: ```python x = "x"[ # a # b : # c # d ] ``` to ```python x = "x"[ # a # b : # c # d ] ``` Except for the potential trailing end-of-line comments, all comments get formatted on their own line. This can be improved by keeping end-of-line comments after the opening bracket or after a colon as such but the changes were already complex enough. I added tests for comment placement and spaces.	2023-06-21 15:09:39 +00:00
Charlie Marsh	10885d09a1	Add support for top-level quoted annotations in RUF013 (#5235 ) ## Summary This PR adds support for autofixing annotations like: ```python def f(x: "int" = None): ... ``` However, we don't yet support nested quotes, like: ```python def f(x: Union["int", "str"] = None): ... ``` Closes #5231.	2023-06-21 10:23:37 -04:00
konstin	44156f6962	Improve debuggability of `place_comment` (#5209 ) ## Summary I found it hard to figure out which function decides placement for a specific comment. An explicit loop makes this easier to debug ## Test Plan There should be no functional changes, no changes to the formatting of the fixtures.	2023-06-21 09:52:13 +00:00
konstin	f551c9aad2	Unify benchmarking and profiling docs (#5145 ) This moves all docs about benchmarking and profiling into CONTRIBUTING.md by moving the readme of `ruff_benchmark` and adding more information on profiling. We need to somehow consolidate that documentation, but i'm not convinced that this is the best way (i tried subpages in mkdocs, but that didn't seem good either), so i'm happy to take suggestions.	2023-06-21 09:39:56 +00:00
Micha Reiser	653dbb6d17	Format BoolOp (#4986 )	2023-06-21 09:27:57 +00:00
konstin	db301c14bd	Consistently name comment own line/end-of-line `line_position()` (#5215 ) ## Summary Previously, `DecoratedComment` used `text_position()` and `SourceComment` used `position()`. This PR unifies this to `line_position` everywhere. ## Test Plan This is a rename refactoring.	2023-06-21 11:04:56 +02:00
Micha Reiser	1336ca601b	Format `UnaryExpr` <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR adds basic formatting for unary expressions. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan I added a new `unary.py` with custom test cases	2023-06-21 10:09:47 +02:00
Micha Reiser	3973836420	Correctly handle left/right breaking of binary expression <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary Black supports for layouts when it comes to breaking binary expressions: ```rust #[derive(Copy, Clone, Debug, Eq, PartialEq)] enum BinaryLayout { /// Put each operand on their own line if either side expands Default, /// Try to expand the left to make it fit. Add parentheses if the left or right don't fit. /// ///```python /// [ /// a, /// b /// ] & c ///``` ExpandLeft, /// Try to expand the right to make it fix. Add parentheses if the left or right don't fit. /// /// ```python /// a & [ /// b, /// c /// ] /// ``` ExpandRight, /// Both the left and right side can be expanded. Try in the following order: /// * expand the right side /// * expand the left side /// * expand both sides /// /// to make the expression fit /// /// ```python /// [ /// a, /// b /// ] & [ /// c, /// d /// ] /// ``` ExpandRightThenLeft, } ``` Our current implementation only handles `ExpandRight` and `Default` correctly. This PR adds support for `ExpandRightThenLeft` and `ExpandLeft`. ## Test Plan I added tests that play through all 4 binary expression layouts.	2023-06-21 09:40:05 +02:00
Charlie Marsh	e0339b538b	Bump version to 0.0.274 (#5230 )	2023-06-20 22:12:32 -04:00
Charlie Marsh	07b6b7401f	Move `copyright` rules to `flake8_copyright` module (#5236 ) ## Summary I initially wanted this category to be more general and decoupled from the plugin, but I got some feedback that the titling felt inconsistent with others.	2023-06-21 01:56:40 +00:00
Charlie Marsh	1db7d9e759	Avoid erroneous RUF013 violations for quoted annotations (#5234 ) ## Summary Temporary fix for #5231: if we can't flag and fix these properly, just disabling them for now. \cc @dhruvmanila ## Test Plan `cargo test`	2023-06-21 01:29:12 +00:00
Charlie Marsh	621e9ace88	Use package roots rather than package members for cache initialization (#5233 ) ## Summary This is a proper fix for the issue patched-over in https://github.com/astral-sh/ruff/pull/5229, thanks to an extremely helpful repro from @tlambert03 in that thread. It looks like we were using the keys of `package_roots` rather than the values to initialize the cache -- but it's a map from package to package root. ## Test Plan Reverted #5229, then ran through the plan that @tlambert03 included in https://github.com/astral-sh/ruff/pull/5229#issuecomment-1599723226. Verified the panic before but not after this change.	2023-06-20 21:21:45 -04:00
Charlie Marsh	f9f77cf617	Revert change to `RUF010` to remove unnecessary `str` calls (#5232 ) ## Summary This PR reverts #4971 (`aba073a791`). It turns out that `f"{str(x)}"` and `f"{x}"` are often but not exactly equivalent, and performing that conversion automatically can lead to subtle bugs, See the discussion in https://github.com/astral-sh/ruff/issues/4958.	2023-06-20 21:15:17 -04:00
Charlie Marsh	1a2bd984f2	Avoid `.unwrap()` on cache access (#5229 ) ## Summary I haven't been able to determine why / when this is happening, but in some cases, users are reporting that this `unwrap()` is causing a panic. It's fine to just return `None` here and fallback to "No cache", certainly better than panicking (while we figure out the edge case). Closes #5225. Closes #5228.	2023-06-20 19:01:21 -04:00
Tom Kuson	4717d0779f	Complete `flake8-debugger` documentation (#5223 ) ## Summary Completes the documentation for the `flake8-debugger` ruleset. Related to #2646. ## Test Plan `python scripts/check_docs_formatted.py`	2023-06-20 21:04:32 +00:00
Florian Stasse	07409ce201	Fixed typo in numpy deprecated type alias rule documentation (#5224 ) ## Summary It is a very simple typo fix in the "numy deprecated type alias" documentation.	2023-06-20 16:51:51 -04:00
Micha Reiser	e520a3a721	Fix ArgWithDefault comments handling (#5204 )	2023-06-20 20:48:07 +00:00
Charlie Marsh	fde5dbc9aa	Bump version to 0.0.273 (#5218 )	2023-06-20 14:37:28 -04:00
Charlie Marsh	30734f06fd	Support parenthesized expressions when splitting compound assertions (#5219 ) ## Summary I'm looking into the Black stability tests, and here's one failing case. We split `assert a and (b and c)` into: ```python assert a assert (b and c) ``` We fail to split `assert (b and c)` due to the parentheses. But Black then removes then, and when running Ruff again, we get: ```python assert a assert b assert c ``` This PR just enables us to fix to this in one pass.	2023-06-20 13:47:01 -04:00
Charlie Marsh	4547002eb7	Remove defaults from fixtures/pyproject.toml (#5217 ) ## Summary These should be encoded in the tests themselves, rather than here. In fact, I think they're all unused?	2023-06-20 13:16:00 -04:00
Charlie Marsh	310abc769d	Move `StarImport` to its own module (#5186 )	2023-06-20 13:12:46 -04:00
Micha Reiser	b369288833	Accept any `Into<AnyNodeRef>` as `Comments` arguments (#5205 )	2023-06-20 16:49:21 +00:00
Dhruv Manilawala	6f7d3cc798	Add option (`-o`/`--output-file`) to write output to a file (#4950 ) ## Summary A new CLI option (`-o`/`--output-file`) to write output to a file instead of stdout. Major change is to remove the lock acquired on stdout. The argument is that the output is buffered and thus the lock is acquired only when writing a block (8kb). As per the benchmark below there is a slight performance penalty. Reference: https://rustmagazine.org/issue-3/javascript-compiler/#printing-is-slow ## Benchmarks _Output is truncated to only contain useful information:_ Command: `check --isolated --no-cache --select=ALL --show-source ./test-repos/cpython"` Latest HEAD (`361d45f2b2`) with and without the manual lock on stdout: ```console Benchmark 1: With lock Time (mean ± σ): 5.687 s ± 0.075 s [User: 17.110 s, System: 0.486 s] Range (min … max): 5.615 s … 5.860 s 10 runs Benchmark 2: Without lock Time (mean ± σ): 5.719 s ± 0.064 s [User: 17.095 s, System: 0.491 s] Range (min … max): 5.640 s … 5.865 s 10 runs Summary (1) ran 1.01 ± 0.02 times faster than (2) ``` This PR: ```console Benchmark 1: This PR Time (mean ± σ): 5.855 s ± 0.058 s [User: 17.197 s, System: 0.491 s] Range (min … max): 5.786 s … 5.987 s 10 runs Benchmark 2: Latest HEAD with lock Time (mean ± σ): 5.645 s ± 0.033 s [User: 16.922 s, System: 0.495 s] Range (min … max): 5.600 s … 5.712 s 10 runs Summary (2) ran 1.04 ± 0.01 times faster than (1) ``` ## Test Plan Run all of the commands which gives output with and without the `--output-file=ruff.out` option: * `--show-settings` * `--show-files` * `--show-fixes` * `--diff` * `--select=ALL` * `--select=All --show-source` * `--watch` (only stdout allowed) resolves: #4754	2023-06-20 22:16:49 +05:30
Micha Reiser	d9e59b21cd	Add BestFittingMode (#5184 ) ## Summary Black supports for layouts when it comes to breaking binary expressions: ```rust #[derive(Copy, Clone, Debug, Eq, PartialEq)] enum BinaryLayout { /// Put each operand on their own line if either side expands Default, /// Try to expand the left to make it fit. Add parentheses if the left or right don't fit. /// ///```python /// [ /// a, /// b /// ] & c ///``` ExpandLeft, /// Try to expand the right to make it fix. Add parentheses if the left or right don't fit. /// /// ```python /// a & [ /// b, /// c /// ] /// ``` ExpandRight, /// Both the left and right side can be expanded. Try in the following order: /// * expand the right side /// * expand the left side /// * expand both sides /// /// to make the expression fit /// /// ```python /// [ /// a, /// b /// ] & [ /// c, /// d /// ] /// ``` ExpandRightThenLeft, } ``` Our current implementation only handles `ExpandRight` and `Default` correctly. `ExpandLeft` turns out to be surprisingly hard. This PR adds a new `BestFittingMode` parameter to `BestFitting` to support `ExpandLeft`. There are 3 variants that `ExpandLeft` must support: Variant 1: Everything fits on the line (easy) ```python [a, b] + c ``` Variant 2: Left breaks, but right fits on the line. Doesn't need parentheses ```python [ a, b ] + c ``` Variant 3: The left breaks, but there's still not enough space for the right hand side. Parenthesize the whole expression: ```python ( [ a, b ] + ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc ) ``` Solving Variant 1 and 2 on their own is straightforward The printer gives us this behavior by nesting right inside of the group of left: ``` group(&format_args![ if_group_breaks(&text("(")), soft_block_indent(&group(&format_args![ left, soft_line_break_or_space(), op, space(), group(&right) ])), if_group_breaks(&text(")")) ]) ``` The fundamental problem is that the outer group, which adds the parentheses, always breaks if the left side breaks. That means, we end up with ```python ( [ a, b ] + c ) ``` which is not what we want (we only want parentheses if the right side doesn't fit). Okay, so nesting groups don't work because of the outer parentheses. Sequencing groups doesn't work because it results in a right-to-left breaking which is the opposite of what we want. Could we use best fitting? Almost! ``` best_fitting![ // All flat format_args![left, space(), op, space(), right], // Break left format_args!(group(&left).should_expand(true), space(), op, space(), right], // Break all format_args![ text("("), block_indent!(&format_args![ left, hard_line_break(), op, space() right ]) ] ] ``` I hope I managed to write this up correctly. The problem is that the printer never reaches the 3rd variant because the second variant always fits: * The `group(&left).should_expand(true)` changes the group so that all `soft_line_breaks` are turned into hard line breaks. This is necessary because we want to test if the content fits if we break after the `[`. * Now, the whole idea of `best_fitting` is that you can pretend that some content fits on the line when it actually does not. The way this works is that the printer only tests if all the content of the variant up to the first line break fits on the line (we insert that line break by using `should_expand(true))`. The printer doesn't care whether the rest `a\n, b\n ] + c` all fits on (multiple?) lines. Why does breaking right work but not breaking the left? The difference is that we can make the decision whether to parenthesis the expression based on the left expression. We can't do this for breaking left because the decision whether to insert parentheses or not would depend on a lookahead: will the right side break. We simply don't know this yet when printing the parentheses (it would work for the right parentheses but not for the left and indent). What we kind of want here is to tell the printer: Look, what comes here may or may not fit on a single line but we don't care. Simply test that what comes after fits on a line. This PR adds a new `BestFittingMode` that has a new `AllLines` option that gives us the desired behavior of testing all content and not just up to the first line break. ## Test Plan I added a new example to `BestFitting::with_mode`	2023-06-20 18:16:01 +02:00
Tom Kuson	6929fcc55f	Complete `flake8-bugbear` documentation (#5178 ) ## Summary Completes the documentation for the `flake8-bugbear` ruleset. Related to #2646. ## Test Plan `python scripts/check_docs_formatted.py` --------- Co-authored-by: Charlie Marsh <charlie.r.marsh@gmail.com>	2023-06-20 12:10:58 -04:00
Charlie Marsh	7bc33a8d5f	Remove identifier lexing in favor of parser ranges (#5195 ) ## Summary Now that all identifiers include ranges (#5194), we can remove a ton of this "custom lexing" code that we have to sketchily extract identifier ranges from source. ## Test Plan `cargo test`	2023-06-20 12:07:29 -04:00
Charlie Marsh	6331598511	Upgrade `RustPython` to access ranged names (#5194 ) ## Summary In https://github.com/astral-sh/RustPython-Parser/pull/8, we modified RustPython to include ranges for any identifiers that aren't `Expr::Name` (which already has an identifier). For example, the `e` in `except ValueError as e` was previously un-ranged. To extract its range, we had to do some lexing of our own. This change should improve performance and let us remove a bunch of code. ## Test Plan `cargo test`	2023-06-20 15:43:38 +00:00
Thomas de Zeeuw	17f1ecd56e	Open cache files in parallel (#5120 ) ## Summary Open cache files in parallel (again), brings the performance back to be roughly equal to the old implementation. ## Test Plan Existing tests should keep working.	2023-06-20 17:43:09 +02:00
Dhruv Manilawala	062b6e5c2b	Handle trailing newline in Jupyter notebook JSON string (#5202 ) ## Summary Handle trailing newline in Jupyter Notebook JSON string similar to how `black` does it. ## Test Plan Add test cases when the JSON string for notebook ends with and without a newline. resolves: #5190	2023-06-20 10:19:11 +00:00
David Szotten	773e79b481	basic formatting for ExprDict (#5167 )	2023-06-20 09:25:08 +00:00
Charlie Marsh	4cc3cdba16	Use some more wildcard imports in rules (#5201 )	2023-06-20 03:21:08 +00:00
Charlie Marsh	a797e05602	Use a consistent argument ordering for `Indexer` (#5200 )	2023-06-20 02:59:51 +00:00
Evan Rittenhouse	62aa77df31	Fix corner case involving terminal backslash after fixing `W293` (#5172 ) ## Summary Fixes #4404. Consider this file: ```python if True: x = 1; \ <space><space><space> ``` The current implementation of W293 removes the 3 spaces on line 2. This fix changes the file to: ```python if True: x = 1; \ ``` A file can't end in a `\`, according to Python's [lexical analysis](https://docs.python.org/3/reference/lexical_analysis.html), so subsequent iterations of the autofixer fail (the AST-based ones specifically, since they depend on a valid syntax tree and get re-parsed). This patch examines the line before the line checked in `W293`. If its first non-whitespace character is a `\`, the patch will extend the diagnostic's fix range to all whitespace up until the previous line's second non-whitespace character; that is, it deletes all spaces and potential `\`s up until the next non-whitespace character on the previous line. ## Test Plan Ran `cargo run -p ruff_cli -- ~/Downloads/aa.py --fix --select W293,D100 --no-cache` against the above file. This resulted in: ``` /Users/evan/Downloads/aa.py:1:1: D100 Missing docstring in public module Found 2 errors (1 fixed, 1 remaining). ``` The file's contents, after the fix: ```python if True: x = 1;<space> ``` The `\` was removed, leaving the terminal space. The space should be handled by `Rule::TrailingWhitespace`, not `BlankLineWithWhitespace`.	2023-06-20 02:57:24 +00:00
Charlie Marsh	64bd955c58	Remove continuations before trailing semicolons (#5199 ) ## Summary Closes #4828.	2023-06-20 02:22:32 +00:00
Charlie Marsh	8e06140d1d	Remove continuations when deleting statements (#5198 ) ## Summary This PR modifies our statement deletion logic to delete any preceding continuation lines. For example, given: ```py x = 1; \ import os ``` We'll now rewrite to: ```py x = 1; ``` In addition, the logic can now handle multiple preceding continuations (which is unlikely, but valid).	2023-06-19 22:04:28 -04:00
Charlie Marsh	015895bcae	Move copyright rule to nursery (#5197 ) ## Summary I want this to be explicitly opted-into.	2023-06-19 21:41:47 -04:00
Charlie Marsh	36e01ad6eb	Upgrade RustPython (#5192 ) ## Summary This PR upgrade RustPython to pull in the changes to `Arguments` (zip defaults with their identifiers) and all the renames to `CmpOp` and friends.	2023-06-19 21:09:53 +00:00
Charlie Marsh	ddfdc3bb01	Add rule documentation URL to JSON output (#5187 ) ## Summary I want to include URLs to the rule documentation in the LSP (the LSP has a native `code_description` field for this, which, if specified, causes the source to be rendered as a link to the docs). This PR exposes the URL to the documentation in the Ruff JSON output.	2023-06-19 21:09:15 +00:00
Dhruv Manilawala	48f4f2d63d	Maintain consistency when deserializing to JSON (#5114 ) ## Summary Maintain consistency while deserializing Jupyter notebook to JSON. The following changes were made: 1. Use string array to store the source value as that's the default (`5781720423/nbformat/v4/nbjson.py (L56-L57)`) 2. Remove unused structs and enums 3. Reorder the keys in alphabetical order as that's the default. (`5781720423/nbformat/v4/nbjson.py (L51)`) ### Side effect Removing the `preserve_order` feature means that the order of keys in JSON output (`--format json`) will be in alphabetical order. This is because the value is represented using `serde_json::Value` which internally is a `BTreeMap`, thus sorting it as per the string key. For posterity if this turns out to be not ideal, then we could define a struct representing the JSON object and the order of struct fields will determine the order in the JSON string. ## Test Plan Add a test case to assert the raw JSON string.	2023-06-19 23:47:56 +05:30
Charlie Marsh	94abf7f088	Rename `Importation` structs to `Import` (#5185 ) ## Summary I find "Importation" a bit awkward, it may not even be grammatically correct here.	2023-06-19 12:09:10 -04:00
Thomas de Zeeuw	e3c12764f8	Only use a single cache file per Python package (#5117 ) ## Summary This changes the caching design from one cache file per source file, to one cache file per package. This greatly reduces the amount of cache files that are opened and written, while maintaining roughly the same (combined) size as bincode is very compact. Below are some very much not scientific performance tests. It uses projects/sources to check: * small.py: single, 31 bytes Python file with 2 errors. * test.py: single, 43k Python file with 8 errors. * fastapi: FastAPI repo, 1134 files checked, 0 errors. Source \| Before # files \| After # files \| Before size \| After size -------\|-------\|-------\|-------\|------- small.py \| 1 \| 1 \| 20 K \| 20 K test.py \| 1 \| 1 \| 60 K \| 60 K fastapi \| 1134 \| 518 \| 4.5 M \| 2.3 M One question that might come up is why fastapi still has 518 cache files and not 1? That is because this is using the existing package resolution, which sees examples, docs, etc. as separate from the "main" source code (in the fastapi directory in the repo). In this future it might be worth consider switching to a one cache file per repo strategy. This new design is not perfect and does have a number of known issues. First, like the old design it doesn't remove the cache for a source file that has been (re)moved until `ruff clean` is called. Second, this currently uses a large mutex around the mutation of the package cache (e.g. inserting result). This could be (or become) a bottleneck. It's future work to test and improve this (if needed). Third, currently the packages and opened and stored in a sequential loop, this could be done parallel. This is also future work. ## Test Plan Run `ruff check` (with caching enabled) twice on any Python source code and it should produce the same results.	2023-06-19 17:46:13 +02:00
konstin	b8d378b0a3	Add a script that tests formatter stability on repositories (#5055 ) ## Summary We want to ensure that once formatted content stays the same when formatted again, which is known as formatter stability or formatter idempotency, and that the formatter prints syntactically valid code. As our test cases cover only a limited amount of code, this allows checking entire repositories. This adds a new subcommand to `ruff_dev` which can be invoked as `cargo run --bin ruff_dev -- check-formatter-stability <repo>`. While initially only intended to check stability, it has also found cases where the formatter printed invalid syntax or panicked. ## Test Plan Running this on cpython is already identifying bugs (https://github.com/astral-sh/ruff/pull/5089)	2023-06-19 14:13:38 +00:00
konstin	0e028142f4	Explain dangling comments in the formatter (#5170 ) This documentation change improves the section on dangling comments in the formatter. --------- Co-authored-by: David Szotten <davidszotten@gmail.com> Co-authored-by: Micha Reiser <micha@reiser.io>	2023-06-19 14:24:45 +02:00
konstin	361d45f2b2	Add `cargo dev repeat` for profiling (#5144 ) ## Summary This adds a new subcommand that can be used as ```shell cargo build --bin ruff_dev --profile=release-debug perf record -g -F 999 target/release-debug/ruff_dev repeat --repeat 30 --exit-zero --no-cache path/to/cpython > /dev/null flamegraph --perfdata perf.data ``` ## Test Plan This is a ruff internal script. I successfully used it to profile cpython with the instructions above	2023-06-19 11:40:09 +02:00
Charlie Marsh	be11cae619	Fix allowed-ellipsis detection (#5174 ) ## Summary We weren't resetting the `allow_ellipsis` flag properly, which ultimately caused us to treat the semicolon as "unnecessary" rather than "creating a multi-statement line". Closes #5154.	2023-06-19 04:19:41 +00:00
Charlie Marsh	2b82caa163	Detect continuations at start-of-file (#5173 ) ## Summary Given: ```python \ import os ``` Deleting `import os` leaves a syntax error: a file can't end in a continuation. We have code to handle this case, but it failed to pick up continuations at the _very start_ of a file. Closes #5156.	2023-06-19 00:09:02 -04:00
Charlie Marsh	a6cf31cc89	Move `dead_scopes` to `deferred.scopes` (#5171 ) ## Summary This is more consistent with the rest of the `deferred` patterns.	2023-06-18 15:57:38 +00:00
Charlie Marsh	524a2045ba	Enable autofix for unconventional imports rule (#5152 ) ## Summary We can now automatically rewrite `import pandas` to `import pandas as pd`, with minimal changes needed.	2023-06-18 15:56:42 +00:00
Charlie Marsh	a0b750f74b	Move unconventional import rule to post-binding phase (#5151 ) ## Summary This PR moves the "unconventional import alias" rule (which enforces, e.g., that `pandas` is imported as `pd`) to the "dead scopes" phase, after the main linter pass. This (1) avoids an allocation since we no longer need to create the qualified name in the linter pass; and (2) will allow us to autofix it, since we'll have access to all references. ## Test Plan `cargo test` -- all changes are to ranges (which are improvements IMO).	2023-06-18 15:23:40 +00:00
Chris Pryer	195b36c429	Format `continue` statement (#5165 ) <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary Format `continue` statement. ## Test Plan `continue` is used already in some tests, but if a new test is needed I could add it. --------- Co-authored-by: konstin <konstin@mailbox.org>	2023-06-18 11:25:59 +00:00
konstin	5c416e4d9b	Pre commit without cargo and other pre-PR improvements (#5146 ) This tackles three problems: * pre-commit was slow because it ran cargo commands * Improve the clarity on what you need to run to get your PR pass on CI (and make those fast) * You had to compile and run `cargo dev generate-all` separately, which was slow The first change is to remove all cargo commands except running ruff itself from pre-commit. With `cargo run --bin ruff` already compiled it takes about 7s on my machine. It would make sense to also use the ruff pre-commit action here even if we're then lagging a release behind for checking ruff on ruff. The contributing guide is now clear about what you need to run: ```shell cargo clippy --workspace --all-targets --all-features -- -D warnings # Linting... RUFF_UPDATE_SCHEMA=1 cargo test # Testing and updating ruff.schema.json pre-commit run --all-files # rust and python formatting, markdown and python linting, etc. ``` Example timings from my machine: `cargo clippy --workspace --all-targets --all-features -- -D warnings`: 23s `RUFF_UPDATE_SCHEMA=1 cargo test`: 2min (recompiling), 1min (no code changes, this is mainly doc tests) `pre-commit run --all-files`: 7s The exact numbers don't matter so much as the approximate experience (6s is easier to just wait than 1min, esp if you need to fix and rerun). The biggest remaining block seems to be doc tests, i'm surprised i didn't find any solution to speeding them up (nextest simply doesn't run them at all). Also note that the formatter has it's own tests which are much faster since they avoid linking ruff (`cargo test ruff_python_formatter`). The third change is to enable `cargo test` to update the schema. Similar to `INSTA_UPDATE=always`, i've added `RUFF_UPDATE_SCHEMA=1` (name open to bikeshedding), so `RUFF_UPDATE_SCHEMA=1 cargo test` updates the schema, while `cargo test` still fails as expected if the repo isn't up-to-date. --------- Co-authored-by: Dhruv Manilawala <dhruvmanila@gmail.com>	2023-06-18 11:00:42 +00:00
konstin	763d38cafb	Refactor top llvm-lines entry (#5147 ) ## Summary This refactors the top entry in terms of llvm lines, `RuleCodePrefix::iter()`. It's only used for generating the schema and the clap completion so no effect on performance. I've confirmed with ``` CARGO_TARGET_DIR=target-llvm-lines RUSTFLAGS="-Csymbol-mangling-version=v0" cargo llvm-lines -p ruff --lib \| head -n 20 ``` that this indeed remove the method from the list of heaviest symbols in terms of llvm-lines Before: ``` Lines Copies Function name ----- ------ ------------- 1768469 40538 (TOTAL) 10391 (0.6%, 0.6%) 1 (0.0%, 0.0%) <ruff[fa0f2e8ef07114da]::codes::RuleCodePrefix>::iter 8250 (0.5%, 1.1%) 1 (0.0%, 0.0%) <ruff[fa0f2e8ef07114da]::codes::Rule>::noqa_code 7427 (0.4%, 1.5%) 1 (0.0%, 0.0%) <ruff[fa0f2e8ef07114da]::checkers::ast::Checker as ruff_python_ast[c4c9eadfa5741dd4]::visitor::Visitor>::visit_stmt 6536 (0.4%, 1.8%) 1 (0.0%, 0.0%) <<ruff[fa0f2e8ef07114da]::settings::options::Options as serde[1a28808d63625aed]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[1a28808d63625aed]:🇩🇪:Visitor>::visit_map::<toml_edit[de4ca26332d39787]:🇩🇪:spanned::SpannedDeserializer<toml_edit[de4ca26332d39787]:🇩🇪:value::ValueDeserializer>> 6536 (0.4%, 2.2%) 1 (0.0%, 0.0%) <<ruff[fa0f2e8ef07114da]::settings::options::Options as serde[1a28808d63625aed]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[1a28808d63625aed]:🇩🇪:Visitor>::visit_map::<toml_edit[de4ca26332d39787]:🇩🇪:table::TableMapAccess> 6533 (0.4%, 2.6%) 1 (0.0%, 0.0%) <<ruff[fa0f2e8ef07114da]::settings::options::Options as serde[1a28808d63625aed]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[1a28808d63625aed]:🇩🇪:Visitor>::visit_map::<toml_edit[de4ca26332d39787]:🇩🇪:datetime::DatetimeDeserializer> 5727 (0.3%, 2.9%) 1 (0.0%, 0.0%) <ruff[fa0f2e8ef07114da]::checkers::ast::Checker as ruff_python_ast[c4c9eadfa5741dd4]::visitor::Visitor>::visit_expr 4453 (0.3%, 3.2%) 1 (0.0%, 0.0%) ruff[fa0f2e8ef07114da]::flake8_to_ruff::converter::convert 3790 (0.2%, 3.4%) 1 (0.0%, 0.0%) <&ruff[fa0f2e8ef07114da]::registry::Linter as core[da82827a87f140f9]::iter::traits::collect::IntoIterator>::into_iter 3416 (0.2%, 3.6%) 1 (0.0%, 0.0%) <ruff[fa0f2e8ef07114da]::registry::Linter>::code_for_rule 3187 (0.2%, 3.7%) 1 (0.0%, 0.0%) <ruff[fa0f2e8ef07114da]::codes::Rule as core[da82827a87f140f9]::fmt::Debug>::fmt 3185 (0.2%, 3.9%) 1 (0.0%, 0.0%) <&str as core[da82827a87f140f9]::convert::From<&ruff[fa0f2e8ef07114da]::codes::Rule>>::from 3185 (0.2%, 4.1%) 1 (0.0%, 0.0%) <&str as core[da82827a87f140f9]::convert::From<ruff[fa0f2e8ef07114da]::codes::Rule>>::from 3185 (0.2%, 4.3%) 1 (0.0%, 0.0%) <ruff[fa0f2e8ef07114da]::codes::Rule as core[da82827a87f140f9]::convert::AsRef<str>>::as_ref 3183 (0.2%, 4.5%) 1 (0.0%, 0.0%) <ruff[fa0f2e8ef07114da]::codes::RuleIter>::get 2718 (0.2%, 4.6%) 1 (0.0%, 0.0%) <<ruff[fa0f2e8ef07114da]::settings::options::Options as serde[1a28808d63625aed]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[1a28808d63625aed]:🇩🇪:Visitor>::visit_seq::<toml_edit[de4ca26332d39787]:🇩🇪:array::ArraySeqAccess> 2706 (0.2%, 4.8%) 1 (0.0%, 0.0%) <&ruff[fa0f2e8ef07114da]::codes::Pylint as core[da82827a87f140f9]::iter::traits::collect::IntoIterator>::into_iter ``` After: ``` Lines Copies Function name ----- ------ ------------- 1763380 40806 (TOTAL) 8250 (0.5%, 0.5%) 1 (0.0%, 0.0%) <ruff[fa0f2e8ef07114da]::codes::Rule>::noqa_code 7427 (0.4%, 0.9%) 1 (0.0%, 0.0%) <ruff[fa0f2e8ef07114da]::checkers::ast::Checker as ruff_python_ast[c4c9eadfa5741dd4]::visitor::Visitor>::visit_stmt 6536 (0.4%, 1.3%) 1 (0.0%, 0.0%) <<ruff[fa0f2e8ef07114da]::settings::options::Options as serde[1a28808d63625aed]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[1a28808d63625aed]:🇩🇪:Visitor>::visit_map::<toml_edit[de4ca26332d39787]:🇩🇪:spanned::SpannedDeserializer<toml_edit[de4ca26332d39787]:🇩🇪:value::ValueDeserializer>> 6536 (0.4%, 1.6%) 1 (0.0%, 0.0%) <<ruff[fa0f2e8ef07114da]::settings::options::Options as serde[1a28808d63625aed]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[1a28808d63625aed]:🇩🇪:Visitor>::visit_map::<toml_edit[de4ca26332d39787]:🇩🇪:table::TableMapAccess> 6533 (0.4%, 2.0%) 1 (0.0%, 0.0%) <<ruff[fa0f2e8ef07114da]::settings::options::Options as serde[1a28808d63625aed]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[1a28808d63625aed]:🇩🇪:Visitor>::visit_map::<toml_edit[de4ca26332d39787]:🇩🇪:datetime::DatetimeDeserializer> 5727 (0.3%, 2.3%) 1 (0.0%, 0.0%) <ruff[fa0f2e8ef07114da]::checkers::ast::Checker as ruff_python_ast[c4c9eadfa5741dd4]::visitor::Visitor>::visit_expr 4453 (0.3%, 2.6%) 1 (0.0%, 0.0%) ruff[fa0f2e8ef07114da]::flake8_to_ruff::converter::convert 3790 (0.2%, 2.8%) 1 (0.0%, 0.0%) <&ruff[fa0f2e8ef07114da]::registry::Linter as core[da82827a87f140f9]::iter::traits::collect::IntoIterator>::into_iter 3416 (0.2%, 3.0%) 1 (0.0%, 0.0%) <ruff[fa0f2e8ef07114da]::registry::Linter>::code_for_rule 3187 (0.2%, 3.2%) 1 (0.0%, 0.0%) <ruff[fa0f2e8ef07114da]::codes::Rule as core[da82827a87f140f9]::fmt::Debug>::fmt 3185 (0.2%, 3.3%) 1 (0.0%, 0.0%) <&str as core[da82827a87f140f9]::convert::From<&ruff[fa0f2e8ef07114da]::codes::Rule>>::from 3185 (0.2%, 3.5%) 1 (0.0%, 0.0%) <&str as core[da82827a87f140f9]::convert::From<ruff[fa0f2e8ef07114da]::codes::Rule>>::from 3185 (0.2%, 3.7%) 1 (0.0%, 0.0%) <ruff[fa0f2e8ef07114da]::codes::Rule as core[da82827a87f140f9]::convert::AsRef<str>>::as_ref 3183 (0.2%, 3.9%) 1 (0.0%, 0.0%) <ruff[fa0f2e8ef07114da]::codes::RuleIter>::get 2718 (0.2%, 4.0%) 1 (0.0%, 0.0%) <<ruff[fa0f2e8ef07114da]::settings::options::Options as serde[1a28808d63625aed]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[1a28808d63625aed]:🇩🇪:Visitor>::visit_seq::<toml_edit[de4ca26332d39787]:🇩🇪:array::ArraySeqAccess> 2706 (0.2%, 4.2%) 1 (0.0%, 0.0%) <&ruff[fa0f2e8ef07114da]::codes::Pylint as core[da82827a87f140f9]::iter::traits::collect::IntoIterator>::into_iter 2573 (0.1%, 4.3%) 1 (0.0%, 0.0%) <<ruff[fa0f2e8ef07114da]::rules::isort::settings::Options as serde[1a28808d63625aed]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[1a28808d63625aed]:🇩🇪:Visitor>::visit_map::<toml_edit[de4ca26332d39787]:🇩🇪:spanned::SpannedDeserializer<toml_edit[de4ca26332d39787]:🇩🇪:value::ValueDeserializer>> ``` I didn't measure the effect on binary size this time. ## Testing `cargo test` which uses this to generate the schema didn't change	2023-06-18 12:39:06 +02:00
Evan Rittenhouse	653a0ebf2d	Add Applicability to pyupgrade (#5162 ) ## Summary Fixes some of #4184.	2023-06-17 19:33:11 +00:00
Evan Rittenhouse	95448ba669	Add Applicability to isort (#5161 ) ## Summary Fixes some of #4184.	2023-06-17 19:08:11 +00:00
Charlie Marsh	f18e10183f	Add some minor tweaks to latest docs (#5164 )	2023-06-17 17:04:50 +00:00
Tom Kuson	98920909c6	Complete documentation for `flake8-blind-except` and `flake8-raise` rules (#5143 ) ## Summary Completes the documentation for the `flake8-blind-except` and `flake8-raise` rules. Related to #2646. ## Test Plan `python scripts/check_docs_formatted.py`	2023-06-17 12:56:27 -04:00
Evan Rittenhouse	e1e1d2d341	Add Applicability to flynt (#5160 ) ## Summary Fixes some of #4184.	2023-06-17 12:05:43 -04:00
David Szotten	4b9b6829dc	format StmtBreak (#5158 ) ## Summary format `StmtBreak` trying to learn how to help out with the formatter. starting simple ## Test Plan new snapshot test	2023-06-17 10:31:29 +02:00
Charlie Marsh	d0ad1ed0af	Replace static `CallPath` vectors with `matches!` macros (#5148 ) ## Summary After #5140, I audited the codebase for similar patterns (defining a list of `CallPath` entities in a static vector, then looping over them to pattern-match). This PR migrates all other such cases to use `match` and `matches!` where possible. There are a few benefits to this: 1. It more clearly denotes the intended semantics (branches are exclusive). 2. The compiler can help deduplicate the patterns and detect unreachable branches. 3. Performance: in the benchmark below, the all-rules performance is increased by nearly 10%... ## Benchmarks I decided to benchmark against a large file in the Airflow repository with a lot of type annotations ([`views.py`](https://raw.githubusercontent.com/apache/airflow/f03f73100e8a7d6019249889de567cb00e71e457/airflow/www/views.py)): ``` linter/default-rules/airflow/views.py time: [10.871 ms 10.882 ms 10.894 ms] thrpt: [19.739 MiB/s 19.761 MiB/s 19.781 MiB/s] change: time: [-2.7182% -2.5687% -2.4204%] (p = 0.00 < 0.05) thrpt: [+2.4805% +2.6364% +2.7942%] Performance has improved. linter/all-rules/airflow/views.py time: [24.021 ms 24.038 ms 24.062 ms] thrpt: [8.9373 MiB/s 8.9461 MiB/s 8.9527 MiB/s] change: time: [-8.9537% -8.8516% -8.7527%] (p = 0.00 < 0.05) thrpt: [+9.5923% +9.7112% +9.8342%] Performance has improved. Found 12 outliers among 100 measurements (12.00%) 5 (5.00%) high mild 7 (7.00%) high severe ``` The impact is dramatic -- nearly a 10% improvement for `all-rules`.	2023-06-16 17:34:42 +00:00
Charlie Marsh	b3240dbfa2	Avoid propagating `BindingKind::Global` and `BindingKind::Nonlocal` (#5136 ) ## Summary This PR fixes a small quirk in the semantic model. Typically, when we see an import, like `import foo`, we create a `BindingKind::Importation` for it. However, if `foo` has been declared as a `global`, then we propagate the kind forward. So given: ```python global foo import foo ``` We'd create two bindings for `foo`, both with type `global`. This was originally borrowed from Pyflakes, and it exists to help avoid false-positives like: ```python def f(): global foo # Don't mark `foo` as "assigned but unused"! It's a global! foo = 1 ``` This PR removes that behavior, and instead tracks "Does this binding refer to a global?" as a flag. This is much cleaner, since it means we don't "lose" the identity of various bindings. As a very strange example of why this matters, consider: ```python def foo(): global Member from module import Member x: Member = 1 ``` `Member` is only used in a typing context, so we should flag it and say "move it to a `TYPE_CHECKING` block". However, when we go to analyze `from module import Member`, it has `BindingKind::Global`. So we don't even know that it's an import!	2023-06-16 11:06:59 -04:00
Charlie Marsh	fd1dfc3bfa	Add support for global and nonlocal symbol renames (#5134 ) ## Summary In #5074, we introduced an abstraction to support local symbol renames ("local" here refers to "within a module"). However, that abstraction didn't support `global` and `nonlocal` symbols. This PR extends it to those cases. Broadly, there are considerations. First, if we're renaming a symbol in a scope in which it is declared `global` or `nonlocal`. For example, given: ```python x = 1 def foo(): global x ``` Then when renaming `x` in `foo`, we need to detect that it's `global` and instead perform the rename starting from the module scope. Second, when renaming a symbol, we need to determine the scopes in which it is declared `global` or `nonlocal`. This is effectively the inverse of the above: when renaming `x` in the module scope, we need to detect that we should _also_ rename `x` in `foo`. To support these cases, the renaming algorithm was adjusted as follows: - When we start a rename in a scope, determine whether the symbol is declared `global` or `nonlocal` by looking for a `global` or `nonlocal` binding. If it is, start the rename in the defining scope. (This requires storing the defining scope on the `nonlocal` binding, which is new.) - We then perform the rename in the defining scope. - We then check whether the symbol was declared as `global` or `nonlocal` in any scopes, and perform the rename in those scopes too. (Thankfully, this doesn't need to be done recursively.) Closes #5092. ## Test Plan Added some additional snapshot tests.	2023-06-16 14:35:10 +00:00
Charlie Marsh	b9754bd5c5	Add autofix for `Set`-to-`AbstractSet` rewrite using reference tracking (#5074 ) ## Summary This PR enables autofix behavior for the `flake8-pyi` rule that asks you to alias `Set` to `AbstractSet` when importing `collections.abc.Set`. It's not the most important rule, but it's a good isolated test-case for local symbol renaming. The renaming algorithm is outlined in-detail in the `renamer.rs` module. But to demonstrate the behavior, here's the diff when running this fix over a complex file that exercises a few edge cases: ```diff --- a/foo.pyi +++ b/foo.pyi @@ -1,16 +1,16 @@ if True: - from collections.abc import Set + from collections.abc import Set as AbstractSet else: - Set = 1 + AbstractSet = 1 -x: Set = set() +x: AbstractSet = set() -x: Set +x: AbstractSet -del Set +del AbstractSet def f(): - print(Set) + print(AbstractSet) def Set(): pass ``` Making this work required resolving a bunch of edge cases in the semantic model that were causing us to "lose track" of references. For example, the above wasn't possible with our previous approach to handling deletions (#5071). Similarly, the `x: Set` "delayed annotation" tracking was enabled via #5070. And many of these edits would've failed if we hadn't changed `BindingKind` to always match the identifier range (#5090). So it's really the culmination of a bunch of changes over the course of the week. The main outstanding TODO is that this doesn't support `global` or `nonlocal` usages. I'm going to take a look at that tonight, but I'm comfortable merging this as-is. Closes #1106. Closes #5091.	2023-06-16 14:12:33 +00:00
Charlie Marsh	307f7a735c	Avoid allocations in lowercase comparisons (#5137 ) ## Summary I noticed that we have a few hot comparisons that involve called `s.to_lowercase()`. We can avoid an allocation by comparing characters directly.	2023-06-16 08:57:43 -04:00
Charlie Marsh	3af9dfeb0a	Rewrite `suspicious_function_call` as a match statement (#5140 ) ## Summary @konstin mentioned that in profiling, this function accounted for a non-trivial amount of time (0.33% of total execution, the most of any rule). This PR attempts to rewrite it as a match statement for better performance over a looping comparison. ## Test Plan `cargo test`	2023-06-16 08:57:20 -04:00
Charlie Marsh	5526699535	Use const-singleton helpers in more rules (#5142 )	2023-06-16 04:28:35 +00:00
Charlie Marsh	fab2a4adf7	Use `matches!` for insecure hash rule (#5141 )	2023-06-16 04:18:32 +00:00
Charlie Marsh	13813dc1b1	Skip `DJ008` enforcement in stub files (#5139 ) Closes #5138.	2023-06-16 03:49:40 +00:00
Charlie Marsh	70c01257ca	Minor formatting changes to `Checker` (#5135 )	2023-06-15 22:42:21 -04:00
Evan Rittenhouse	26d19655db	Add Applicability to flake8_tidy_imports (#5131 ) ## Summary Fixes some of https://github.com/astral-sh/ruff/issues/4184	2023-06-15 18:09:00 -04:00
Charlie Marsh	1f856aa576	Don't treat straight imports of __future__ as `__future__` imports (#5128 ) ## Summary If you `import __future__`, it's not subject to the same rules as `from __future__ import feature` -- i.e., this is fine: ```python x = 1 import __future__ ``` It doesn't really make sense to treat these as `__future__` imports (though I can't imagine anyone ever does this anyway).	2023-06-15 20:53:02 +00:00
Evan Rittenhouse	1e383483f7	Add Applicability to flake8_quotes fixes (#5130 ) ## Summary Fixes some of #4184	2023-06-15 16:50:54 -04:00
Evan Rittenhouse	89b328c6be	Add Applicability to flake8_logging_format fixes (#5129 ) ## Summary Fixes some of #4184	2023-06-15 16:50:19 -04:00
Evan Rittenhouse	6143065fc2	Add Applicability to flake8_comma fixes (#5127 ) ## Summary Fixes some of #4184	2023-06-15 16:49:54 -04:00
Charlie Marsh	107a295af4	Allow `async with` in `redefined-loop-name` (#5125 ) ## Summary Closes #5124.	2023-06-15 15:00:19 -04:00
Charlie Marsh	5ea3e42513	Always use identifier ranges to store bindings (#5110 ) ## Summary At present, when we store a binding, we include a `TextRange` alongside it. The `TextRange` _sometimes_ matches the exact range of the identifier to which the `Binding` is linked, but... not always. For example, given: ```python x = 1 ``` The binding we create _will_ use the range of `x`, because the left-hand side is an `Expr::Name`, which has a valid range on it. However, given: ```python try: pass except ValueError as e: pass ``` When we create a binding for `e`, we don't have a `TextRange`... The AST doesn't give us one. So we end up extracting it via lexing. This PR extends that pattern to the rest of the binding kinds, to ensure that whenever we create a binding, we always use the range of the bound name. This leads to better diagnostics in cases like pattern matching, whereby the diagnostic for "unused variable `x`" here used to include `x`, instead of just `x`: ```python def f(provided: int) -> int: match provided: case [_, x]: pass ``` This is _also_ required for symbol renames, since we track writes as bindings -- so we need to know the ranges of the bound symbols. By storing these bindings precisely, we can also remove the `binding.trimmed_range` abstraction -- since bindings already use the "trimmed range". To implement this behavior, I took some of our existing utilities (like the code we had for `except ValueError as e` above), migrated them from a full lexer to a zero-allocation lexer that _only_ identifies "identifiers", and moved the behavior into a trait, so we can now do `stmt.identifier(locator)` to get the range for the identifier. Honestly, we might end up discarding much of this if we decide to put ranges on all identifiers (https://github.com/astral-sh/RustPython-Parser/pull/8). But even if we do, this will _still_ be a good change, because the lexer introduced here is useful beyond names (e.g., we use it find the `except` keyword in an exception handler, to find the `else` after a `for` loop, and so on). So, I'm fine committing this even if we end up changing our minds about the right approach. Closes #5090. ## Benchmarks No significant change, with one statistically significant improvement (-2.1654% on `linter/all-rules/large/dataset.py`): ``` linter/default-rules/numpy/globals.py time: [73.922 µs 73.955 µs 73.986 µs] thrpt: [39.882 MiB/s 39.898 MiB/s 39.916 MiB/s] change: time: [-0.5579% -0.4732% -0.3980%] (p = 0.00 < 0.05) thrpt: [+0.3996% +0.4755% +0.5611%] Change within noise threshold. Found 6 outliers among 100 measurements (6.00%) 4 (4.00%) low severe 1 (1.00%) low mild 1 (1.00%) high mild linter/default-rules/pydantic/types.py time: [1.4909 ms 1.4917 ms 1.4926 ms] thrpt: [17.087 MiB/s 17.096 MiB/s 17.106 MiB/s] change: time: [+0.2140% +0.2741% +0.3392%] (p = 0.00 < 0.05) thrpt: [-0.3380% -0.2734% -0.2136%] Change within noise threshold. Found 4 outliers among 100 measurements (4.00%) 3 (3.00%) high mild 1 (1.00%) high severe linter/default-rules/numpy/ctypeslib.py time: [688.97 µs 691.34 µs 694.15 µs] thrpt: [23.988 MiB/s 24.085 MiB/s 24.168 MiB/s] change: time: [-1.3282% -0.7298% -0.1466%] (p = 0.02 < 0.05) thrpt: [+0.1468% +0.7351% +1.3461%] Change within noise threshold. Found 15 outliers among 100 measurements (15.00%) 1 (1.00%) low mild 2 (2.00%) high mild 12 (12.00%) high severe linter/default-rules/large/dataset.py time: [3.3872 ms 3.4032 ms 3.4191 ms] thrpt: [11.899 MiB/s 11.954 MiB/s 12.011 MiB/s] change: time: [-0.6427% -0.2635% +0.0906%] (p = 0.17 > 0.05) thrpt: [-0.0905% +0.2642% +0.6469%] No change in performance detected. Found 20 outliers among 100 measurements (20.00%) 1 (1.00%) low severe 2 (2.00%) low mild 4 (4.00%) high mild 13 (13.00%) high severe linter/all-rules/numpy/globals.py time: [148.99 µs 149.21 µs 149.42 µs] thrpt: [19.748 MiB/s 19.776 MiB/s 19.805 MiB/s] change: time: [-0.7340% -0.5068% -0.2778%] (p = 0.00 < 0.05) thrpt: [+0.2785% +0.5094% +0.7395%] Change within noise threshold. Found 2 outliers among 100 measurements (2.00%) 1 (1.00%) low mild 1 (1.00%) high severe linter/all-rules/pydantic/types.py time: [3.0362 ms 3.0396 ms 3.0441 ms] thrpt: [8.3779 MiB/s 8.3903 MiB/s 8.3997 MiB/s] change: time: [-0.0957% +0.0618% +0.2125%] (p = 0.45 > 0.05) thrpt: [-0.2121% -0.0618% +0.0958%] No change in performance detected. Found 11 outliers among 100 measurements (11.00%) 1 (1.00%) low severe 3 (3.00%) low mild 5 (5.00%) high mild 2 (2.00%) high severe linter/all-rules/numpy/ctypeslib.py time: [1.6879 ms 1.6894 ms 1.6909 ms] thrpt: [9.8478 MiB/s 9.8562 MiB/s 9.8652 MiB/s] change: time: [-0.2279% -0.0888% +0.0436%] (p = 0.18 > 0.05) thrpt: [-0.0435% +0.0889% +0.2284%] No change in performance detected. Found 5 outliers among 100 measurements (5.00%) 4 (4.00%) low mild 1 (1.00%) high severe linter/all-rules/large/dataset.py time: [7.1520 ms 7.1586 ms 7.1654 ms] thrpt: [5.6777 MiB/s 5.6831 MiB/s 5.6883 MiB/s] change: time: [-2.5626% -2.1654% -1.7780%] (p = 0.00 < 0.05) thrpt: [+1.8102% +2.2133% +2.6300%] Performance has improved. Found 2 outliers among 100 measurements (2.00%) 1 (1.00%) low mild 1 (1.00%) high mild ```	2023-06-15 18:43:19 +00:00

... 4 5 6 7 8 ...

1838 Commits