Python/ruff - ruff - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Dhruv Manilawala	3b4c8fffe5	Lex Jupyter line magic with `Mode::Jupyter` (#23 ) Lex Jupyter line magic with `Mode::Jupyter` This PR adds a new token `MagicCommand`[^1] which the lexer will recognize when in `Mode::Jupyter`. The rules for the lexer are as follows: 1. Given that we are at the start of a line, skip the indentation and look for [characters that represent the start of a magic command](`635815e8f1/IPython/core/inputtransformer2.py (L335-L346)`), determine the magic kind and capture all the characters following it as the command string. 2. If the command extends over multiple lines, the lexer will skip the line continuation character (`\`) but only if it's followed by a newline (`\n` or `\r`). The backslash is skipped only before a newline because backslashes can also occur inside the command string, where they must not be skipped: ```rust // Skip this backslash // v // !pwd \ // && ls -a \| sed 's/^/\\ /' // ^^ // Don't skip these backslashes ``` 3. The parser, when in `Mode::Jupyter`, will filter these tokens before the parsing begins. There is a small caveat when the magic command is indented. In the following example, when the parser filters out the magic command, it'll throw an indentation error: ```python for i in range(5): !ls # What the parser will see for i in range(5): ``` [^1]: I would prefer to have some other name as this not only represents a line magic (`%`) but also shell command (`!`), help command (`?`) and others. In the original implementation, it's named ["IPython Syntax"](`635815e8f1/IPython/core/inputtransformer2.py (L332)`)	2023-07-18 09:24:24 +05:30
Harutaka Kawamura	a4e5e3205f	Ignore directories when collecting files to lint (#5775 ) ## Summary Fixes #5739 ## Test Plan Manually tested: ```sh $ tree dir dir ├── dir.py │ └── file.py └── file.py 1 directory, 2 files $ cargo run -p ruff_cli -- check dir --no-cache Finished dev [unoptimized + debuginfo] target(s) in 0.08s Running `target/debug/ruff check dir --no-cache` dir/dir.py/file.py:1:7: F821 Undefined name `a` dir/file.py:1:7: F821 Undefined name `a` Found 2 errors. ``` Is a unit test needed?	2023-07-17 20:25:43 -05:00
Simon Brugman	17ee80363a	refactor: use find_keyword ast helper more (#5847 ) Use the ast helper function `find_keyword` where applicable (found these while working on another feature)	2023-07-17 19:37:23 -04:00
David Szotten	52aa2fc875	upgrade rustpython to remove tuple-constants (#5840 ) c.f. https://github.com/astral-sh/RustPython-Parser/pull/28 Tests: No snapshots changed --------- Co-authored-by: Zanie <contact@zanie.dev>	2023-07-17 22:50:31 +00:00
Zanie	126652b684	Fix decorator ranges Incorrectly merged LALRPOP file	2023-07-17 14:49:14 -05:00
Zanie	57e8712d76	Bump expected size of `Stmt` to 168 bytes	2023-07-17 14:49:14 -05:00
Zanie	78c6ede1c9	Format	2023-07-17 14:49:14 -05:00
Zanie Blue	a843a00f6b	Add parsing of type alias statements i.e. the `type` keyword (#97 ) Extends #95 Closes #82 Adds parsing of new `type` soft keyword for defining type aliases. Supports type alias statements as defined in PEP 695 e.g. ```python type IntOrStr = int \| str type ListOrSet[T] = list[T] \| set[T] type AnimalOrVegetable = Animal \| "Vegetable" type RecursiveList[T] = T \| list[RecursiveList[T]] ``` All type parameter kinds are supported as in #95. Builds on soft keyword abstractions introduced in https://github.com/RustPython/RustPython/pull/4519	2023-07-17 14:49:14 -05:00
Zanie Blue	f846f1ea42	Parse type parameters in function definitions (#96 ) * Parse type parameters in function definitions * Add test for combined items	2023-07-17 14:49:14 -05:00
Zanie	1d4b7a395f	Consolidate tests and add coverage for trailing comma	2023-07-17 14:49:14 -05:00
Zanie	ce3ce0734b	Add bound to test case `test_parse_class_with_all_possible_generic_types`	2023-07-17 14:49:14 -05:00
Zanie	b0e119f049	Add test for tuple bounds	2023-07-17 14:49:14 -05:00
Zanie	1f5e707829	Remove test for empty generic `class Foo[]: ...` Not valid syntax	2023-07-17 14:49:14 -05:00
Zanie	ed7acfe477	Parse type parameters in class definitions	2023-07-17 14:49:14 -05:00
Zanie	c31b58eb39	Move `type_param` stubs into LALRPOP definition	2023-07-17 14:49:14 -05:00
Zanie	c0a3a20c63	Bump size assertion for `Stmt` from 136 to 160 bytes	2023-07-17 14:49:14 -05:00
Zanie	05ae26b935	Regenerate code with latest ASDL	2023-07-17 14:49:14 -05:00
David Szotten	b996b21ffc	tuple constants are for optimisations, not source (#28 ) my reading of https://docs.python.org/3/library/ast.html#ast.unparse and https://discuss.python.org/t/ast-constant-value-tuple-s-and-frozenset-s/22578 is that tuple constants cannot come from parsing python source, they are only for optimised bytecode see also https://github.com/astral-sh/ruff/pull/5812	2023-07-17 15:46:37 -04:00
Charlie Marsh	e574a6a769	Add some "Phase" annotations to other visit methods (#5839 ) ## Summary Follow-up from #5820.	2023-07-17 14:46:39 -04:00
Charlie Marsh	b9346a4fd6	Draw boundaries between various `Checker` visitation phases (#5820 ) ## Summary This PR does some non-behavior-changing refactoring of the AST checker. Specifically, it breaks the `Stmt`, `Expr`, and `ExceptHandler` visitors into four distinct, consistent phases: 1. Phase 1: Analysis: Run any lint rules on the node. 2. Phase 2: Binding: Bind any symbols declared by the node. 3. Phase 3: Recursion: Visit all child nodes. 4. Phase 4: Clean-up: Pop scopes, etc. There are some fuzzy boundaries in the last three phases, but the most important divide is between the Phase 1 and all the others -- the goal here is (as much as possible) to disentangle all of the vanilla lint-rule calls from any other semantic analysis or model building. Part of the motivation here is that I'm considering re-ordering some of these phases, and it was just impossible to reason about that change as long as we had miscellaneous binding-creation and scope-modification code intermingled with lint rules. However, this could also enable us to (e.g.) move the entire analysis phase elsewhere, and even with a more limited API that has read-only access to `Checker` (but can push to a diagnostics vector).	2023-07-17 13:02:21 -04:00
Charlie Marsh	8001a2f121	Expand convention documentation (#5819 )	2023-07-17 14:12:46 +00:00
konsti	7dd30f0270	Read black options in format_dev script (#5827 ) ## Summary Comparing repos with black requires that we use the same settings as black, notably line length and magic trailing comma behaviour. Excludes and preserving quotes (vs. a preference for either quote style) are not yet implemented because they weren't needed for the test projects. In the other two commits I fixed the output when the progress bar is hidden (this way is recommended in the indicatif docs), added a `scratch.pyi` file to gitignore because black formats stub files differently and also updated the ecosystem readme with the projects json without forks. ## Test Plan I added a `line-length` vs `line_length` test. Otherwise only my personal usage atm, a PR to integrate the script into the CI to check some projects will follow.	2023-07-17 13:29:43 +00:00
Micha Reiser	21063544f7	Fix formatter `generate.py` (#5829 )	2023-07-17 10:41:27 +00:00
Luc Khai Hai	fb336898a5	Format `AsyncFor` (#5808 )	2023-07-17 10:38:59 +02:00
Tom Kuson	f5f8eb31ed	Add documentation to the `flake8-gettext` (`INT`) rules (#5813 ) ## Summary Completes documentation for the `flake8-gettext` (`INT`) ruleset. Related to #2646. ## Test Plan `python scripts/check_docs_formatted.py`	2023-07-17 04:09:33 +00:00
Charlie Marsh	be6c744856	Include function name in `undocumented-param` message (#5818 ) Closes #5814.	2023-07-16 22:51:34 -04:00
Charlie Marsh	94998aedef	Reduce unnecessary allocations for keyword detection (#5817 )	2023-07-17 02:22:30 +00:00
Tom Kuson	1c0376a72d	Add documentation to the `S5XX` rules (#5805 ) ## Summary Add documentation to the `S5XX` rules (the `flake8-bandit` ['cryptography'](https://bandit.readthedocs.io/en/latest/plugins/index.html#plugin-id-groupings) rule group). Related to #2646. ## Test Plan `python scripts/check_docs_formatted.py`	2023-07-17 02:12:57 +00:00
Simon Brugman	de2a13fcd7	[`pandas-vet`] series constant series (#5802 ) ## Summary Implementation for https://github.com/astral-sh/ruff/issues/5588 Q1: are there any additional semantic helpers that could be used to guard this rule? Which existing rules should be similar in that respect? Can we at least check if `pandas` is imported (any pointers welcome)? Currently, the rule flags: ```python data = {"a": "b"} data.nunique() == 1 ``` Q2: Any pointers on naming of the rule and selection of the code? It was proposed, but not replied to/implemented in the upstream. `pandas` did accept a PR to update their cookbook to reflect this rule though. ## Test Plan TODO: - [X] Checking for ecosystem CI results - [x] Test on selected [real-world cases](https://github.com/search?q=%22nunique%28%29+%3D%3D+1%22+language%3APython+&type=code) - [x] https://github.com/sdv-dev/SDMetrics - [x] https://github.com/google-research/robustness_metrics - [x] https://github.com/soft-matter/trackpy - [x] https://github.com/microsoft/FLAML/ - [ ] Add guarded test cases	2023-07-17 01:55:34 +00:00
Harutaka Kawamura	cfec636046	Do not fix `NamedTuple` calls containing both a list of fields and keywords (#5799 ) ## Summary Fixes #5794 ## Test Plan Existing tests	2023-07-17 01:31:53 +00:00
Tom Kuson	ae431df146	Change `pandas-use-of-dot-read-table` rule to emit only when `read_table` is used on CSV data (#5807 ) ## Summary Closes #5628 by only emitting if `sep=","`. Includes documentation (completes the `pandas-vet` ruleset). Related to #2646. ## Test Plan `cargo test`	2023-07-17 01:25:13 +00:00
Charlie Marsh	2cd117ba81	Remove `TryIdentifier` trait (#5816 ) ## Summary Last remaining usage here is for patterns, but we now have ranges on identifiers so it's unnecessary.	2023-07-16 21:24:16 -04:00
Simon Brugman	a956226d95	perf: only compute start offset for overlong lines (#5811 ) Moves the computation of the `start_offset` for overlong lines to just before the result is returned. There is a slight overhead for overlong lines (double the work for the first `limit` characters). In practice this results in a speedup on the CPython codebase. Most lines are not overlong, or are not enforced because the line ends with a URL, or does not contain whitespace. Nonetheless, the 0.3% of overlong lines are a lot compared to other violations. ### Before ![selected before](https://github.com/astral-sh/ruff/assets/9756388/d32047df-7fd2-4ae8-8333-1a3679ce000f) _Selected W505 and E501_ ![all before](https://github.com/astral-sh/ruff/assets/9756388/98495118-c474-46ff-873c-fb58a78cfe15) _All rules_ ### After ![selected after](https://github.com/astral-sh/ruff/assets/9756388/e4bd7f10-ff7e-4d52-8267-27cace8c5471) _Selected W505 and E501_ ![all after](https://github.com/astral-sh/ruff/assets/9756388/573bdbe2-c64f-4f22-9659-c68726ff52c0) _All rules_ CPython line statistics: - Number of Python lines: 867.696 - Number of overlong lines: 2.963 (0.3%) <details> Benchmark selected: ```shell cargo build --release && hyperfine --warmup 10 --min-runs 50 \ "./target/release/ruff ./crates/ruff/resources/test/cpython/ --no-cache -e --select W505,E501" ``` Benchmark all: ```shell cargo build --release && hyperfine --warmup 10 --min-runs 50 \ "./target/release/ruff ./crates/ruff/resources/test/cpython/ --no-cache -e --select ALL" ``` Overlong lines in CPython ```shell cargo run -p ruff_cli -- check crates/ruff/resources/test/cpython/Lib --no-cache --select=E501,W505 --statistics ``` Total Python lines: ```shell find crates/ruff/resources/test/cpython/ -name '*.py' \| xargs wc -l ``` </details> (Performance tested on Mac M1)	2023-07-16 21:05:44 -04:00
Chris Pryer	1dd52ad139	Update generate.py comment (#5809 ) ## Summary The generated comment differs from the current comment in the generated files. ## Test Plan None	2023-07-16 11:51:30 -04:00
Charlie Marsh	d692ed0896	Use a match statement for builtin detection (#5798 ) ## Summary We've seen speed-ups in the past by converting from slice iteration to match statements; this just does the same for built-in checks.	2023-07-16 04:57:57 +00:00
Charlie Marsh	01b05fe247	Remove `Identifier` usages for isolating exception names (#5797 ) ## Summary The motivating change here is to remove `let range = except_handler.try_identifier().unwrap();` and instead just do `name.range()`, since exception names now have ranges attached to them by the parse. This also required some refactors (which are improvements) to the built-in attribute shadowing rules, since at least one invocation relied on passing in the exception handler and calling `.try_identifier()`. Now that we have easy access to identifiers, we can remove the whole `AnyShadowing` abstraction.	2023-07-16 04:49:48 +00:00
Charlie Marsh	59dfd0e793	Move except-handler flag into `visit_except_handler` (#5796 ) ## Summary This is more similar to how these flags work in other contexts (e.g., `visit_annotation`), and also ensures that we unset it prior to visiting the `orelse` and `finalbody` (a subtle bug).	2023-07-16 00:35:02 -04:00
Charlie Marsh	c7ff743d30	Use `semantic().global()` to power `global-statement` rule (#5795 ) ## Summary The intent of this rule is to always flag the `global` declaration, not the usage. The current implementation does the wrong thing if a global is assigned multiple times. Using `semantic().global()` is also more efficient.	2023-07-16 00:34:42 -04:00
konsti	b01a4d8446	Update ruff crate descriptions (#5710 ) ## Summary I updated all ruff crate descriptions in the contributing guide ## Test Plan n/a	2023-07-16 02:41:47 +00:00
Justin Prieto	f012ed2d77	Add autofix for B004 (#5788 ) ## Summary Adds autofix for `hasattr` case of B004. I don't think it's safe (or simple) to implement it for the `getattr` case because, inter alia, calling `getattr` may have side effects. Fixes #3545 ## Test Plan Existing tests were sufficient. Updated snapshots	2023-07-16 01:32:21 +00:00
Charlie Marsh	06b5c6c06f	Use `SmallVec#extend_from_slice` in lieu of `SmallVec#extend` (#5793 ) ## Summary There's a note in the docs that suggests this can be faster, and in the benchmarks it... seems like it is? Might just be noise but held up over a few runs. Before: <img width="1792" alt="Screen Shot 2023-07-15 at 9 10 06 PM" src="https://github.com/astral-sh/ruff/assets/1309177/973cd955-d4e6-4ae3-898e-90b7eb52ecf2"> After: <img width="1792" alt="Screen Shot 2023-07-15 at 9 10 09 PM" src="https://github.com/astral-sh/ruff/assets/1309177/1491b391-d219-48e9-aa47-110bc7dc7f90">	2023-07-15 21:25:12 -04:00
Charlie Marsh	4782675bf9	Remove lexer-based comment range detection (#5785 ) ## Summary I'm doing some unrelated profiling, and I noticed that this method is actually measurable on the CPython benchmark -- it's > 1% of execution time. We don't need to lex here, we already know the ranges of all comments, so we can just do a simple binary search for overlap, which brings the method down to 0%. ## Test Plan `cargo test`	2023-07-16 01:03:27 +00:00
Charlie Marsh	f2e995f78d	Gate `runtime-import-in-type-checking-block` (`TCH004`) behind enabled flag (#5789 ) Closes #5787.	2023-07-15 20:57:29 +00:00
guillaumeLepape	6824b67f44	Include alias when formatting import-from structs (#5786 ) ## Summary When required-imports is set with the syntax from ... import ... as ..., autofix I002 is failing ## Test Plan Reuse the same python files as `crates/ruff/src/rules/isort/mod.rs:required_import` test.	2023-07-15 15:53:21 -04:00
Charlie Marsh	8ccd697020	Expand scope of `quoted-annotation` rule (#5766 ) ## Summary Previously, the `quoted-annotation` rule only removed quotes when `from __future__ import annotations` was present. However, there are some other cases in which this is also safe -- for example: ```python def foo(): x: "MyClass" ``` We already model these in the semantic model, so this PR just expands the scope of the rule to handle those.	2023-07-15 15:37:34 -04:00
Charlie Marsh	2de6f30929	Lift `Expr::Subscript` value visit out of branches (#5783 ) Like #5772, but for subscripts.	2023-07-15 15:12:15 -04:00
Micha Reiser	df2efe81c8	Respect magic trailing comma for set expression (#5782 ) ## Summary This PR uses the `join_comma_separated` builder for formatting set expressions to ensure the formatting preserves magic commas, if the setting is enabled. ## Test Plan See the fixed black tests	2023-07-15 16:40:38 +00:00
Chris Pryer	fa4855e6fe	Format `DictComp` expression (#5771 ) ## Summary Format `DictComp` like `ListComp` from #5600. It's not 100%, but I figured maybe it's worth starting to explore. ## Test Plan Added ruff fixture based on `ListComp`'s.	2023-07-15 17:35:23 +01:00
Micha Reiser	3cda89ecaf	Parenthesize with statements (#5758 ) ## Summary This PR improves the parentheses handling for with items to get closer to black's formatting. ### Case 1: ```python # Black / Input with ( [ "aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa", "bbbbbbbbbb", "cccccccccccccccccccccccccccccccccccccccccc", dddddddddddddddddddddddddddddddd, ] as example1, aaaaaaaaaaaaaaaaaaaaaaaaaa + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb + cccccccccccccccccccccccccccc + ddddddddddddddddd as example2, CtxManager2() as example2, CtxManager2() as example2, CtxManager2() as example2, ): ... # Before with ( [ "aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa", "bbbbbbbbbb", "cccccccccccccccccccccccccccccccccccccccccc", dddddddddddddddddddddddddddddddd, ] as example1, ( aaaaaaaaaaaaaaaaaaaaaaaaaa + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb + cccccccccccccccccccccccccccc + ddddddddddddddddd ) as example2, CtxManager2() as example2, CtxManager2() as example2, CtxManager2() as example2, ): ... ``` Notice how Ruff wraps the binary expression in an extra set of parentheses ### Case 2: Black does not expand the with-items if the with has no parentheses: ```python # Black / Input with aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb as c: ... # Before with ( aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa + bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb as c ): ... 
``` Or ```python # Black / Input with [ "aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa", "bbbbbbbbbb", "cccccccccccccccccccccccccccccccccccccccccc", dddddddddddddddddddddddddddddddd, ] as example1, aaaaaaaaaaaaaaaaaaaaaaaaaa * bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb * cccccccccccccccccccccccccccc + ddddddddddddddddd as example2, CtxManager222222222222222() as example2: ... # Before (Same as Case 1) with ( [ "aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa", "bbbbbbbbbb", "cccccccccccccccccccccccccccccccccccccccccc", dddddddddddddddddddddddddddddddd, ] as example1, ( aaaaaaaaaaaaaaaaaaaaaaaaaa * bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb * cccccccccccccccccccccccccccc + ddddddddddddddddd ) as example2, CtxManager222222222222222() as example2, ): ... ``` ## Test Plan I added new snapshot tests Improves the django similarity index from 0.973 to 0.977	2023-07-15 16:03:09 +01:00
Luc Khai Hai	e1c119fde3	Format `SetComp` (#5774 ) ## Summary Format `SetComp` like `ListComp`. ## Test Plan Derived from `ListComp`'s fixture.	2023-07-15 15:50:47 +01:00
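
Commit 4782675bf9 in the table above replaces lexing with a binary search over the already-known comment ranges. A minimal Python sketch of that idea follows. It assumes the ranges are half-open `(start, end)` offsets, sorted and non-overlapping; the function name `intersects_comment` is hypothetical and is not Ruff's actual API.

```python
def intersects_comment(comment_ranges, start, end):
    """Return True if the half-open span [start, end) overlaps any comment.

    Assumption: `comment_ranges` is a list of (start, end) tuples that is
    sorted and non-overlapping, so the end offsets are sorted as well.
    """
    lo, hi = 0, len(comment_ranges)
    # Binary search for the first range that ends after `start`.
    while lo < hi:
        mid = (lo + hi) // 2
        if comment_ranges[mid][1] <= start:
            lo = mid + 1
        else:
            hi = mid
    # Every earlier range ends at or before `start`, so only this
    # candidate can overlap, and it does so iff it begins before `end`.
    return lo < len(comment_ranges) and comment_ranges[lo][0] < end
```

The lookup costs O(log n) per query and needs no tokenization, which is the kind of saving the commit message describes.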

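The lexer rules listed in commit 3b4c8fffe5 above (skip indentation, detect the magic prefix, capture the command, and drop a backslash only when a newline follows it) can be sketched in Python. This is an illustration, not the Rust lexer itself: the helper name `read_magic_command` is hypothetical, only the `%`, `!` and `?` prefixes named in the commit message are handled, and dropping the newline together with the continuation backslash is an assumption.

```python
def read_magic_command(source, pos=0):
    """Read one magic command starting at `pos`.

    Returns (kind, command, end_pos), or None if the line is not a magic
    command. Assumption: `pos` is at the start of a line.
    """
    # Rule 1: skip indentation, then look for a magic prefix.
    while pos < len(source) and source[pos] in " \t":
        pos += 1
    if pos >= len(source) or source[pos] not in "%!?":
        return None
    kind = source[pos]
    pos += 1

    chars = []
    while pos < len(source):
        ch = source[pos]
        nxt = source[pos + 1] if pos + 1 < len(source) else ""
        if ch == "\\" and nxt in ("\r", "\n"):
            # Rule 2: a backslash directly before a newline is a line
            # continuation; skip it (and, by assumption, the newline).
            pos += 2
            if nxt == "\r" and pos < len(source) and source[pos] == "\n":
                pos += 1  # treat "\r\n" as a single newline
            continue
        if ch in "\r\n":
            break  # an unescaped newline ends the command
        # Any other backslash belongs to the command string and is kept.
        chars.append(ch)
        pos += 1
    return kind, "".join(chars), pos
```

A backslash that is followed by anything other than a newline, such as the ones inside the `sed` expression in the commit message, falls through to the final branch and stays in the command string.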

4719 Commits