Python/ruff - ruff - Gitea: Git with a cup of tea

mirror of https://github.com/astral-sh/ruff synced 2026-01-24 06:50:59 -05:00

Author	SHA1	Message	Date
Charlie Marsh	3fb9e76012	Remove unnecessary manual Generator invocations (#2129 )	2023-01-24 09:38:12 -05:00
Charlie Marsh	8d46d3bfa6	Avoid nested-if violations when outer-if has else clause (#2095 ) It looks like we need `do`-`while`-like semantics here with an additional outer check. Closes #2094.	2023-01-22 17:40:56 -05:00
alm	e11cf1bf65	Update linters PyPI links to latest version (#2062 )	2023-01-22 13:10:22 -05:00
Charlie Marsh	4dcf284a04	Index source code upfront to power (row, column) lookups (#1990 ) ## Summary The problem: given a (row, column) number (e.g., for a token in the AST), we need to be able to map it to a precise byte index in the source code. A while ago, we moved to `ropey` for this, since it was faster in practice (mostly, I think, because it's able to defer indexing). However, at some threshold of accesses, it becomes faster to index the string in advance, as we're doing here. ## Benchmark It looks like this is ~3.6% slower for the default rule set, but ~9.3% faster for `--select ALL`. I suspect there's a strategy that would be strictly faster in both cases, based on deferring even more computation (right now, we lazily compute these offsets, but we do it for the entire file at once, even if we only need some slice at the top), or caching the `ropey` lookups in some way. Before: ![main](https://user-images.githubusercontent.com/1309177/213883581-8f73c61d-2979-4171-88a6-a88d7ff07e40.png) After: ![48 all](https://user-images.githubusercontent.com/1309177/213883586-3e049680-9ef9-49e2-8f04-fd6ff402eba7.png) ## Alternatives I tried tweaking the `Vec::with_capacity` hints, and even trying `Vec::with_capacity(str_indices::lines_crlf::count_breaks(contents))` to do a quick scan of the number of lines, but that turned out to be slower.	2023-01-21 17:56:11 -05:00
Harutaka Kawamura	eb1b5e5454	De-duplicate SIM102 (#2050 ) The idea is the same as #1867. Avoids emitting `SIM102` twice for the following code: ```python if a: if b: if c: d ``` ``` resources/test/fixtures/flake8_simplify/SIM102.py:1:1: SIM102 Use a single `if` statement instead of nested `if` statements resources/test/fixtures/flake8_simplify/SIM102.py:2:5: SIM102 Use a single `if` statement instead of nested `if` statements ```	2023-01-20 23:38:52 -05:00
Charlie Marsh	9e704a7c63	Only fix true-false returns for return-bool-condition-directly (#2037 ) Closes #2035.	2023-01-20 13:17:19 -05:00
Charlie Marsh	fd6dc2a343	Use platform-appropriate newline character for LibCST embedding (#2028 ) Closes #2026.	2023-01-20 09:08:04 -05:00
Charlie Marsh	ec0c7647ab	Avoid SIM401 in `elif` blocks (#2012 ) For now, we're just gonna avoid flagging this for `elif` blocks, following the same reasoning as for ternaries. We can handle all of these cases, but we'll knock out the TODOs as a pair, and this avoids broken code. Closes #2007.	2023-01-19 21:57:18 -05:00
Charlie Marsh	045229630e	Upgrade RustPython (#2011 ) This lets us revert the "manual" fix introduced in #1944.	2023-01-19 21:49:12 -05:00
Charlie Marsh	ad80fdc2cd	Avoid `SIM201` and `SIM202` errors in `__ne__` et al (#2001 ) Closes #1986.	2023-01-19 11:27:27 -05:00
Charlie Marsh	a122d95ef5	Preserve unmatched comparators in `SIM109` (#1998 ) Closes #1993.	2023-01-19 10:23:20 -05:00
Martin Fischer	6649225167	rule 8/8: Automatically rewrite RuleCode to Rule # This commit was automatically generated by running the following # script (followed by `cargo +nightly fmt`): import glob import re from typing import NamedTuple class Rule(NamedTuple): code: str name: str path: str def rules() -> list[Rule]: """Returns all the rules defined in `src/registry.rs`.""" file = open('src/registry.rs') rules = [] while next(file) != 'ruff_macros::define_rule_mapping!(\n': continue while (line := next(file)) != ');\n': line = line.strip().rstrip(',') if line.startswith('//'): continue code, path = line.split(' => ') name = path.rsplit('::')[-1] rules.append(Rule(code, name, path)) return rules code2name = {r.code: r.name for r in rules()} for pattern in ('src/*/.rs', 'ruff_cli/*/.rs', 'ruff_dev/*/.rs', 'scripts/add_.py'): for name in glob.glob(pattern, recursive=True): with open(name) as f: text = f.read() text = re.sub('Rule(?:Code)?::([A-Z]\w+)', lambda m: 'Rule::' + code2name[m.group(1)], text) text = re.sub(r'(?<!"<FilePattern>:<)RuleCode\b', 'Rule', text) text = re.sub('(use crate::registry::{., Rule), Rule(.*)', r'\1\2', text) # fix duplicate import with open(name, 'w') as f: f.write(text)	2023-01-18 23:51:48 -05:00
Martin Fischer	dbcab5128c	rule 3/8: Remove AsRef<str> impl for Rule	2023-01-18 23:51:48 -05:00
Martin Fischer	3810250bb6	rule 2/8: Rename DiagnosticKind::code to rule	2023-01-18 23:51:48 -05:00
Charlie Marsh	34412a0a01	Avoid removing side effects for boolean simplifications (#1984 ) Closes #1978.	2023-01-18 19:08:14 -05:00
Charlie Marsh	969a6f0d53	Replace misplaced-comparison-constant with SIM300 (#1980 ) Closes: #1954.	2023-01-18 18:42:49 -05:00
Charlie Marsh	7628876ff2	Invert order of yoda-conditions message (#1979 ) The suggestion was wrong!	2023-01-18 18:27:36 -05:00
Charlie Marsh	97f55b8e97	Convert remaining call path sites to use `SmallVec` (#1972 )	2023-01-18 14:50:33 -05:00
Charlie Marsh	dae95626ae	Use `smallvec` for call path representation (#1960 ) This provides a ~10% speed-up for large codebases with `--select ALL`: ![Screen Shot 2023-01-18 at 11 28 20 AM](https://user-images.githubusercontent.com/1309177/213236389-cff50840-6e55-47a3-9164-2e40cbc885f6.png)	2023-01-18 11:29:05 -05:00
Anders Kaseorg	b9c6cfc0ab	Autofix SIM117 (MultipleWithStatements) (#1961 ) This is slightly buggy due to Instagram/LibCST#855; it will complain `[ERROR] Failed to fix nested with: Failed to extract CST from source` when trying to fix nested parenthesized `with` statements lacking trailing commas. But presumably people who write parenthesized `with` statements already knew that they don’t need to nest them. Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2023-01-18 11:06:04 -05:00
Anders Kaseorg	83346de6e0	Autofix SIM102 (NestedIfStatements) Co-authored-by: Charlie Marsh <charlie.r.marsh@gmail.com> Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2023-01-18 07:37:27 -05:00
Anders Kaseorg	b23cc31863	Change ast::helpers::has_coments to accept a Range Co-authored-by: Charlie Marsh <charlie.r.marsh@gmail.com> Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2023-01-18 07:37:27 -05:00
Anders Kaseorg	ea4d54a90f	Restrict SIM105 to try blocks with a body of one simple statement (#1948 ) If a `try` block has multiple statements, a compound statement, or control flow, rewriting it with `contextlib.suppress` would obfuscate the fact that the exception still short-circuits further statements in the block. Fixes #1947. Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2023-01-18 00:22:22 -05:00
Charlie Marsh	51b917cfbf	Exempt `contextlib.ExitStack()` for SIM115 rules (#1946 ) Since our binding tracking is somewhat limited, I opted to favor false negatives over false positives. So, e.g., this won't trigger SIM115: ```py with contextlib.ExitStack(): f = exit_stack.enter_context(open("filename")) ``` (Notice that `exit_stack` is unbound.) The alternative strategy required us to incorrectly trigger SIM115 on this: ```py with contextlib.ExitStack() as exit_stack: exit_stack_ = exit_stack f = exit_stack_.enter_context(open("filename")) ``` Closes #1945.	2023-01-17 22:39:54 -05:00
Charlie Marsh	84d1df08be	Avoid broken autofix for `SIM103` with `elif` (#1944 ) Also adjusts the generator to avoid the extra parentheses (and skips commented `if` statements). Closes #1943.	2023-01-17 22:03:17 -05:00
Martin Fischer	a6566b1b34	refactor: Merge Settings.enabled and Settings.fixable The Settings struct previously contained the fields: pub enabled: HashableHashSet<RuleCode>, pub fixable: HashableHashSet<RuleCode>, This commit merges both fields into one by introducing a new RuleTable type, wrapping HashableHashMap<RuleCode, bool>, which has the following benefits: 1. It makes the invalid state that a rule is disabled but fixable unrepresentable. 2. It encapsulates the implementation details of the table. (It currently uses an FxHashMap but that may change.) 3. It results in more readable code. settings.rules.enabled(rule) settings.rules.should_fix(rule) is more readable than: settings.enabled.contains(rule) settings.fixable.contains(rule)	2023-01-17 09:20:57 -05:00
Martin Fischer	b78b6f275e	refactor: Define origin names & URLs within doc comments	2023-01-17 07:44:40 -05:00
Charlie Marsh	f3bf008aed	Avoid removing statements that contain side-effects (#1920 ) Closes #1917.	2023-01-16 14:45:02 -05:00
Charlie Marsh	3791ca721a	Add a dedicated token indexer for continuations and comments (#1886 ) The primary motivation is that we can now robustly detect `\` continuations due to the addition of `Tok::NonLogicalNewline`. This PR generalizes the approach we took to comments (track all lines that contain any comments), and applies it to continuations too.	2023-01-15 01:57:31 -05:00
Harutaka Kawamura	2a1601749f	Fix range of SIM201, 202, and 208 (#1880 ) Before ``` resources/test/fixtures/flake8_simplify/SIM208.py:1:13: SIM208 Use `a` instead of `not (not a)` \| 1 \| if not (not a): # SIM208 \| ^ SIM208 \| = help: Replace with `a` resources/test/fixtures/flake8_simplify/SIM208.py:4:14: SIM208 Use `a == b` instead of `not (not a == b)` \| 4 \| if not (not (a == b)): # SIM208 \| ^^^^^^ SIM208 \| = help: Replace with `a == b` ``` After ``` resources/test/fixtures/flake8_simplify/SIM208.py:1:4: SIM208 Use `a` instead of `not (not a)` \| 1 \| if not (not a): # SIM208 \| ^^^^^^^^^^^ SIM208 \| = help: Replace with `a` resources/test/fixtures/flake8_simplify/SIM208.py:4:4: SIM208 Use `a == b` instead of `not (not a == b)` \| 4 \| if not (not (a == b)): # SIM208 \| ^^^^^^^^^^^^^^^^^^ SIM208 \| = help: Replace with `a == b` ```	2023-01-14 21:17:32 -05:00
Martin Fischer	c7f0f3b237	Regenerate insta snapshots	2023-01-14 11:48:02 -05:00
Martin Fischer	3b36030461	Introduce ruff::rules module Resolves #1547.	2023-01-14 11:48:02 -05:00

32 Commits