Python/ruff - ruff - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Charlie Marsh	93b5d8a0fb	Implement our own small-integer optimization (#7584 ) ## Summary This is a follow-up to #7469 that attempts to achieve similar gains, but without introducing malachite. Instead, this PR removes the `BigInt` type altogether, instead opting for a simple enum that allows us to store small integers directly and only allocate for values greater than `i64`: ```rust /// A Python integer literal. Represents both small (fits in an `i64`) and large integers. #[derive(Clone, PartialEq, Eq, Hash)] pub struct Int(Number); #[derive(Debug, Clone, PartialEq, Eq, Hash)] pub enum Number { /// A "small" number that can be represented as an `i64`. Small(i64), /// A "large" number that cannot be represented as an `i64`. Big(Box<str>), } impl std::fmt::Display for Number { fn fmt(&self, f: &mut std::fmt::Formatter<'_>) -> std::fmt::Result { match self { Number::Small(value) => write!(f, "{value}"), Number::Big(value) => write!(f, "{value}"), } } } ``` We typically don't care about numbers greater than `isize` -- our only uses are comparisons against small constants (like `1`, `2`, `3`, etc.), so there's no real loss of information, except in one or two rules where we're now a little more conservative (with the worst-case being that we don't flag, e.g., an `itertools.pairwise` that uses an extremely large value for the slice start constant). For simplicity, a few diagnostics now show a dedicated message when they see integers that are out of the supported range (e.g., `outdated-version-block`). An additional benefit here is that we get to remove a few dependencies, especially `num-bigint`. ## Test Plan `cargo test`	2023-09-25 15:13:21 +00:00
Charlie Marsh	87a0cd219f	Detect `asyncio.get_running_loop` calls in RUF006 (#7562 ) ## Summary We can do a good enough job detecting this with our existing semantic model. Closes https://github.com/astral-sh/ruff/issues/3237.	2023-09-21 04:37:38 +00:00
Tom Kuson	511cc25fc4	[`refurb`] Implement `unnecessary-enumerate` (`FURB148`) (#7454 ) ## Summary Implement [`no-ignored-enumerate-items`](https://github.com/dosisod/refurb/blob/master/refurb/checks/builtin/no_ignored_enumerate.py) as `unnecessary-enumerate` (`FURB148`). The auto-fix considers if a `start` argument is passed to the `enumerate()` function. If only the index is used, then the suggested fix is to pass the `start` value to the `range()` function. So, ```python for i, _ in enumerate(xs, 1): ... ``` becomes ```python for i in range(1, len(xs)): ... ``` If the index is ignored and only the value is ignored, and if a start value greater than zero is passed to `enumerate()`, the rule doesn't produce a suggestion. I couldn't find a unanimously accepted best way to iterate over a collection whilst skipping the first n elements. The rule still triggers, however. Related to #1348. ## Test Plan `cargo test` --------- Co-authored-by: Dhruv Manilawala <dhruvmanila@gmail.com> Co-authored-by: Charlie Marsh <charlie.r.marsh@gmail.com>	2023-09-19 20:19:28 +00:00
Charlie Marsh	28b48ab902	Avoid flagging starred expressions in UP007 (#7505 ) ## Summary These can't be fixed, because fixing them would lead to invalid syntax. So flagging them also feels misleading. Closes https://github.com/astral-sh/ruff/issues/7452.	2023-09-19 03:37:38 +00:00
Charlie Marsh	6856d0b44b	Use dot references in docs for methods (#7391 ) ## Summary This matches the convention used in the Python documentation.	2023-09-14 14:35:34 -04:00
Jelle van der Waa	04183b0299	[pylint] Implement `too-many-public-methods` rule (PLR0904) (#6179 ) Implement https://pylint.pycqa.org/en/latest/user_guide/messages/refactor/too-many-public-methods.html Confusingly the rule page mentions a max of 7 while in practice it is 20. https://github.com/search?q=repo%3Apylint-dev%2Fpylint+max-public-methods&type=code ## Summary Implement pylint's R0904 ## Test Plan Unit tests.	2023-09-14 00:52:26 +00:00
qdegraaf	f3aaf84a28	Move `refurb/helpers` utils to `ruff_python_semantic` for broader use (#6990 ) ## Summary The utils added for `refurb` in its `helpers.rs` file could be useful for many other plugins. (Such as the PERF4XX codes, see e.g. https://github.com/astral-sh/ruff/pull/6132 ). This PR moves them to `ruff_python_semantic::analyzers::typing` as suggested in https://github.com/astral-sh/ruff/pull/6132#issuecomment-1697910093 ## Test Plan Confirmed `refurb` and all other tests still work	2023-08-29 14:45:09 -04:00
Valeriy Savchenko	26d53c56a2	[refurb] Implement `repeated-append` rule (`FURB113`) (#6702 ) ## Summary As an initial effort with replicating `refurb` rules (#1348 ), this PR adds support for [FURB113](https://github.com/dosisod/refurb/blob/master/refurb/checks/builtin/list_extend.py) and adds a new category of checks. ## Test Plan I included a new test + checked that all other tests pass.	2023-08-28 22:51:59 +00:00
Jelle van der Waa	af61abc747	Re-use is_magic where possible (#6945 ) ## Summary Use `is_magic` where possible ## Test Plan Unit tests	2023-08-28 15:35:34 +00:00
Charlie Marsh	fc89976c24	Move `Ranged` into `ruff_text_size` (#6919 ) ## Summary The motivation here is that this enables us to implement `Ranged` in crates that don't depend on `ruff_python_ast`. Largely a mechanical refactor with a lot of regex, Clippy help, and manual fixups. ## Test Plan `cargo test`	2023-08-27 14:12:51 -04:00
Charlie Marsh	d0b051e447	Fix ranges for global usages (#6917 ) ## Summary The range of the usage from `Globals` should be the range of the identifier, not the range of the full `global pandas` statement. Closes https://github.com/astral-sh/ruff/issues/6914.	2023-08-27 15:27:07 +00:00
Zanie Blue	417a1d0717	Update `mutable-argument-default` (`B006`) to use `extend-immutable-calls` when determining if annotations are immutable (#6781 ) Part of https://github.com/astral-sh/ruff/issues/3762	2023-08-23 15:44:35 +00:00
Charlie Marsh	424b8d4ad2	Use a single node hierarchy to track statements and expressions (#6709 ) ## Summary This PR is a follow-up to the suggestion in https://github.com/astral-sh/ruff/pull/6345#discussion_r1285470953 to use a single stack to store all statements and expressions, rather than using separate vectors for each, which gives us something closer to a full-fidelity chain. (We can then generalize this concept to include all other AST nodes too.) This is in part made possible by the removal of the hash map from `&Stmt` to `StatementId` (#6694), which makes it much cheaper to store these using a single interface (since doing so no longer introduces the requirement that we hash all expressions). I'll follow-up with some profiling, but a few notes on how the data requirements have changed: - We now store a `BranchId` for every expression, not just every statement, so that's an extra `u32`. - We now store a single `NodeId` on every snapshot, rather than separate `StatementId` and `ExpressionId` IDs, so that's one fewer `u32` for each snapshot. - We're probably doing a few more lookups in general, since any calls to `current_statement()` etc. now have to iterate up the node hierarchy until they identify the first statement. ## Test Plan `cargo test`	2023-08-21 21:32:57 -04:00
Charlie Marsh	da1697121e	Add `BranchId` to the model snapshot (#6706 ) This _probably_ never matters given the set of rules we support and in fact I'm having trouble thinking of a test-case for it, but it's definitely incorrect _not_ to pass on the `BranchId` here.	2023-08-20 15:35:49 +00:00
Charlie Marsh	17af12e57c	Add branch detection to the semantic model (#6694 ) ## Summary We have a few rules that rely on detecting whether two statements are in different branches -- for example, different arms of an `if`-`else`. Historically, the way this was implemented is that, given two statement IDs, we'd find the common parent (by traversing upwards via our `Statements` abstraction); then identify branches "manually" by matching the parents against `try`, `if`, and `match`, and returning iterators over the arms; then check if there's an arm for which one of the statements is a child, and the other is not. This has a few drawbacks: 1. First, the code is generally a bit hard to follow (Konsti mentioned this too when working on the `ElifElseClause` refactor). 2. Second, this is the only place in the codebase where we need to go from `&Stmt` to `StatementID` -- _everywhere_ else, we only need to go in the _other_ direction. Supporting these lookups means we need to maintain a mapping from `&Stmt` to `StatementID` that includes every `&Stmt` in the program. (We _also_ end up maintaining a `depth` level for every statement.) I'd like to get rid of these requirements to improve efficiency, reduce complexity, and enable us to treat AST modes more generically in the future. (When I looked at adding the `&Expr` to our existing statement-tracking infrastructure, maintaining a hash map with all the statements noticeably hurt performance.) The solution implemented here instead makes branches a first-class concept in the semantic model. Like with `Statements`, we now have a `Branches` abstraction, where each branch points to its optional parent. When we store statements, we store the `BranchID` alongside each statement. When we need to detect whether two statements are in the same branch, we just realize each statement's branch path and compare the two. (Assuming that the two statements are in the same scope, then they're on the same branch IFF one branch path is a subset of the other, starting from the top.) We then add some calls to the visitor to push and pop branches in the appropriate places, for `if`, `try`, and `match` statements. Note that a branch is not 1:1 with a statement; instead, each branch is closer to a suite, but not _every_ suite is a branch. For example, each arm in an `if`-`elif`-`else` is a branch, but the `else` in a `for` loop is not considered a branch. In addition to being much simpler, this should also be more efficient, since we've shed the entire `&Stmt` hash map, plus the `depth` that we track on `StatementWithParent` in favor of a single `Option<BranchID>` on `StatementWithParent` plus a single vector for all branches. The lookups should be faster too, since instead of doing a bunch of jumps around with the hash map + repeated recursive calls to find the common parents, we instead just do a few simple lookups in the `Branches` vector to realize and compare the branch paths. ## Test Plan `cargo test` -- we have a lot of coverage for this, which we inherited from PyFlakes	2023-08-19 21:28:17 +00:00
Charlie Marsh	db1c556508	Implement `Ranged` on more structs (#6639 ) ## Summary I noticed some inconsistencies around uses of `.range.start()`, structs that have a `TextRange` field but don't implement `Ranged`, etc. ## Test Plan `cargo test`	2023-08-17 11:22:39 -04:00
Charlie Marsh	96d310fbab	Remove `Stmt::TryStar` (#6566 ) ## Summary Instead, we set an `is_star` flag on `Stmt::Try`. This is similar to the pattern we've migrated towards for `Stmt::For` (removing `Stmt::AsyncFor`) and friends. While these are significant differences for an interpreter, we tend to handle these cases identically or nearly identically. ## Test Plan `cargo test`	2023-08-14 13:39:44 -04:00
Charlie Marsh	1a9536c4e2	Remove `SemanticModel#find_binding` (#6546 ) ## Summary This method is almost never what you actually want, because it doesn't respect Python's scoping semantics. For example, if you call this within a class method, it will return class attributes, whereas Python actually _skips_ symbols in classes unless the load occurs within the class itself. I also want to move away from these kinds of dynamic lookups and more towards `resolve_name`, which performs a lookup based on the stored `BindingId` at the time of symbol resolution, and will make it much easier for us to separate model building from linting in the near future. ## Test Plan `cargo test`	2023-08-14 00:09:05 -04:00
Charlie Marsh	768686148f	Add support for unions to our Python builtins type system (#6541 ) ## Summary Fixes some TODOs introduced in https://github.com/astral-sh/ruff/pull/6538. In short, given an expression like `1 if x > 0 else "Hello, world!"`, we now return a union type that says the expression can resolve to either an `int` or a `str`. The system remains very limited, it only works for obvious primitive types, and there's no attempt to do inference on any more complex variables. (If any expression yields `Unknown` or `TypeError`, we propagate that result throughout and abort on the client's end.)	2023-08-13 18:00:50 -04:00
Charlie Marsh	446ceed1ad	Support `IfExp` with dual string arms in `invalid-envvar-value` (#6538 ) ## Summary Closes https://github.com/astral-sh/ruff/issues/6537. We need to improve the `PythonType` algorithm, so this also documents some of its limitations as TODOs.	2023-08-13 15:52:10 -04:00
Charlie Marsh	6706ae4828	Respect scoping rules when identifying builtins (#6468 ) ## Summary Our `is_builtin` check did a naive walk over the parent scopes; instead, it needs to (e.g.) skip symbols in a class scope if being called outside of the class scope itself. Closes https://github.com/astral-sh/ruff/issues/6466. ## Test Plan `cargo test`	2023-08-10 10:20:09 -04:00
Charlie Marsh	a2758513de	Fix false-positive in submodule resolution (#6435 ) Closes https://github.com/astral-sh/ruff/issues/6433.	2023-08-09 02:36:39 +00:00
Charlie Marsh	3d06fe743d	Change `model: &SemanticModel` to `semantic: &SemanticModel` (#6406 ) Use the same naming conventions everywhere. See: https://github.com/astral-sh/ruff/pull/6314/files#r1284457874.	2023-08-07 16:32:55 -04:00
Charlie Marsh	26098b8d91	Extend nested union detection to handle bitwise or `Union` expressions (#6399 ) ## Summary We have some logic in the expression analyzer method to avoid re-checking the inner `Union` in `Union[Union[...]]`, since the methods that analyze `Union` expressions already recurse. Elsewhere, we have logic to avoid re-checking the inner `\|` in `int \| (int \| str)`, for the same reason. This PR unifies that logic into a single method _and_ ensures that, just as we recurse over both `Union` and `\|`, we also detect that we're in _either_ kind of nested union. Closes https://github.com/astral-sh/ruff/issues/6285. ## Test Plan Added some new snapshots.	2023-08-07 15:17:26 -04:00
Charlie Marsh	3f0eea6d87	Rename `JoinedStr` to `FString` in the AST (#6379 ) ## Summary Per the proposal in https://github.com/astral-sh/ruff/discussions/6183, this PR renames the `JoinedStr` node to `FString`.	2023-08-07 17:33:17 +00:00
Charlie Marsh	c439435615	Use dedicated AST nodes on `MemberKind` (#6374 ) ## Summary This PR leverages the unified function definition node to add precise AST node types to `MemberKind`, which is used to power our docstring definition tracking (e.g., classes and functions, whether they're methods or functions or nested functions and so on, whether they have a docstring, etc.). It was painful to do this in the past because the function variants needed to support a union anyway, but storing precise nodes removes like a dozen panics. No behavior changes -- purely a refactor. ## Test Plan `cargo test`	2023-08-07 17:17:58 +00:00
Charlie Marsh	daefa74e9a	Remove async AST node variants for `with`, `for`, and `def` (#6369 ) ## Summary Per the suggestion in https://github.com/astral-sh/ruff/discussions/6183, this PR removes `AsyncWith`, `AsyncFor`, and `AsyncFunctionDef`, replacing them with an `is_async` field on the non-async variants of those structs. Unlike an interpreter, we _generally_ have identical handling for these nodes, so separating them into distinct variants adds complexity from which we don't really benefit. This can be seen below, where we get to remove a _ton_ of code related to adding generic `Any*` wrappers, and a ton of duplicate branches for these cases. ## Test Plan `cargo test` is unchanged, apart from parser snapshots.	2023-08-07 16:36:02 +00:00
Charlie Marsh	c895252aae	Remove `RefEquality` (#6393 ) ## Summary See discussion in https://github.com/astral-sh/ruff/pull/6351#discussion_r1284996979. We can remove `RefEquality` entirely and instead use a text offset for statement keys, since no two statements can start at the same text offset. ## Test Plan `cargo test`	2023-08-07 16:04:50 +00:00
Charlie Marsh	9328606843	Remove `Statements#parent` (#6392 ) Discussed in https://github.com/astral-sh/ruff/pull/6351#discussion_r1284997065.	2023-08-07 15:41:02 +00:00
Charlie Marsh	b21abe0a57	Use separate structs for expression and statement tracking (#6351 ) ## Summary This PR fixes the performance degradation introduced in https://github.com/astral-sh/ruff/pull/6345. Instead of using the generic `Nodes` structs, we now use separate `Statement` and `Expression` structs. Importantly, we can avoid tracking a bunch of state for expressions that we need for parents: we don't need to track reference-to-ID pointers (we just have no use-case for this -- I'd actually like to remove this from statements too, but we need it for branch detection right now), we don't need to track depth, etc. In my testing, this entirely removes the regression on all-rules, and gets us down to 2ms slower on the default rules (as a crude hyperfine benchmark, so this is within margin of error IMO). No behavioral changes.	2023-08-07 15:27:42 +00:00
Charlie Marsh	61d3977f95	Make the `statement` vector private on `SemanticModel` (#6348 ) ## Summary Instead, expose these as methods, now that we can use a reasonable nomenclature on the API.	2023-08-07 15:02:14 +00:00
Charlie Marsh	bae87fa016	Rename semantic model methods to use `current_*` prefix (#6347 ) ## Summary This PR attempts to draw a clearer divide between "methods that take (e.g.) an expression or statement as input" and "methods that rely on the _current_ expression or statement" in the semantic model, by renaming methods like `stmt()` to `current_statement()`. This had led to confusion in the past. For example, prior to this PR, we had `scope()` (which returns the current scope), and `parent_scope`, which returns the parent _of a scope that's passed in_. Now, the API is clearer: `current_scope` returns the current scope, and `parent_scope` takes a scope as argument and returns its parent. Per above, I also changed `stmt` to `statement` and `expr` to `expression`.	2023-08-07 14:44:49 +00:00
Charlie Marsh	89e4e038b0	Store expression hierarchy in semantic model snapshots (#6345 ) ## Summary When we iterate over the AST for analysis, we often process nodes in a "deferred" manner. For example, if we're analyzing a function, we push the function body onto a deferred stack, along with a snapshot of the current semantic model state. Later, when we analyze the body, we restore the semantic model state from the snapshot. This ensures that we know the correct scope, hierarchy of statement parents, etc., when we go to analyze the function body. Historically, we _haven't_ included the _expression_ hierarchy in the model snapshot -- so we track the current expression parents in the visitor, but we never save and restore them when processing deferred nodes. This can lead to subtle bugs, in that methods like `expr_parent()` aren't guaranteed to be correct, if you're in a deferred visitor. This PR migrates expression tracking to mirror statement tracking exactly. So we push all expressions onto an `IndexVec`, and include the current expression on the snapshot. This ensures that `expr_parent()` and related methods are "always correct" rather than "sometimes correct". There's a performance cost here, both at runtime and in terms of memory consumption (we now store an additional pointer for every expression). In my hyperfine testing, it's about a 1% performance decrease for all-rules on CPython (up to 533.8ms, from 528.3ms) and a 4% performance decrease for default-rules on CPython (up to 212ms, from 204ms). However... I think this is worth it given the incorrectness of our current approach. In the future, we may want to reconsider how we do these upward traversals (e.g., with something like a red-green tree). (Note: in https://github.com/astral-sh/ruff/pull/6351, the slowdown seems to be entirely removed.)	2023-08-07 09:42:04 -04:00
Zixuan Li	be657f5e7e	Respect typing_extensions imports of Annotated for B006. (#6361 ) `typing_extensions.Annotated` should be treated the same way as `typing.Annotated`.	2023-08-05 17:39:52 +00:00
Charlie Marsh	76148ddb76	Store call paths rather than stringified names (#6102 ) ## Summary Historically, we've stored "qualified names" on our `BindingKind::Import`, `BindingKind::SubmoduleImport`, and `BindingKind::ImportFrom` structs. In Ruff, a "qualified name" is a dot-separated path to a symbol. For example, given `import foo.bar`, the "qualified name" would be `"foo.bar"`; and given `from foo.bar import baz`, the "qualified name" would be `foo.bar.baz`. This PR modifies the `BindingKind` structs to instead store _call paths_ rather than qualified names. So in the examples above, we'd store `["foo", "bar"]` and `["foo", "bar", "baz"]`. It turns out that this more efficient given our data access patterns. Namely, we frequently need to convert the qualified name to a call path (whenever we call `resolve_call_path`), and it turns out that we do this operation enough that those conversations show up on benchmarks. There are a few other advantages to using call paths, rather than qualified names: 1. The size of `BindingKind` is reduced from 32 to 24 bytes, since we no longer need to store a `String` (only a boxed slice). 2. All three import types are more consistent, since they now all store a boxed slice, rather than some storing an `&str` and some storing a `String` (for `BindingKind::ImportFrom`, we needed to allocate a `String` to create the qualified name, but the call path is a slice of static elements that don't require that allocation). 3. A lot of code gets simpler, in part because we now do call path resolution "earlier". Most notably, for relative imports (`from .foo import bar`), we store the _resolved_ call path rather than the relative call path, so the semantic model doesn't have to deal with that resolution. (See that `resolve_call_path` is simpler, fewer branches, etc.) In my testing, this change improves the all-rules benchmark by another 4-5% on top of the improvements mentioned in #6047.	2023-08-05 15:21:50 +00:00
Charlie Marsh	d788957ec4	Allow capitalized names for logger candidate heuristic match (#6356 ) Closes https://github.com/astral-sh/ruff/issues/6353.	2023-08-04 23:25:34 +00:00
Charlie Marsh	8a5bc93fdd	Make the `Nodes` vector generic on node type (#6328 )	2023-08-04 03:57:15 +00:00
Charlie Marsh	2fa508793f	Return a slice in `StmtClassDef#bases` (#6311 ) Slices are strictly more flexible, since you can always convert to an iterator, etc., but not the other way around. Suggested in https://github.com/astral-sh/ruff/pull/6259#discussion_r1282730994.	2023-08-03 16:21:55 +00:00
Charlie Marsh	9f3567dea6	Use `range: _` in lieu of `range: _range` (#6296 ) ## Summary `range: _range` is slightly inconvenient because you can't use it multiple times within a single match, unlike `_`.	2023-08-02 22:11:13 -04:00
Charlie Marsh	041946fb64	Remove `CallArguments` abstraction (#6279 ) ## Summary This PR removes a now-unnecessary abstraction from `helper.rs` (`CallArguments`), in favor of adding methods to `Arguments` directly, which helps with discoverability.	2023-08-02 13:25:43 -04:00
Charlie Marsh	981e64f82b	Introduce an `Arguments` AST node for function calls and class definitions (#6259 ) ## Summary This PR adds a new `Arguments` AST node, which we can use for function calls and class definitions. The `Arguments` node spans from the left (open) to right (close) parentheses inclusive. In the case of classes, the `Arguments` is an option, to differentiate between: ```python # None class C: ... # Some, with empty vectors class C(): ... ``` In this PR, we don't really leverage this change (except that a few rules get much simpler, since we don't need to lex to find the start and end ranges of the parentheses, e.g., `crates/ruff/src/rules/pyupgrade/rules/lru_cache_without_parameters.rs`, `crates/ruff/src/rules/pyupgrade/rules/unnecessary_class_parentheses.rs`). In future PRs, this will be especially helpful for the formatter, since we can track comments enclosed on the node itself. ## Test Plan `cargo test`	2023-08-02 10:01:13 -04:00
konsti	1df7e9831b	Replace `.map_or(false, $closure)` with `.is_some_and(closure)` (#6244 ) Summary [Option::is_some_and](https://doc.rust-lang.org/stable/std/option/enum.Option.html#method.is_some_and) and [Result::is_ok_and](https://doc.rust-lang.org/std/result/enum.Result.html#method.is_ok_and) are new methods is rust 1.70. I find them way more readable than `.map_or(false, ...)`. The changes are `s/.map_or(false,/.is_some_and(/g`, then manually switching to `is_ok_and` where the value is a Result rather than an Option. Test Plan n/a^	2023-08-01 19:29:42 +02:00
Charlie Marsh	de898c52eb	Avoid falsely marking non-submodules as submodule aliases (#6182 ) ## Summary We have some code to ensure that if an aliased import is used, any submodules should be marked as used too. This comment says it best: ```rust // If the name of a submodule import is the same as an alias of another import, and the // alias is used, then the submodule import should be marked as used too. // // For example, mark `pyarrow.csv` as used in: // // ```python // import pyarrow as pa // import pyarrow.csv // print(pa.csv.read_csv("test.csv")) // ``` ``` However, it looks like when we go to look up `pyarrow` (of `import pyarrow as pa`), we aren't checking to ensure the resolved binding is _actually_ an import. This was causing us to attribute `print(rm.ANY)` to `def requests_mock` here: ```python import requests_mock as rm def requests_mock(requests_mock: rm.Mocker): print(rm.ANY) ``` Closes https://github.com/astral-sh/ruff/issues/6180.	2023-07-30 22:16:25 +00:00
Zanie Blue	047c211837	Add semantic analysis of type aliases and parameters (#6109 ) Requires https://github.com/astral-sh/RustPython-Parser/pull/42 Related https://github.com/PyCQA/pyflakes/pull/778 [PEP-695](https://peps.python.org/pep-0695) Part of #5062 <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary <!-- What's the purpose of the change? What does it do, and why? --> Adds a scope for type parameters, a type parameter binding kind, and checker visitation of type parameters in type alias statements, function definitions, and class definitions. A few changes were necessary to ensure correctness following the insertion of a new scope between function and class scopes and their parent. ## Test Plan <!-- How was it tested? --> Undefined name snapshots. Unused type parameter rule will be added as follow-up.	2023-07-28 17:06:37 -05:00
Charlie Marsh	0bc3edf6c9	Add documentation and test cases for redefinition (#6135 )	2023-07-28 00:01:42 +00:00
Charlie Marsh	e15b9c5572	Cache name resolutions in the semantic model (#6047 ) ## Summary This PR stores the mapping from `ExprName` node to resolved `BindingId`, which lets us skip scope lookups in `resolve_call_path`. It's enabled by #6045, since that PR ensures that when we analyze a node (and thus call `resolve_call_path`), we'll have already visited its `ExprName` elements. In more detail: imagine that we're traversing over `foo.bar()`. When we read `foo`, it will be an `ExprName`, which we'll then resolve to a binding via `handle_node_load`. With this change, we then store that binding in a map. Later, if we call `collect_call_path` on `foo.bar`, we'll identify `foo` (the "head" of the attribute) and grab the resolved binding in that map. _Almost_ all names are now resolved in advance, though it's not a strict requirement, and some rules break that pattern (e.g., if we're analyzing arguments, and they need to inspect their annotations, which are visited in a deferred manner). This improves performance by 4-6% on the all-rules benchmark. It looks like it hurts performance (1-2% drop) in the default-rules benchmark, presumedly because those rules don't call `resolve_call_path` nearly as much, and so we're paying for these extra writes. Here's the benchmark data: ``` linter/default-rules/numpy/globals.py time: [67.270 µs 67.380 µs 67.489 µs] thrpt: [43.720 MiB/s 43.792 MiB/s 43.863 MiB/s] change: time: [+0.4747% +0.7752% +1.0626%] (p = 0.00 < 0.05) thrpt: [-1.0514% -0.7693% -0.4724%] Change within noise threshold. Found 1 outliers among 100 measurements (1.00%) 1 (1.00%) high severe linter/default-rules/pydantic/types.py time: [1.4067 ms 1.4105 ms 1.4146 ms] thrpt: [18.028 MiB/s 18.081 MiB/s 18.129 MiB/s] change: time: [+1.3152% +1.6953% +2.0414%] (p = 0.00 < 0.05) thrpt: [-2.0006% -1.6671% -1.2981%] Performance has regressed. linter/default-rules/numpy/ctypeslib.py time: [637.67 µs 638.96 µs 640.28 µs] thrpt: [26.006 MiB/s 26.060 MiB/s 26.113 MiB/s] change: time: [+1.5859% +1.8109% +2.0353%] (p = 0.00 < 0.05) thrpt: [-1.9947% -1.7787% -1.5611%] Performance has regressed. linter/default-rules/large/dataset.py time: [3.2289 ms 3.2336 ms 3.2383 ms] thrpt: [12.563 MiB/s 12.581 MiB/s 12.599 MiB/s] change: time: [+0.8029% +0.9898% +1.1740%] (p = 0.00 < 0.05) thrpt: [-1.1604% -0.9801% -0.7965%] Change within noise threshold. linter/all-rules/numpy/globals.py time: [134.05 µs 134.15 µs 134.26 µs] thrpt: [21.977 MiB/s 21.995 MiB/s 22.012 MiB/s] change: time: [-4.4571% -4.1175% -3.8268%] (p = 0.00 < 0.05) thrpt: [+3.9791% +4.2943% +4.6651%] Performance has improved. Found 8 outliers among 100 measurements (8.00%) 2 (2.00%) low mild 3 (3.00%) high mild 3 (3.00%) high severe linter/all-rules/pydantic/types.py time: [2.5627 ms 2.5669 ms 2.5720 ms] thrpt: [9.9158 MiB/s 9.9354 MiB/s 9.9516 MiB/s] change: time: [-5.8304% -5.6374% -5.4452%] (p = 0.00 < 0.05) thrpt: [+5.7587% +5.9742% +6.1914%] Performance has improved. Found 7 outliers among 100 measurements (7.00%) 6 (6.00%) high mild 1 (1.00%) high severe linter/all-rules/numpy/ctypeslib.py time: [1.3949 ms 1.3956 ms 1.3964 ms] thrpt: [11.925 MiB/s 11.931 MiB/s 11.937 MiB/s] change: time: [-6.2496% -6.0856% -5.9293%] (p = 0.00 < 0.05) thrpt: [+6.3030% +6.4799% +6.6662%] Performance has improved. Found 7 outliers among 100 measurements (7.00%) 3 (3.00%) high mild 4 (4.00%) high severe linter/all-rules/large/dataset.py time: [5.5951 ms 5.6019 ms 5.6093 ms] thrpt: [7.2527 MiB/s 7.2623 MiB/s 7.2711 MiB/s] change: time: [-5.1781% -4.9783% -4.8070%] (p = 0.00 < 0.05) thrpt: [+5.0497% +5.2391% +5.4608%] Performance has improved. ``` Still playing with this (the concepts need better names, documentation, etc.), but opening up for feedback.	2023-07-27 13:01:56 -04:00
Micha Reiser	40f54375cb	Pull in RustPython parser (#6099 )	2023-07-27 09:29:11 +00:00
Victor Hugo Gomes	86539c1fc5	[`flake8-pyi`] Implement `PYI046` (#6098 ) ## Summary Checks for the presence of unused private `typing.Protocol` definitions. ref #848 ## Test Plan Snapshots and manual runs of flake8.	2023-07-27 02:34:56 +00:00
Victor Hugo Gomes	c0dbcb3434	[`flake8-pyi`] Implement PYI018 (#6018 ) ## Summary Check for unused private `TypeVar`. See [original implementation](`2a86db8271/pyi.py (L1958)`). ``` $ flake8 --select Y018 crates/ruff/resources/test/fixtures/flake8_pyi/PYI018.pyi crates/ruff/resources/test/fixtures/flake8_pyi/PYI018.pyi:4:1: Y018 TypeVar "_T" is not used crates/ruff/resources/test/fixtures/flake8_pyi/PYI018.pyi:5:1: Y018 TypeVar "_P" is not used ``` ``` $ ./target/debug/ruff --select PYI018 crates/ruff/resources/test/fixtures/flake8_pyi/PYI018.pyi --no-cache crates/ruff/resources/test/fixtures/flake8_pyi/PYI018.pyi:4:1: PYI018 TypeVar `_T` is never used crates/ruff/resources/test/fixtures/flake8_pyi/PYI018.pyi:5:1: PYI018 TypeVar `_P` is never used Found 2 errors. ``` In the file `unused_private_type_declaration.rs`, I'm planning to add other rules that are similar to `PYI018` like the `PYI046`, `PYI047` and `PYI049`. ref #848 ## Test Plan Snapshots and manual runs of flake8.	2023-07-26 22:56:15 +00:00
Zanie Blue	2d2673f613	Add comment regarding class scope short circuit (#6101 )	2023-07-26 14:55:05 -05:00
Micha Reiser	2cf00fee96	Remove parser dependency from ruff-python-ast (#6096 )	2023-07-26 17:47:22 +02:00
Charlie Marsh	c8ee357613	Remove relative import handling from `BindingKind::Import` case (#6084 ) ## Summary Only `ImportFrom` imports can be relative, this is just unused.	2023-07-26 00:17:41 -04:00
Charlie Marsh	ed72c027a3	Replace `NoHashHasher` usages with `FxHashMap` (#6049 ) ## Summary I had always assumed that `NoHashHasher` would be faster when using integer keys, but benchmarking shows otherwise: ``` linter/default-rules/numpy/globals.py time: [66.544 µs 66.606 µs 66.678 µs] thrpt: [44.253 MiB/s 44.300 MiB/s 44.342 MiB/s] change: time: [-0.1843% +0.1087% +0.3718%] (p = 0.46 > 0.05) thrpt: [-0.3704% -0.1086% +0.1847%] No change in performance detected. Found 1 outliers among 100 measurements (1.00%) 1 (1.00%) high mild linter/default-rules/pydantic/types.py time: [1.3787 ms 1.3811 ms 1.3837 ms] thrpt: [18.431 MiB/s 18.466 MiB/s 18.498 MiB/s] change: time: [-0.4827% -0.1074% +0.1927%] (p = 0.56 > 0.05) thrpt: [-0.1924% +0.1075% +0.4850%] No change in performance detected. linter/default-rules/numpy/ctypeslib.py time: [624.82 µs 625.96 µs 627.17 µs] thrpt: [26.550 MiB/s 26.601 MiB/s 26.650 MiB/s] change: time: [-0.7071% -0.4908% -0.2736%] (p = 0.00 < 0.05) thrpt: [+0.2744% +0.4932% +0.7122%] Change within noise threshold. linter/default-rules/large/dataset.py time: [3.1585 ms 3.1634 ms 3.1685 ms] thrpt: [12.840 MiB/s 12.861 MiB/s 12.880 MiB/s] change: time: [-1.5338% -1.3463% -1.1476%] (p = 0.00 < 0.05) thrpt: [+1.1610% +1.3647% +1.5577%] Performance has improved. linter/all-rules/numpy/globals.py time: [140.17 µs 140.37 µs 140.58 µs] thrpt: [20.989 MiB/s 21.020 MiB/s 21.051 MiB/s] change: time: [-0.1066% +0.3140% +0.7479%] (p = 0.14 > 0.05) thrpt: [-0.7423% -0.3130% +0.1067%] No change in performance detected. Found 3 outliers among 100 measurements (3.00%) 2 (2.00%) high mild 1 (1.00%) high severe linter/all-rules/pydantic/types.py time: [2.7030 ms 2.7069 ms 2.7112 ms] thrpt: [9.4064 MiB/s 9.4216 MiB/s 9.4351 MiB/s] change: time: [-0.6721% -0.4874% -0.2974%] (p = 0.00 < 0.05) thrpt: [+0.2982% +0.4898% +0.6766%] Change within noise threshold. Found 14 outliers among 100 measurements (14.00%) 12 (12.00%) high mild 2 (2.00%) high severe linter/all-rules/numpy/ctypeslib.py time: [1.4709 ms 1.4727 ms 1.4749 ms] thrpt: [11.290 MiB/s 11.306 MiB/s 11.320 MiB/s] change: time: [-1.1617% -0.9766% -0.8094%] (p = 0.00 < 0.05) thrpt: [+0.8160% +0.9862% +1.1754%] Change within noise threshold. Found 12 outliers among 100 measurements (12.00%) 9 (9.00%) high mild 3 (3.00%) high severe linter/all-rules/large/dataset.py time: [5.8086 ms 5.8163 ms 5.8240 ms] thrpt: [6.9854 MiB/s 6.9946 MiB/s 7.0038 MiB/s] change: time: [-1.5651% -1.3536% -1.1584%] (p = 0.00 < 0.05) thrpt: [+1.1720% +1.3721% +1.5900%] Performance has improved. ``` My guess is that `NoHashHasher` underperforms because the keys are not randomly distributed... Anyway, it's a ~1% (significant) performance gain on some of the above, plus we get to remove a dependency.	2023-07-24 23:41:57 +00:00
Charlie Marsh	f9726af4ef	Allow specification of `logging.Logger` re-exports via `logger-objects` (#5750 ) ## Summary This PR adds a `logger-objects` setting that allows users to mark specific symbols a `logging.Logger` objects. Currently, if a `logger` is imported, we only flagged it as a `logging.Logger` if it comes exactly from the `logging` module or is `flask.current_app.logger`. This PR allows users to mark specific loggers, like `logging_setup.logger`, to ensure that they're covered by the `flake8-logging-format` rules and others. For example, if you have a module `logging_setup.py` with the following contents: ```python import logging logger = logging.getLogger(__name__) ``` Adding `"logging_setup.logger"` to `logger-objects` will ensure that `logging_setup.logger` is treated as a `logging.Logger` object when imported from other modules (e.g., `from logging_setup import logger`). Closes https://github.com/astral-sh/ruff/issues/5694.	2023-07-24 00:38:20 -04:00
Charlie Marsh	057faabcdd	Use `Flags::intersects` rather than `Flags::contains` (#6007 ) ## Summary This is equivalent for a single flag, but I think it's more likely to be correct when the bitflags are modified -- the primary reason being that we sometimes define flags as the union of other flags, e.g.: ```rust const ANNOTATION = Self::TYPING_ONLY_ANNOTATION.bits() \| Self::RUNTIME_ANNOTATION.bits(); ``` In this case, `flags.contains(Flag::ANNOTATION)` requires that _both_ flags in the union are set, whereas `flags.intersects(Flag::ANNOTATION)` requires that _at least one_ flag is set.	2023-07-23 02:59:31 +00:00
Charlie Marsh	0bb175f7f6	Store flags rather than `ExecutionContext` on references (#6006 )	2023-07-23 02:54:39 +00:00
Charlie Marsh	4b2ec7d562	Move runtime execution context into add_reference calls (#6003 )	2023-07-23 02:37:51 +00:00
Charlie Marsh	4aac801277	Fix context-to-model references in `SemanticModel` documentation (#6004 )	2023-07-23 02:32:23 +00:00
Charlie Marsh	86b6a3e1ad	Remove nested f-string flag (#5966 ) ## Summary Not worth taking up a slot in the semantic model flags.	2023-07-21 22:51:37 -04:00
Charlie Marsh	fe7505b738	Move undefined deletions into post-model-building pass (#5904 ) ## Summary Similar to #5902, but for undefined names in deletions (e.g., `del x` where `x` is unbound).	2023-07-20 05:14:46 +00:00
Charlie Marsh	963f240e46	Track unresolved references in the semantic model (#5902 ) ## Summary As part of my continued quest to separate semantic model-building from diagnostic emission, this PR moves our unresolved-reference rules to a deferred pass. So, rather than emitting diagnostics as we encounter unresolved references, we now track those unresolved references on the semantic model (just like resolved references), and after traversal, emit the relevant rules for any unresolved references.	2023-07-19 18:19:55 -04:00
Charlie Marsh	9834c69c98	Remove `__all__` enforcement rules out of binding phase (#5897 ) ## Summary This PR moves two rules (`invalid-all-format` and `invalid-all-object`) out of the name-binding phase, and into the dedicated pass over all bindings that occurs at the end of the `Checker`. This is part of my continued quest to separate the semantic model-building logic from the actual rule enforcement.	2023-07-19 21:18:47 +00:00
Charlie Marsh	a75a6de577	Use a boxed slice for `Export` struct (#5887 ) ## Summary The vector of names here is immutable -- we never push to it after initialization. Boxing reduces the size of the variant from 32 bytes to 24 bytes. (See: https://nnethercote.github.io/perf-book/type-sizes.html#boxed-slices.) It doesn't make a difference here, since it's not the largest variant, but it still seems like a prudent change (and I was considering adding another field to this variant, though I may no longer do so).	2023-07-19 11:45:04 -04:00
Charlie Marsh	1181d25e5a	Move a few more candidate rules to the deferred `Binding`-only pass (#5853 ) ## Summary No behavior change, but this is in theory more efficient, since we can just iterate over the flat `Binding` vector rather than having to iterate over binding chains via the `Scope`.	2023-07-19 00:59:02 +00:00
Charlie Marsh	626d8dc2cc	Use `.as_ref()` in lieu of `&**` (#5874 ) I find this less opaque (and often more succinct).	2023-07-19 00:49:13 +00:00
Charlie Marsh	7ffcd93afd	Move unused deletion tracking to deferred analysis (#5852 ) ## Summary This PR moves the "unused exception" rule out of the visitor and into a deferred check. When we can base rules solely on the semantic model, we probably should, as it greatly simplifies the `Checker` itself.	2023-07-18 20:43:12 -04:00
Charlie Marsh	2d505e2b04	Remove suite body tracking from `SemanticModel` (#5848 ) ## Summary The `SemanticModel` currently stores the "body" of a given `Suite`, along with the current statement index. This is used to support "next sibling" queries, but we only use this in exactly one place -- the rule that simplifies constructs like this to `any` or `all`: ```python for x in y: if x == 0: return True return False ``` Instead of tracking the state, we can just do a (slightly more expensive) traversal, by finding the node within its parent and returning the next node in the body. Note that we'll only have to do this extremely rarely -- namely, for functions that contain something like: ```python for x in y: if x == 0: return True ```	2023-07-18 18:58:31 -04:00
konsti	730e6b2b4c	Refactor `StmtIf`: Formatter and Linter (#5459 ) ## Summary Previously, `StmtIf` was defined recursively as ```rust pub struct StmtIf { pub range: TextRange, pub test: Box<Expr>, pub body: Vec<Stmt>, pub orelse: Vec<Stmt>, } ``` Every `elif` was represented as an `orelse` with a single `StmtIf`. This means that this representation couldn't differentiate between ```python if cond1: x = 1 else: if cond2: x = 2 ``` and ```python if cond1: x = 1 elif cond2: x = 2 ``` It also makes many checks harder than they need to be because we have to recurse just to iterate over an entire if-elif-else and because we're lacking nodes and ranges on the `elif` and `else` branches. We change the representation to a flat ```rust pub struct StmtIf { pub range: TextRange, pub test: Box<Expr>, pub body: Vec<Stmt>, pub elif_else_clauses: Vec<ElifElseClause>, } pub struct ElifElseClause { pub range: TextRange, pub test: Option<Expr>, pub body: Vec<Stmt>, } ``` where `test: Some(_)` represents an `elif` and `test: None` an else. This representation is different tradeoff, e.g. we need to allocate the `Vec<ElifElseClause>`, the `elif`s are now different than the `if`s (which matters in rules where want to check both `if`s and `elif`s) and the type system doesn't guarantee that the `test: None` else is actually last. We're also now a bit more inconsistent since all other `else`, those from `for`, `while` and `try`, still don't have nodes. With the new representation some things became easier, e.g. finding the `elif` token (we can use the start of the `ElifElseClause`) and formatting comments for if-elif-else (no more dangling comments splitting, we only have to insert the dangling comment after the colon manually and set `leading_alternate_branch_comments`, everything else is taken of by having nodes for each branch and the usual placement.rs fixups). ## Merge Plan This PR requires coordination between the parser repo and the main ruff repo. I've split the ruff part, into two stacked PRs which have to be merged together (only the second one fixes all tests), the first for the formatter to be reviewed by @michareiser and the second for the linter to be reviewed by @charliermarsh. * MH: Review and merge https://github.com/astral-sh/RustPython-Parser/pull/20 * MH: Review and merge or move later in stack https://github.com/astral-sh/RustPython-Parser/pull/21 * MH: Review and approve https://github.com/astral-sh/RustPython-Parser/pull/22 * MH: Review and approve formatter PR https://github.com/astral-sh/ruff/pull/5459 * CM: Review and approve linter PR https://github.com/astral-sh/ruff/pull/5460 * Merge linter PR in formatter PR, fix ecosystem checks (ecosystem checks can't run on the formatter PR and won't run on the linter PR, so we need to merge them first) * Merge https://github.com/astral-sh/RustPython-Parser/pull/22 * Create tag in the parser, update linter+formatter PR * Merge linter+formatter PR https://github.com/astral-sh/ruff/pull/5459 --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-07-18 13:40:15 +02:00
David Szotten	52aa2fc875	upgrade rustpython to remove tuple-constants (#5840 ) c.f. https://github.com/astral-sh/RustPython-Parser/pull/28 Tests: No snapshots changed --------- Co-authored-by: Zanie <contact@zanie.dev>	2023-07-17 22:50:31 +00:00
Charlie Marsh	be6c744856	Include function name in `undocumented-param` message (#5818 ) Closes #5814.	2023-07-16 22:51:34 -04:00
Charlie Marsh	06b5c6c06f	Use `SmallVec#extend_from_slice` in lieu of `SmallVec#extend` (#5793 ) ## Summary There's a note in the docs that suggests this can be faster, and in the benchmarks it... seems like it is? Might just be noise but held up over a few runs. Before: <img width="1792" alt="Screen Shot 2023-07-15 at 9 10 06 PM" src="https://github.com/astral-sh/ruff/assets/1309177/973cd955-d4e6-4ae3-898e-90b7eb52ecf2"> After: <img width="1792" alt="Screen Shot 2023-07-15 at 9 10 09 PM" src="https://github.com/astral-sh/ruff/assets/1309177/1491b391-d219-48e9-aa47-110bc7dc7f90">	2023-07-15 21:25:12 -04:00
Charlie Marsh	3dc73395ea	Move `Literal` flag detection into recurse phase (#5768 ) ## Summary The AST pass is broken up into three phases: pre-visit (which includes analysis), recurse (visit all members), and post-visit (clean-up). We're not supposed to edit semantic model flags in the pre-visit phase, but it looks like we were for literal detection. This didn't matter in practice, but I'm looking into some AST refactors for which this _does_ cause issues. No behavior changes expected. ## Test Plan Good test coverage on these.	2023-07-15 02:04:15 +00:00
Charlie Marsh	932c9a4789	Extend PEP 604 rewrites to support some quoted annotations (#5725 ) ## Summary Python doesn't allow `"Foo" \| None` if the annotation will be evaluated at runtime (see the comments in the PR, or the semantic model documentation for more on what this means and when it is true), but it _does_ allow it if the annotation is typing-only. This, for example, is invalid, as Python will evaluate `"Foo" \| None` at runtime in order to populate the function's `__annotations__`: ```python def f(x: "Foo" \| None): ... ``` This, however, is valid: ```python def f(): x: "Foo" \| None ``` As is this: ```python from __future__ import annotations def f(x: "Foo" \| None): ... ``` Closes #5706.	2023-07-13 07:34:04 -04:00
Charlie Marsh	bf4b96c5de	Differentiate between runtime and typing-time annotations (#5575 ) ## Summary In Python, the annotations on `x` and `y` here have very different treatment: ```python def foo(x: int): y: int ``` The `int` in `x: int` is a runtime-required annotation, because `x` gets added to the function's `__annotations__`. You'll notice, for example, that this fails: ```python from typing import TYPE_CHECKING if TYPE_CHECKING: from foo import Bar def f(x: Bar): ... ``` Because `Bar` is required to be available at runtime, not just at typing time. Meanwhile, this succeeds: ```python from typing import TYPE_CHECKING if TYPE_CHECKING: from foo import Bar def f(): x: Bar = 1 f() ``` (Both cases are fine if you use `from __future__ import annotations`.) Historically, we've tracked those annotations that are _not_ runtime-required via the semantic model's `ANNOTATION` flag. But annotations that _are_ runtime-required have been treated as "type definitions" that aren't annotations. This causes problems for the flake8-future-annotations rules, which try to detect whether adding `from __future__ import annotations` would _allow_ you to rewrite a type annotation. We need to know whether we're in _any_ type annotation, runtime-required or not, since adding `from __future__ import annotations` will convert any runtime-required annotation to a typing-only annotation. This PR adds separate state to track these runtime-required annotations. The changes in the test fixtures are correct -- these were false negatives before. Closes https://github.com/astral-sh/ruff/issues/5574.	2023-07-07 00:21:44 -04:00
Charlie Marsh	9e1039f823	Enable attribute lookups via semantic model (#5536 ) ## Summary This PR enables us to resolve attribute accesses within files, at least for static and class methods. For example, we can now detect that this is a function access (and avoid a false-positive): ```python class Class: @staticmethod def error(): return ValueError("Something") # OK raise Class.error() ``` Closes #5487. Closes #5416.	2023-07-05 15:19:14 -04:00
Charlie Marsh	00fbbe4223	Remove some additional manual iterator matches (#5482 ) ## Summary I've done a few of these PRs, I thought I'd caught them all, but missed this pattern.	2023-07-03 16:29:59 +00:00
Anders Kaseorg	df13e69c3c	Format let-else with rustfmt nightly (#5461 ) Support for `let…else` formatting was just merged to nightly (rust-lang/rust#113225). Rerun `cargo fmt` with Rust nightly 2023-07-02 to pick this up. Followup to #939. Signed-off-by: Anders Kaseorg <andersk@mit.edu>	2023-07-03 02:13:35 +00:00
Charlie Marsh	0e89c94947	Run shadowed-variable analyses in deferred handlers (#5181 ) ## Summary This PR extracts a bunch of complex logic from `add_binding`, instead running the the shadowing rules in the deferred handler, thereby decoupling the binding phase (during which we build up the semantic model) from the analysis phase, and generally making `add_binding` much more focused. This was made possible by improving the semantic model to better handle deletions -- previously, we'd "lose track" of bindings if they were deleted, which made this kind of refactor impossible. ## Test Plan We have good automated coverage for this, but I want to benchmark it separately.	2023-06-29 00:08:18 +00:00
Charlie Marsh	cdbd0bd5cd	Respect `abc` decorators when classifying function types (#5315 ) Closes #5307.	2023-06-22 19:52:36 +00:00
Charlie Marsh	ecf61d49fa	Restore existing bindings when unbinding caught exceptions (#5256 ) ## Summary In the latest release, we made some improvements to the semantic model, but our modifications to exception-unbinding are causing some false-positives. For example: ```py try: v = 3 except ImportError as v: print(v) else: print(v) ``` In the latest release, we started unbinding `v` after the `except` handler. (We used to restore the existing binding, the `v = 3`, but this was quite complicated.) Because we don't have full branch analysis, we can't then know that `v` is still bound in the `else` branch. The solution here modifies `resolve_read` to skip-lookup when hitting unbound exceptions. So when store the "unbind" for `except ImportError as v`, we save the binding that it shadowed `v = 3`, and skip to that. Closes #5249. Closes #5250.	2023-06-21 12:53:58 -04:00
Charlie Marsh	310abc769d	Move `StarImport` to its own module (#5186 )	2023-06-20 13:12:46 -04:00
Charlie Marsh	36e01ad6eb	Upgrade RustPython (#5192 ) ## Summary This PR upgrade RustPython to pull in the changes to `Arguments` (zip defaults with their identifiers) and all the renames to `CmpOp` and friends.	2023-06-19 21:09:53 +00:00
Charlie Marsh	94abf7f088	Rename `Importation` structs to `Import` (#5185 ) ## Summary I find "Importation" a bit awkward, it may not even be grammatically correct here.	2023-06-19 12:09:10 -04:00
Charlie Marsh	a6cf31cc89	Move `dead_scopes` to `deferred.scopes` (#5171 ) ## Summary This is more consistent with the rest of the `deferred` patterns.	2023-06-18 15:57:38 +00:00
Charlie Marsh	d0ad1ed0af	Replace static `CallPath` vectors with `matches!` macros (#5148 ) ## Summary After #5140, I audited the codebase for similar patterns (defining a list of `CallPath` entities in a static vector, then looping over them to pattern-match). This PR migrates all other such cases to use `match` and `matches!` where possible. There are a few benefits to this: 1. It more clearly denotes the intended semantics (branches are exclusive). 2. The compiler can help deduplicate the patterns and detect unreachable branches. 3. Performance: in the benchmark below, the all-rules performance is increased by nearly 10%... ## Benchmarks I decided to benchmark against a large file in the Airflow repository with a lot of type annotations ([`views.py`](https://raw.githubusercontent.com/apache/airflow/f03f73100e8a7d6019249889de567cb00e71e457/airflow/www/views.py)): ``` linter/default-rules/airflow/views.py time: [10.871 ms 10.882 ms 10.894 ms] thrpt: [19.739 MiB/s 19.761 MiB/s 19.781 MiB/s] change: time: [-2.7182% -2.5687% -2.4204%] (p = 0.00 < 0.05) thrpt: [+2.4805% +2.6364% +2.7942%] Performance has improved. linter/all-rules/airflow/views.py time: [24.021 ms 24.038 ms 24.062 ms] thrpt: [8.9373 MiB/s 8.9461 MiB/s 8.9527 MiB/s] change: time: [-8.9537% -8.8516% -8.7527%] (p = 0.00 < 0.05) thrpt: [+9.5923% +9.7112% +9.8342%] Performance has improved. Found 12 outliers among 100 measurements (12.00%) 5 (5.00%) high mild 7 (7.00%) high severe ``` The impact is dramatic -- nearly a 10% improvement for `all-rules`.	2023-06-16 17:34:42 +00:00
Charlie Marsh	b3240dbfa2	Avoid propagating `BindingKind::Global` and `BindingKind::Nonlocal` (#5136 ) ## Summary This PR fixes a small quirk in the semantic model. Typically, when we see an import, like `import foo`, we create a `BindingKind::Importation` for it. However, if `foo` has been declared as a `global`, then we propagate the kind forward. So given: ```python global foo import foo ``` We'd create two bindings for `foo`, both with type `global`. This was originally borrowed from Pyflakes, and it exists to help avoid false-positives like: ```python def f(): global foo # Don't mark `foo` as "assigned but unused"! It's a global! foo = 1 ``` This PR removes that behavior, and instead tracks "Does this binding refer to a global?" as a flag. This is much cleaner, since it means we don't "lose" the identity of various bindings. As a very strange example of why this matters, consider: ```python def foo(): global Member from module import Member x: Member = 1 ``` `Member` is only used in a typing context, so we should flag it and say "move it to a `TYPE_CHECKING` block". However, when we go to analyze `from module import Member`, it has `BindingKind::Global`. So we don't even know that it's an import!	2023-06-16 11:06:59 -04:00
Charlie Marsh	fd1dfc3bfa	Add support for global and nonlocal symbol renames (#5134 ) ## Summary In #5074, we introduced an abstraction to support local symbol renames ("local" here refers to "within a module"). However, that abstraction didn't support `global` and `nonlocal` symbols. This PR extends it to those cases. Broadly, there are considerations. First, if we're renaming a symbol in a scope in which it is declared `global` or `nonlocal`. For example, given: ```python x = 1 def foo(): global x ``` Then when renaming `x` in `foo`, we need to detect that it's `global` and instead perform the rename starting from the module scope. Second, when renaming a symbol, we need to determine the scopes in which it is declared `global` or `nonlocal`. This is effectively the inverse of the above: when renaming `x` in the module scope, we need to detect that we should _also_ rename `x` in `foo`. To support these cases, the renaming algorithm was adjusted as follows: - When we start a rename in a scope, determine whether the symbol is declared `global` or `nonlocal` by looking for a `global` or `nonlocal` binding. If it is, start the rename in the defining scope. (This requires storing the defining scope on the `nonlocal` binding, which is new.) - We then perform the rename in the defining scope. - We then check whether the symbol was declared as `global` or `nonlocal` in any scopes, and perform the rename in those scopes too. (Thankfully, this doesn't need to be done recursively.) Closes #5092. ## Test Plan Added some additional snapshot tests.	2023-06-16 14:35:10 +00:00
Charlie Marsh	b9754bd5c5	Add autofix for `Set`-to-`AbstractSet` rewrite using reference tracking (#5074 ) ## Summary This PR enables autofix behavior for the `flake8-pyi` rule that asks you to alias `Set` to `AbstractSet` when importing `collections.abc.Set`. It's not the most important rule, but it's a good isolated test-case for local symbol renaming. The renaming algorithm is outlined in-detail in the `renamer.rs` module. But to demonstrate the behavior, here's the diff when running this fix over a complex file that exercises a few edge cases: ```diff --- a/foo.pyi +++ b/foo.pyi @@ -1,16 +1,16 @@ if True: - from collections.abc import Set + from collections.abc import Set as AbstractSet else: - Set = 1 + AbstractSet = 1 -x: Set = set() +x: AbstractSet = set() -x: Set +x: AbstractSet -del Set +del AbstractSet def f(): - print(Set) + print(AbstractSet) def Set(): pass ``` Making this work required resolving a bunch of edge cases in the semantic model that were causing us to "lose track" of references. For example, the above wasn't possible with our previous approach to handling deletions (#5071). Similarly, the `x: Set` "delayed annotation" tracking was enabled via #5070. And many of these edits would've failed if we hadn't changed `BindingKind` to always match the identifier range (#5090). So it's really the culmination of a bunch of changes over the course of the week. The main outstanding TODO is that this doesn't support `global` or `nonlocal` usages. I'm going to take a look at that tonight, but I'm comfortable merging this as-is. Closes #1106. Closes #5091.	2023-06-16 14:12:33 +00:00
Charlie Marsh	5526699535	Use const-singleton helpers in more rules (#5142 )	2023-06-16 04:28:35 +00:00
Charlie Marsh	5ea3e42513	Always use identifier ranges to store bindings (#5110 ) ## Summary At present, when we store a binding, we include a `TextRange` alongside it. The `TextRange` _sometimes_ matches the exact range of the identifier to which the `Binding` is linked, but... not always. For example, given: ```python x = 1 ``` The binding we create _will_ use the range of `x`, because the left-hand side is an `Expr::Name`, which has a valid range on it. However, given: ```python try: pass except ValueError as e: pass ``` When we create a binding for `e`, we don't have a `TextRange`... The AST doesn't give us one. So we end up extracting it via lexing. This PR extends that pattern to the rest of the binding kinds, to ensure that whenever we create a binding, we always use the range of the bound name. This leads to better diagnostics in cases like pattern matching, whereby the diagnostic for "unused variable `x`" here used to include `x`, instead of just `x`: ```python def f(provided: int) -> int: match provided: case [_, x]: pass ``` This is _also_ required for symbol renames, since we track writes as bindings -- so we need to know the ranges of the bound symbols. By storing these bindings precisely, we can also remove the `binding.trimmed_range` abstraction -- since bindings already use the "trimmed range". To implement this behavior, I took some of our existing utilities (like the code we had for `except ValueError as e` above), migrated them from a full lexer to a zero-allocation lexer that _only_ identifies "identifiers", and moved the behavior into a trait, so we can now do `stmt.identifier(locator)` to get the range for the identifier. Honestly, we might end up discarding much of this if we decide to put ranges on all identifiers (https://github.com/astral-sh/RustPython-Parser/pull/8). But even if we do, this will _still_ be a good change, because the lexer introduced here is useful beyond names (e.g., we use it find the `except` keyword in an exception handler, to find the `else` after a `for` loop, and so on). So, I'm fine committing this even if we end up changing our minds about the right approach. Closes #5090. ## Benchmarks No significant change, with one statistically significant improvement (-2.1654% on `linter/all-rules/large/dataset.py`): ``` linter/default-rules/numpy/globals.py time: [73.922 µs 73.955 µs 73.986 µs] thrpt: [39.882 MiB/s 39.898 MiB/s 39.916 MiB/s] change: time: [-0.5579% -0.4732% -0.3980%] (p = 0.00 < 0.05) thrpt: [+0.3996% +0.4755% +0.5611%] Change within noise threshold. Found 6 outliers among 100 measurements (6.00%) 4 (4.00%) low severe 1 (1.00%) low mild 1 (1.00%) high mild linter/default-rules/pydantic/types.py time: [1.4909 ms 1.4917 ms 1.4926 ms] thrpt: [17.087 MiB/s 17.096 MiB/s 17.106 MiB/s] change: time: [+0.2140% +0.2741% +0.3392%] (p = 0.00 < 0.05) thrpt: [-0.3380% -0.2734% -0.2136%] Change within noise threshold. Found 4 outliers among 100 measurements (4.00%) 3 (3.00%) high mild 1 (1.00%) high severe linter/default-rules/numpy/ctypeslib.py time: [688.97 µs 691.34 µs 694.15 µs] thrpt: [23.988 MiB/s 24.085 MiB/s 24.168 MiB/s] change: time: [-1.3282% -0.7298% -0.1466%] (p = 0.02 < 0.05) thrpt: [+0.1468% +0.7351% +1.3461%] Change within noise threshold. Found 15 outliers among 100 measurements (15.00%) 1 (1.00%) low mild 2 (2.00%) high mild 12 (12.00%) high severe linter/default-rules/large/dataset.py time: [3.3872 ms 3.4032 ms 3.4191 ms] thrpt: [11.899 MiB/s 11.954 MiB/s 12.011 MiB/s] change: time: [-0.6427% -0.2635% +0.0906%] (p = 0.17 > 0.05) thrpt: [-0.0905% +0.2642% +0.6469%] No change in performance detected. Found 20 outliers among 100 measurements (20.00%) 1 (1.00%) low severe 2 (2.00%) low mild 4 (4.00%) high mild 13 (13.00%) high severe linter/all-rules/numpy/globals.py time: [148.99 µs 149.21 µs 149.42 µs] thrpt: [19.748 MiB/s 19.776 MiB/s 19.805 MiB/s] change: time: [-0.7340% -0.5068% -0.2778%] (p = 0.00 < 0.05) thrpt: [+0.2785% +0.5094% +0.7395%] Change within noise threshold. Found 2 outliers among 100 measurements (2.00%) 1 (1.00%) low mild 1 (1.00%) high severe linter/all-rules/pydantic/types.py time: [3.0362 ms 3.0396 ms 3.0441 ms] thrpt: [8.3779 MiB/s 8.3903 MiB/s 8.3997 MiB/s] change: time: [-0.0957% +0.0618% +0.2125%] (p = 0.45 > 0.05) thrpt: [-0.2121% -0.0618% +0.0958%] No change in performance detected. Found 11 outliers among 100 measurements (11.00%) 1 (1.00%) low severe 3 (3.00%) low mild 5 (5.00%) high mild 2 (2.00%) high severe linter/all-rules/numpy/ctypeslib.py time: [1.6879 ms 1.6894 ms 1.6909 ms] thrpt: [9.8478 MiB/s 9.8562 MiB/s 9.8652 MiB/s] change: time: [-0.2279% -0.0888% +0.0436%] (p = 0.18 > 0.05) thrpt: [-0.0435% +0.0889% +0.2284%] No change in performance detected. Found 5 outliers among 100 measurements (5.00%) 4 (4.00%) low mild 1 (1.00%) high severe linter/all-rules/large/dataset.py time: [7.1520 ms 7.1586 ms 7.1654 ms] thrpt: [5.6777 MiB/s 5.6831 MiB/s 5.6883 MiB/s] change: time: [-2.5626% -2.1654% -1.7780%] (p = 0.00 < 0.05) thrpt: [+1.8102% +2.2133% +2.6300%] Performance has improved. Found 2 outliers among 100 measurements (2.00%) 1 (1.00%) low mild 1 (1.00%) high mild ```	2023-06-15 18:43:19 +00:00
Charlie Marsh	716cab2f19	Run `rustfmt` on nightly to clean up erroneous comments (#5106 ) ## Summary This PR runs `rustfmt` with a few nightly options as a one-time fix to catch some malformatted comments. I ended up just running with: ```toml condense_wildcard_suffixes = true edition = "2021" max_width = 100 normalize_comments = true normalize_doc_attributes = true reorder_impl_items = true unstable_features = true use_field_init_shorthand = true ``` Since these all seem like reasonable things to fix, so may as well while I'm here.	2023-06-15 00:19:05 +00:00
Charlie Marsh	56476dfd61	Use `matches!` for `CallPath` comparisons (#5099 ) ## Summary This PR consistently uses `matches! for static `CallPath` comparisons. In some cases, we can significantly reduce the number of cases or checks. ## Test Plan `cargo test `	2023-06-14 17:06:34 -04:00
Charlie Marsh	bae183b823	Rename `semantic_model` and `model` usages to `semantic` (#5097 ) ## Summary As discussed in Discord, and similar to oxc, we're going to refer to this as `.semantic()` everywhere. While I was auditing usages of `model: &SemanticModel`, I also changed as many function signatures as I could find to consistently take the model as the _last_ argument, rather than the first.	2023-06-14 15:01:51 -04:00
Charlie Marsh	65dbfd2556	Improve names and documentation on scope API (#5095 ) ## Summary Just minor improvements to improve consistency of method names and availability.	2023-06-14 18:28:55 +00:00
Charlie Marsh	86ff1febea	Re-export `ruff_python_semantic` members (#5094 ) ## Summary This PR adds a more unified public API to `ruff_python_semantic`, so that we don't need to do deeply nested imports all over the place.	2023-06-14 18:23:38 +00:00
Charlie Marsh	a33bbe6335	Track "delayed" annotations in the semantic model (#5070 ) ## Summary This PR tackles a corner case that we'll need to support local symbol renaming. It relates to a nuance in how we want handle annotations (i.e., `AnnAssign` statements with no value, like `x: int` in a function body). When we see a statement like: ```python x: int ``` We create a `BindingKind::Annotation` for `x`. This is a special `BindingKind` that the resolver isn't allowed to return. For example, given: ```python x: int print(x) ``` The second line will yield an `undefined-name` error. So why does this `BindingKind` exist at all? In Pyflakes, to support the `unused-annotation` lint: ```python def f(): x: int # unused-annotation ``` If we don't track `BindingKind::Annotation`, we can't lint for unused variables that are only "defined" via annotations. There are a few other wrinkles to `BindingKind::Annotation`. One is that, if a binding already exists in the scope, we actually just discard the `BindingKind`. So in this case: ```python x = 1 x: int ``` When we go to create the `BindingKind::Annotation` for the second statement, we notice that (1) we're creating an annotation but (2) the scope already has binding for the name -- so we just drop the binding on the floor. This has the nice property that annotations aren't considered to "shadow" another binding, which is important in a bunch of places (e.g., if we have `import os; os: int`, we still consider `os` to be an import, as we should). But it also means that these "delayed" annotations are one of the few remaining references that we don't track anywhere in the semantic model. This PR adds explicit support for these via a new `delayed_annotations` attribute on the semantic model. These should be extremely rare, but we do need to track them if we want to support local symbol renaming. ### This isn't the right way to model this This isn't the right way to model this. Here's an alternative: - Remove `BindingKind::Annotation`, and treat annotations as their own, separate concept. - Instead of storing a map from name to `BindingId` on each `Scope`, store a map from name to... `SymbolId`. - Introduce a `Symbol` abstraction, where a symbol can point to a current binding, and a list of annotations, like: ```rust pub struct Symbol { binding: Option<BindingId>, annotations: Vec<AnnotationId> } ``` If we did this, we could appropriately model the semantics described above. When we go to resolve a binding, we ignore annotations (always). When we try to find unused variables, we look through the list of symbols, and have sufficient information to discriminate between annotations and bound variables. Etc. The main downside of this `Symbol`-based approach is that it's going to take a lot more work to implement, and it'll be less performant (we'll be storing more data per symbol, and our binding lookups will have an added layer of indirection).	2023-06-14 17:54:35 +00:00
Charlie Marsh	c992cfa76e	Make some of `ruff_python_semantic` `pub(crate)` (#5093 )	2023-06-14 17:49:37 +00:00
Charlie Marsh	6f10aeebaa	Remove unused `Scope#delete` method (#5085 ) ## Summary This is now intentionally unused and is now made impossible (via this PR).	2023-06-14 14:15:14 +00:00
Charlie Marsh	c74ef77e85	Move binding accesses into `SemanticModel` method (#5084 )	2023-06-14 14:07:46 +00:00
Charlie Marsh	1e497162d1	Add a dedicated read result for unbound locals (#5083 ) ## Summary Small follow-up to #4888 to add a dedicated `ResolvedRead` case for unbound locals, mostly for clarity and documentation purposes (no behavior changes). ## Test Plan `cargo test`	2023-06-14 09:58:48 -04:00
Charlie Marsh	aa41ffcfde	Add `BindingKind` variants to represent deleted bindings (#5071 ) ## Summary Our current mechanism for handling deletions (e.g., `del x`) is to remove the symbol from the scope's `bindings` table. This "does the right thing", in that if we then reference a deleted symbol, we're able to determine that it's unbound -- but it causes a variety of problems, mostly in that it makes certain bindings and references unreachable after-the-fact. Consider: ```python x = 1 print(x) del x ``` If we analyze this code _after_ running the semantic model over the AST, we'll have no way of knowing that `x` was ever introduced in the scope, much less that it was bound to a value, read, and then deleted -- because we effectively erased `x` from the model entirely when we hit the deletion. In practice, this will make it impossible for us to support local symbol renames. It also means that certain rules that we want to move out of the model-building phase and into the "check dead scopes" phase wouldn't work today, since we'll have lost important information about the source code. This PR introduces two new `BindingKind` variants to model deletions: - `BindingKind::Deletion`, which represents `x = 1; del x`. - `BindingKind::UnboundException`, which represents: ```python try: 1 / 0 except Exception as e: pass ``` In the latter case, `e` gets unbound after the exception handler (assuming it's triggered), so we want to handle it similarly to a deletion. The main challenge here is auditing all of our existing `Binding` and `Scope` usages to understand whether they need to accommodate deletions or otherwise behave differently. If you look one commit back on this branch, you'll see that the code is littered with `NOTE(charlie)` comments that describe the reasoning behind changing (or not) each of those call sites. I've also augmented our test suite in preparation for this change over a few prior PRs. ### Alternatives As an alternative, I considered introducing a flag to `BindingFlags`, like `BindingFlags::UNBOUND`, and setting that at the appropriate time. This turned out to be a much more difficult change, because we tend to match on `BindingKind` all over the place (e.g., we have a bunch of code blocks that only run when a `BindingKind` is `BindingKind::Importation`). As a result, introducing these new `BindingKind` variants requires only a few changes at the client sites. Adding a flag would've required a much wider-reaching change.	2023-06-14 09:27:24 -04:00
Charlie Marsh	1895011ac2	Document some attributes on the semantic model (#5064 )	2023-06-13 20:45:24 +00:00
Charlie Marsh	364bd82aee	Don't treat annotations as resolved in forward references (#5060 ) ## Summary This behavior dates back to a Pyflakes commit (5fc37cbd), which was used to allow this test to pass: ```py from __future__ import annotations T: object def f(t: T): pass def g(t: 'T'): pass ``` But, I think this is an error. Mypy and Pyright don't accept it -- you can only use variables as type annotations if they're type aliases (i.e., annotated with `TypeAlias`), in which case, there has to be an assignment on the right-hand side (see: [PEP 613](https://peps.python.org/pep-0613/)).	2023-06-13 14:47:29 -04:00
Charlie Marsh	19f972a305	Use `Scope#has` in lieu of `Scope#get` (#5051 ) ## Summary These usages don't actually need the `BindingId`.	2023-06-13 15:59:53 +00:00
Charlie Marsh	68b6d30c46	Use consistent `Cargo.toml` metadata in all crates (#5015 )	2023-06-12 00:02:40 +00:00
Charlie Marsh	e86f12a1ec	Rename some methods on `SemanticModel` (#4990 )	2023-06-09 19:36:59 +00:00
Charlie Marsh	5c502a3320	Add documentation for `BindingKind` variants (#4989 )	2023-06-09 18:32:50 +00:00
Davide Canton	63fdcea29e	Handled dict and set inside f-string (#4249 ) (#4563 )	2023-06-09 04:53:13 +00:00
Micha Reiser	39a1f3980f	Upgrade RustPython (#4900 )	2023-06-08 05:53:14 +00:00
Charlie Marsh	ae75b303f0	Avoid attributing runtime references to module-level imports (#4942 )	2023-06-07 21:56:03 +00:00
Charlie Marsh	780d153ae8	Replace one-off locals property with `ScopeFlags` (#4912 )	2023-06-06 21:22:21 -04:00
Charlie Marsh	8c048b463c	Track symbol deletions separately from bindings (#4888 )	2023-06-06 18:49:36 +00:00
Charlie Marsh	7b0fb1a3b4	Respect noqa directives on `ImportFrom` parents for type-checking rules (#4889 )	2023-06-06 02:37:07 +00:00
Charlie Marsh	8938b2d555	Use `qualified_name` terminology in more structs for consistency (#4873 )	2023-06-05 19:06:48 +00:00
Charlie Marsh	d31eb87877	Extract shared simple AST node inference utility (#4871 )	2023-06-05 18:23:37 +00:00
Charlie Marsh	a0721912a4	Invert structure of Scope#shadowed_bindings (#4855 )	2023-06-05 02:03:21 +00:00
Charlie Marsh	466719247b	Invert parent-shadowed bindings map (#4847 )	2023-06-04 00:18:46 -04:00
Charlie Marsh	3fa4440d87	Modify semantic model API to push bindings upon creation (#4846 )	2023-06-04 02:28:25 +00:00
Charlie Marsh	c14896b42c	Move `Binding` initialization into `SemanticModel` (#4819 )	2023-06-03 15:26:55 -04:00
Charlie Marsh	935094c2ff	Move import-name matching into methods on `BindingKind` (#4818 )	2023-06-03 15:01:27 -04:00
Charlie Marsh	26b1dd0ca2	Remove `name` field from import binding kinds (#4817 )	2023-06-02 23:02:47 -04:00
Charlie Marsh	211d8e170d	Ignore error calls with `exc_info` in TRY400 (#4797 )	2023-06-02 04:59:45 +00:00
Charlie Marsh	ea3cbcc362	Avoid enforcing native-literals rule within nested f-strings (#4488 )	2023-06-02 04:00:31 +00:00
qdegraaf	fcbf5c3fae	Add PYI034 for `flake8-pyi` plugin (#4764 )	2023-06-02 02:15:57 +00:00
Jonathan Plasse	edadd7814f	Add `pyflakes.extend-generics` setting (#4677 )	2023-06-01 22:19:37 +00:00
Charlie Marsh	ea31229be0	Track `TYPE_CHECKING` blocks in `Importer` (#4593 )	2023-05-30 16:18:10 +00:00
Charlie Marsh	80fa3f2bfa	Add a convenience method to check if a name is bound (#4718 )	2023-05-30 01:52:41 +00:00
Aarni Koskela	0106bce02f	[`flake8-future-annotations`] Implement `FA102` (#4702 )	2023-05-29 22:41:45 +00:00
Charlie Marsh	9741f788c7	Remove globals table from `Scope` (#4686 )	2023-05-27 22:35:20 -04:00
Charlie Marsh	af433ac14d	Avoid using typing-imported symbols for runtime edits (#4649 )	2023-05-26 20:36:37 -04:00
qdegraaf	ccca11839a	Allow more immutable funcs for RUF009 (#4660 )	2023-05-26 15:18:52 -04:00
Charlie Marsh	0f610f2cf7	Remove dedicated ScopeKind structs in favor of AST nodes (#4648 )	2023-05-25 19:31:02 +00:00
Charlie Marsh	f0e173d9fd	Use `BindingId` copies in lieu of `&BindingId` in semantic model methods (#4633 )	2023-05-24 15:55:45 +00:00
Charlie Marsh	fcdc7bdd33	Remove separate `ReferenceContext` enum (#4631 )	2023-05-24 15:12:38 +00:00
Charlie Marsh	5cedf0f724	Remove `ReferenceContext::Synthetic` (#4612 )	2023-05-24 14:30:35 +00:00
Charlie Marsh	8961d8eb6f	Track all read references in semantic model (#4610 )	2023-05-24 14:14:27 +00:00
Charlie Marsh	ba4c0a21fa	Rename `ContextFlags` to `SemanticModelFlags` (#4611 )	2023-05-23 17:47:07 -04:00
Micha Reiser	652c644c2a	Introduce `ruff_index` crate (#4597 )	2023-05-23 17:40:35 +02:00
Charlie Marsh	74effb40b9	Rename `index` to `binding_id` in a few iterators (#4594 )	2023-05-23 03:56:00 +00:00
Micha Reiser	daadd24bde	Include decorators in `Function` and `Class` definition ranges (#4467 )	2023-05-22 17:50:42 +02:00
Micha Reiser	cbe344f4d5	Rename `Checker::model` to `semantic_model` (#4573 )	2023-05-22 15:14:30 +02:00
Charlie Marsh	d70f899f71	Use `SemanticModel` in lieu of `Checker` in more methods (#4568 )	2023-05-22 02:58:47 +00:00
Charlie Marsh	19c4b7bee6	Rename ruff_python_semantic's `Context` struct to `SemanticModel` (#4565 )	2023-05-22 02:35:03 +00:00
Charlie Marsh	fe7f2e2e4d	Move submodule alias resolution into `Context` (#4543 )	2023-05-20 16:34:10 +00:00
Charlie Marsh	6aa9900c03	Improve handling of `__qualname__`, `__module__`, and `__class__` (#4512 )	2023-05-20 03:03:45 +00:00
Charlie Marsh	9e21414294	Improve reference resolution for deferred-annotations-within-classes (#4509 )	2023-05-20 02:54:18 +00:00
Charlie Marsh	bb4e674415	Move reference-resolution into Context (#4510 )	2023-05-20 02:47:15 +00:00
Charlie Marsh	73efbeb581	Invert quote-style when generating code within f-strings (#4487 )	2023-05-18 14:33:33 +00:00
Micha Reiser	fa26860296	Refactor range from `Attributed` to `Node`s (#4422 )	2023-05-16 06:36:32 +00:00
Charlie Marsh	2414469ac3	Enable automatic rewrites of `typing.Deque` and `typing.DefaultDict` (#4420 )	2023-05-15 22:33:24 +00:00
Tyler Yep	01b372a75c	Implement `flake8-future-annotations` FA100 (#3979 )	2023-05-14 03:00:06 +00:00
Charlie Marsh	7e3ba7f32a	Use `bitflags` for tracking `Context` flags (#4381 )	2023-05-12 16:07:26 +00:00
Charlie Marsh	9158f13ee6	Respect `__all__` imports when determining definition visibility (#4357 )	2023-05-11 17:43:51 +00:00
Charlie Marsh	72e0ffc1ac	Delay computation of `Definition` visibility (#4339 )	2023-05-11 17:14:29 +00:00
Jeong, YunWon	be6e00ef6e	Re-integrate RustPython parser repository (#4359 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-05-11 07:47:17 +00:00
Charlie Marsh	f4f88308ae	Remove `Copy` and destructure `Snapshot` (#4358 )	2023-05-10 19:46:18 +00:00
Charlie Marsh	ea3d3a655d	Add a `Snapshot` abstraction for deferring and restoring visitor context (#4353 )	2023-05-10 16:50:47 +00:00
Charlie Marsh	6d6d7abf70	Use short-import for `HashMap` (#4351 )	2023-05-10 15:46:55 +00:00
Charlie Marsh	7b91a162c6	Remove `current_` prefix from some Context methods (#4325 )	2023-05-09 19:40:12 +00:00
Charlie Marsh	d365dab904	Include static and class methods in in abstract decorator list (#4298 )	2023-05-08 21:54:02 -04:00
Charlie Marsh	539af34f58	Add a utility method to detect top-level state (#4259 )	2023-05-06 20:24:27 +00:00
Charlie Marsh	983bb31577	Remove `RefEquality` usages from `Context` (#4257 )	2023-05-06 15:55:14 -04:00
Charlie Marsh	cd27b39aff	Re-order some code in scope.rs (#4255 )	2023-05-06 16:36:20 +00:00
Charlie Marsh	a9fc648faf	Use `NodeId` for `Binding` source (#4234 )	2023-05-06 16:20:08 +00:00
Charlie Marsh	c1f0661225	Replace `parents` statement stack with a `Nodes` abstraction (#4233 )	2023-05-06 16:12:41 +00:00
Dhruv Manilawala	2c91412321	Consider Flask app logger as logger candidate (#4253 )	2023-05-06 11:31:10 -04:00
Charlie Marsh	64b7280eb8	Respect parent-scoping rules for `NamedExpr` assignments (#4145 )	2023-04-29 22:45:30 +00:00
Charlie Marsh	2115d99c43	Remove `ScopeStack` in favor of child-parent `ScopeId` pointers (#4138 )	2023-04-29 18:23:51 -04:00
Moritz Sauter	ee6d8f7467	Add bugbear immutable functions as allowed in dataclasses (#4122 )	2023-04-27 21:23:06 -04:00
Micha Reiser	cab65b25da	Replace row/column based `Location` with byte-offsets. (#3931 )	2023-04-26 18:11:02 +00:00
Jonathan Plasse	5e91211e6d	Add `in_boolean_test` to `Context` (#4072 )	2023-04-23 23:18:23 -06:00
Micha Reiser	ba4f4f4672	Upgrade dependencies (#4064 )	2023-04-22 18:04:01 +01:00
Dhruv Manilawala	ba98149022	Avoid `RUF008` if field annotation is immutable (#4039 )	2023-04-20 16:02:12 -04:00
Charlie Marsh	10d5415bcb	Ignore certain flake8-pyi errors within function bodies (#4029 )	2023-04-19 15:10:29 -04:00
Charlie Marsh	7fa1da20fb	Support relative imports in `banned-api` enforcement (#4025 )	2023-04-19 14:30:13 -04:00
Dhruv Manilawala	b6155232ac	Consider logger candidate from `logging` module only (#3878 )	2023-04-04 19:52:57 +00:00
Charlie Marsh	d919adc13c	Introduce a `ruff_python_semantic` crate (#3865 )	2023-04-04 16:50:47 +00:00

... 3 4 5 6 7 ...

377 Commits