Python/ruff

Commit Graph

Author	SHA1	Message	Date
David Peter	54769ac9f9	[ty] While loop modeling cleanup (#18994 ) ## Summary I found the previous code here very confusing, and it also did some unnecessary work. Hopefully this is a bit easier to understand.	2025-06-30 11:38:25 +02:00
Micha Reiser	29927f2b59	Update Rust toolchain to 1.88 and MSRV to 1.86 (#19011 )	2025-06-28 20:24:00 +02:00
Matthew Mckee	a3c79d8170	[ty] Don't add incorrect subdiagnostic for unresolved reference (#18487 ) Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com> Co-authored-by: Carl Meyer <carl@astral.sh>	2025-06-27 12:40:33 +00:00
Ibraheem Ahmed	6f7b1c9bb3	[ty] Add environment variable to dump Salsa memory usage stats (#18928 )

## Summary

Setting `TY_MEMORY_REPORT=full` will generate and print a memory usage report to the CLI after a `ty check` run:

```
=======SALSA STRUCTS=======
`Definition` metadata=7.24MB fields=17.38MB count=181062
`Expression` metadata=4.45MB fields=5.94MB count=92804
`member_lookup_with_policy_::interned_arguments` metadata=1.97MB fields=2.25MB count=35176
...
=======SALSA QUERIES=======
`File -> ty_python_semantic::semantic_index::SemanticIndex` metadata=11.46MB fields=88.86MB count=1638
`Definition -> ty_python_semantic::types::infer::TypeInference` metadata=24.52MB fields=86.68MB count=146018
`File -> ruff_db::parsed::ParsedModule` metadata=0.12MB fields=69.06MB count=1642
...
=======SALSA SUMMARY=======
TOTAL MEMORY USAGE: 577.61MB
struct metadata = 29.00MB
struct fields = 35.68MB
memo metadata = 103.87MB
memo fields = 409.06MB
```

Eventually, we should integrate these numbers into CI in some form. The one limitation currently is that heap allocations in salsa structs (e.g. interned values) are not tracked, but memoized values should have full coverage. We may also want a peak memory usage counter (that accounts for non-salsa memory), but that is relatively simple to profile manually (e.g. `time -v ty check`) and would require a compile-time option to avoid runtime overhead.	2025-06-26 21:27:51 +00:00
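To make the report format in the commit above easier to work with, here is a minimal sketch of a parser for its per-struct and per-query lines. The helper name `parse_entries` and the regular expression are assumptions inferred from the sample output only; they are not part of ty, and the real report may contain lines this pattern does not cover.

```python
import re

# Hypothetical helper: the line shape is inferred from the sample report only.
ENTRY = re.compile(
    r"`(?P<name>[^`]+)`\s+metadata=(?P<metadata>[\d.]+)MB\s+"
    r"fields=(?P<fields>[\d.]+)MB\s+count=(?P<count>\d+)"
)


def parse_entries(report: str) -> list[dict]:
    """Return one dict per struct/query line found in the report text."""
    entries = []
    for match in ENTRY.finditer(report):
        entries.append(
            {
                "name": match["name"],
                "metadata_mb": float(match["metadata"]),
                "fields_mb": float(match["fields"]),
                "count": int(match["count"]),
            }
        )
    return entries


# Three lines copied from the sample report in the commit message.
SAMPLE = """\
`Definition` metadata=7.24MB fields=17.38MB count=181062
`Expression` metadata=4.45MB fields=5.94MB count=92804
`File -> ruff_db::parsed::ParsedModule` metadata=0.12MB fields=69.06MB count=1642
"""

entries = parse_entries(SAMPLE)
# Entry with the largest combined metadata + fields footprint.
largest = max(entries, key=lambda e: e["metadata_mb"] + e["fields_mb"])

# The four summary components quoted in the sample, added together.
summary_total = 29.00 + 35.68 + 103.87 + 409.06
```

In the quoted sample, the four summary components add up to the reported total of 577.61MB, which suggests the total covers exactly the tracked struct and memo categories.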
David Peter	b01003f81d	[ty] Infer nonlocal types as unions of all reachable bindings (#18750 )

## Summary

This PR includes a behavioral change to how we infer types for public uses of symbols within a module. Where we would previously use the type that a use at the end of the scope would see, we now consider all reachable bindings and union the results:

```py
x = None

def f():
    reveal_type(x)  # previously `Unknown | Literal[1]`, now `Unknown | None | Literal[1]`

f()

x = 1

f()
```

This helps especially in cases where the end of the scope is not reachable:

```py
def outer(x: int):
    def inner():
        reveal_type(x)  # previously `Unknown`, now `int`

    raise ValueError
```

This PR also proposes to skip the boundness analysis of public uses. This is consistent with the "all reachable bindings" strategy, because the implicit `x = <unbound>` binding is also always reachable, and we would have to emit "possibly-unresolved" diagnostics for every public use otherwise. Changing this behavior allows common use-cases like the following to type check without any errors:

```py
def outer(flag: bool):
    if flag:
        x = 1

    def inner():
        print(x)  # previously: possibly-unresolved-reference, now: no error
```

closes https://github.com/astral-sh/ty/issues/210
closes https://github.com/astral-sh/ty/issues/607
closes https://github.com/astral-sh/ty/issues/699

## Follow up

It is now possible to resolve the following TODO, but I would like to do that as a follow-up, because it requires some changes to how we treat implicit attribute assignments, which could result in ecosystem changes that I'd like to see separately.

`315fb0f3da/crates/ty_python_semantic/src/semantic_index/builder.rs (L1095-L1117)`

## Ecosystem analysis

[Full report](https://shark.fish/diff-public-types.html)

* This change obviously removes a lot of `possibly-unresolved-reference` diagnostics (7818) because we do not analyze boundness for public uses of symbols inside modules anymore.
* As the primary goal here, this change also removes a lot of false-positive `unresolved-reference` diagnostics (231) in scenarios like this:

  ```py
  def _(flag: bool):
      if flag:
          x = 1

      def inner():
          x

      raise
  ```

* This change also introduces some new false positives for cases like:

  ```py
  def _():
      x = None

      x = "test"

      def inner():
          x.upper()  # Attribute `upper` on type `Unknown | None | Literal["test"]` is possibly unbound
  ```

  We have test cases for these situations and it's plausible that we can improve this in a follow-up.

## Test Plan

New Markdown tests	2025-06-26 12:24:40 +02:00
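The motivation for taking the union of all reachable bindings can be seen at runtime, without any type checker involved. The following plain-Python sketch mirrors the first example from the commit message (with `reveal_type` replaced by a `return`); it only illustrates the runtime behavior and says nothing about how ty implements the inference.

```python
# Runtime illustration only: plain Python, no type checker involved.
x = None


def f():
    # Which binding of `x` is observed depends on when `f` is called.
    return x


seen = [f()]  # at this point only the `x = None` binding has run

x = 1

seen.append(f())  # now the later `x = 1` binding is visible

print(seen)
```

Because `f` can be called between the two assignments, a sound type for `x` inside `f` has to include both `None` and `Literal[1]`, which matches the new inferred union.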
David Peter	689797a984	[ty] Type narrowing in comprehensions (#18934 )

## Summary

Add type narrowing inside comprehensions:

```py
def _(xs: list[int | None]):
    [reveal_type(x) for x in xs if x is not None]  # revealed: int
```

closes https://github.com/astral-sh/ty/issues/680

## Test Plan

* New Markdown tests
* Made sure the example from https://github.com/astral-sh/ty/issues/680 now checks without errors
* Made sure that all removed ecosystem diagnostics were actually false positives	2025-06-25 11:30:28 +02:00
Micha Reiser	f544026b81	[ty] Use `HashTable` in `PlaceTable` (#18819 )	2025-06-20 15:31:54 +02:00
Shunsuke Shibayama	342b2665db	[ty] basic narrowing on attribute and subscript expressions (#17643 ) ## Summary This PR closes astral-sh/ty#164. This PR introduces a basic type narrowing mechanism for attribute/subscript expressions. Member accesses, int literal subscripts, string literal subscripts are supported (same as mypy and pyright). ## Test Plan New test cases are added to `mdtest/narrow/complex_target.md`. --------- Co-authored-by: David Peter <mail@david-peter.de>	2025-06-17 11:07:46 +02:00
David Peter	3a77768f79	[ty] Reachability constraints (#18621 )

## Summary

* Completely removes the concept of visibility constraints. Reachability constraints are now used to model the static visibility of bindings and declarations. Reachability constraints are much easier to reason about / work with, since they are applied at the beginning of a branch, and not applied retroactively. Removing the duplication between visibility and reachability constraints also leads to major code simplifications [^1]. For an overview of how the new constraint system works, see the updated doc comment in `reachability_constraints.rs`.
* Fixes a [control-flow modeling bug (panic)](https://github.com/astral-sh/ty/issues/365) involving `break` statements in loops
* Fixes a [bug](https://github.com/astral-sh/ty/issues/624) where `elif` branches would have wrong reachability constraints
* Fixes a [bug](https://github.com/astral-sh/ty/issues/648) where code after infinite loops would not be considered unreachable
* Fixes a panic on the `pywin32` ecosystem project, which we should be able to move to `good.txt` once this has been merged.
* Removes some false positives in unreachable code because we infer `Never` more often, due to the fact that reachability constraints now apply retroactively to all active bindings, not just to bindings inside a branch.
  * As one example, this removes the `division-by-zero` diagnostic from https://github.com/astral-sh/ty/issues/443 because we now infer `Never` for the divisor.
* Supersedes and includes similar test changes as https://github.com/astral-sh/ruff/pull/18392

closes https://github.com/astral-sh/ty/issues/365
closes https://github.com/astral-sh/ty/issues/624
closes https://github.com/astral-sh/ty/issues/642
closes https://github.com/astral-sh/ty/issues/648

## Benchmarks

Benchmarks on black, pandas, and sympy showed that this is neither a performance improvement, nor a regression.

## Test Plan

Regression tests for:

- [x] https://github.com/astral-sh/ty/issues/365
- [x] https://github.com/astral-sh/ty/issues/624
- [x] https://github.com/astral-sh/ty/issues/642
- [x] https://github.com/astral-sh/ty/issues/648

[^1]: I'm afraid this is something that @carljm advocated for since the beginning, and I'm not sure anymore why we have never seriously tried this before. So I suggest we do not attempt to do a historical deep dive to find out exactly why this ever became so complicated, and just enjoy the fact that we eventually arrived here.

---------

Co-authored-by: Carl Meyer <carl@astral.sh>	2025-06-17 09:24:28 +02:00
Alex Waygood	2b731d19b9	[ty] Fix panic when attempting to provide autocompletions for an instance of a class that assigns attributes to `self[0]` (#18707 )	2025-06-16 21:58:05 +00:00
David Peter	89d915a1e3	[ty] Delay computation of 'unbound' visibility for implicit instance attributes (#18669 )

## Summary

Consider the following example, which leads to an excessively large runtime on `main`. The reason for this is the following. When inferring types for `self.a`, we look up the `a` attribute on `C`. While looking for implicit instance attributes, we go through every method and check for `self.a = …` assignments. There are no such assignments here, but we always have an implicit `self.a = <unbound>` binding at the beginning of every method. This binding accumulates a complex visibility constraint in `C.f`, due to the `isinstance` checks. While evaluating that constraint, we need to infer the type of `self.b`. There's no binding for `self.b` either, but there's also an implicit `self.b = <unbound>` binding with the same complex visibility constraint (involving `self.b` recursively). This leads to a combinatorial explosion:

```py
class C:
    def f(self: "C"):
        if isinstance(self.a, str):
            return

        if isinstance(self.b, str):
            return
        if isinstance(self.b, str):
            return
        if isinstance(self.b, str):
            return
        # repeat 20 times
```

(note that the `self` parameter here is annotated explicitly because we currently still infer `Unknown` for `self` otherwise)

The fix proposed here is rather simple: when there are no `self.name = …` attribute assignments in a given method, we skip evaluating the visibility constraint of the implicit `self.name = <unbound>` binding. This should also generally help with performance, because that's a very common case.

This is not a fix for cases where there are actual bindings in the method. When we add `self.a = 1; self.b = 1` to that example above, we still see that combinatorial explosion of runtime. I still think it's worth making this optimization, as it fixes the problems with `pandas` and `sqlalchemy` reported by users. I will open a ticket to track that separately.

closes https://github.com/astral-sh/ty/issues/627
closes https://github.com/astral-sh/ty/issues/641

## Test Plan

* Made sure that `ty` finishes quickly on the MREs in https://github.com/astral-sh/ty/issues/627
* Made sure that `ty` finishes quickly on `pandas`
* Made sure that `ty` finishes quickly on `sqlalchemy`	2025-06-13 12:50:57 -07:00
Ibraheem Ahmed	c9dff5c7d5	[ty] AST garbage collection (#18482 ) ## Summary Garbage collect ASTs once we are done checking a given file. Queries with a cross-file dependency on the AST will reparse the file on demand. This reduces ty's peak memory usage by ~20-30%. The primary change of this PR is adding a `node_index` field to every AST node, that is assigned by the parser. `ParsedModule` can use this to create a flat index of AST nodes any time the file is parsed (or reparsed). This allows `AstNodeRef` to simply index into the current instance of the `ParsedModule`, instead of storing a pointer directly. The indices are somewhat hackily (using an atomic integer) assigned by the `parsed_module` query instead of by the parser directly. Assigning the indices in source-order in the (recursive) parser turns out to be difficult, and collecting the nodes during semantic indexing is impossible as `SemanticIndex` does not hold onto a specific `ParsedModuleRef`, which the pointers in the flat AST are tied to. This means that we have to do an extra AST traversal to assign and collect the nodes into a flat index, but the small performance impact (~3% on cold runs) seems worth it for the memory savings. Part of https://github.com/astral-sh/ty/issues/214.	2025-06-13 08:40:11 -04:00
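The flat-index idea described in the commit above can be sketched in a few lines of Python using the standard `ast` module. Everything here (`ParsedModule`, `Database`, `NodeRef`, `collect`) is a hypothetical stand-in that borrows names from the commit message for readability; it is not ty's Rust implementation, and it uses the traversal order of `ast.walk` in place of a parser-assigned `node_index`.

```python
import ast


class ParsedModule:
    """Hypothetical stand-in: one parse of a file plus a flat node index."""

    def __init__(self, source: str):
        tree = ast.parse(source)
        # Extra traversal that numbers the nodes. `ast.walk` visits nodes in a
        # deterministic order, so reparsing the same text yields the same indices.
        self.nodes = list(ast.walk(tree))


class Database:
    """Hypothetical cache that can drop a parsed module and reparse on demand."""

    def __init__(self, sources: dict[str, str]):
        self.sources = sources
        self._parsed: dict[str, ParsedModule] = {}
        self.parse_count = 0

    def parsed_module(self, path: str) -> ParsedModule:
        if path not in self._parsed:
            self._parsed[path] = ParsedModule(self.sources[path])
            self.parse_count += 1
        return self._parsed[path]

    def collect(self, path: str) -> None:
        # "Garbage collect" the AST; later lookups trigger a reparse.
        self._parsed.pop(path, None)


class NodeRef:
    """Stores (path, index) instead of a direct reference to the node."""

    def __init__(self, path: str, index: int):
        self.path = path
        self.index = index

    def node(self, db: Database) -> ast.AST:
        return db.parsed_module(self.path).nodes[self.index]


db = Database({"a.py": "x = 1\ny = x + 2\n"})
module = db.parsed_module("a.py")
index = next(i for i, n in enumerate(module.nodes) if isinstance(n, ast.BinOp))
ref = NodeRef("a.py", index)

db.collect("a.py")       # the AST is dropped after "checking" the file
resolved = ref.node(db)  # the reference still resolves, via a reparse
```

The design trade-off matches the one the commit describes: references stay valid across collection because they hold an index rather than a pointer, at the cost of an extra traversal per parse.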
Shunsuke Shibayama	ef564094a9	[ty] support del statement and deletion of except handler names (#18593 ) ## Summary This PR closes https://github.com/astral-sh/ty/issues/238. Since `DefinitionState::Deleted` was introduced in #18041, support for the `del` statement (and deletion of except handler names) is straightforward. However, it is difficult to determine whether references to attributes or subscripts are unresolved after they are deleted. This PR only invalidates narrowing by assignment if the attribute or subscript is deleted. ## Test Plan `mdtest/del.md` is added. --------- Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2025-06-12 07:44:42 -07:00
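For readers less familiar with the runtime semantics that this commit models, the following plain-Python sketch shows both cases: a name removed by `del`, and an except handler name that Python deletes implicitly at the end of the `except` block. The function names are made up for illustration.

```python
def after_del():
    value = 1
    del value
    try:
        return value  # `value` is a local name that is no longer bound
    except UnboundLocalError:
        return "unbound after del"


def after_except():
    try:
        raise ValueError("boom")
    except ValueError as err:
        pass
    # Python implicitly deletes `err` when the except block ends.
    try:
        return err
    except UnboundLocalError:
        return "unbound after except"


del_result = after_del()
except_result = after_except()
```

Both functions hit the `UnboundLocalError` branch, which is the runtime counterpart of the unresolved-reference situation a type checker has to model after a deletion.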
Ibraheem Ahmed	8531f4b3ca	[ty] Add infrastructure for AST garbage collection (#18445 )

## Summary

https://github.com/astral-sh/ty/issues/214 will require a couple of invasive changes that I would like to get merged even before garbage collection is fully implemented (to avoid rebasing):

- `ParsedModule` can no longer be dereferenced directly. Instead you need to load a `ParsedModuleRef` to access the AST, which requires a reference to the salsa database (as it may require re-parsing the AST if it was collected).
- `AstNodeRef` can only be dereferenced with the `node` method, which takes a reference to the `ParsedModuleRef`. This allows us to encode the fact that ASTs do not live as long as the database and may be collected as soon as a given instance of a `ParsedModuleRef` is dropped. There are a number of places where we currently merge the `'db` and `'ast` lifetimes, so this requires giving some types/functions two separate lifetime parameters.	2025-06-05 11:43:18 -04:00
Shunsuke Shibayama	0858896bc4	[ty] type narrowing by attribute/subscript assignments (#18041 )

## Summary

This PR partially solves https://github.com/astral-sh/ty/issues/164 (derived from #17643).

Currently, the definitions we manage are limited to those for simple name (symbol) targets, but we expand this to track definitions for attribute and subscript targets as well.

This was originally planned as part of the work in #17643, but the changes are significant, so I made it a separate PR. After merging this PR, I will reflect these changes in #17643.

There is still some incomplete work remaining, but the basic features have been implemented, so I am publishing it as a draft PR. Here is the TODO list (there may be more to come):

* [x] Complete rewrite and refactoring of documentation (removing `Symbol` and replacing it with `Place`)
* [x] More thorough testing
* [x] Consolidation of duplicated code (maybe we can consolidate the handling related to name, attribute, and subscript)

This PR replaces the current `Symbol` API with the `Place` API, which is a concept that includes attributes and subscripts (the term is borrowed from Rust).

## Test Plan

`mdtest/narrow/assignment.md` is added.

---------

Co-authored-by: David Peter <sharkdp@users.noreply.github.com>
Co-authored-by: Carl Meyer <carl@astral.sh>	2025-06-04 17:24:27 -07:00
Dylan	9bbf4987e8	Implement template strings (#17851 ) This PR implements template strings (t-strings) in the parser and formatter for Ruff. Minimal changes necessary to compile were made in other parts of the code (e.g. ty, the linter, etc.). These will be covered properly in follow-up PRs.	2025-05-30 15:00:56 -05:00
Alex Waygood	41463396cf	[ty] Add a subdiagnostic if `invalid-return-type` is emitted on a method with an empty body on a non-protocol subclass of a protocol class (#18243 )	2025-05-21 17:38:07 +00:00
Max Mynter	02fd48132c	[ty] Don't warn `yield` not in function when `yield` is in function (#18008 )	2025-05-21 18:16:25 +02:00
Micha Reiser	76ab77fe01	[ty] Support `import <namespace>` and `from <namespace> import module` (#18137 )	2025-05-21 07:28:33 +00:00
Carl Meyer	2abcd86c57	Revert "[ty] Better control flow for boolean expressions that are inside if (#18010 )" (#18150 ) This reverts commit `9910ec700c`. ## Summary This change introduced a serious performance regression. Revert it while we investigate. Fixes https://github.com/astral-sh/ty/issues/431 ## Test Plan Timing on the snippet in https://github.com/astral-sh/ty/issues/431 again shows times similar to before the regression.	2025-05-17 08:27:32 -04:00
Alex Waygood	28fb802467	[ty] Merge `SemanticIndexBuilder` impl blocks (#18135 ) ## Summary just a minor nit followup to https://github.com/astral-sh/ruff/pull/18010 -- put all the non-`Visitor` methods of `SemanticIndexBuilder` in the same impl block rather than having multiple impl blocks ## Test Plan `cargo build`	2025-05-16 11:05:02 -04:00
TomerBin	9910ec700c	[ty] Better control flow for boolean expressions that are inside if (#18010 )

## Summary

With this PR we now detect that `x` is always defined in `use`:

```py
if flag and (x := number):
    use(x)
```

When the expression is outside an `if`, `x` is still detected as possibly not defined:

```py
flag and (x := number)
# error: [possibly-unresolved-reference]
use(x)
```

In order to achieve that, I had to find a way to get access to the flow snapshots of the boolean expression when analyzing the flow of the `if` statement. I did it by special-casing the visitor of boolean expressions to return flow control information, exporting two snapshots: `maybe_short_circuit` and `no_short_circuit`. When indexing a boolean expression itself we must assume all possible flows, but when it's inside an `if` statement, we can be smarter than that.

## Test Plan

Fixed existing and added new mdtests. I went through some of the mypy primer results and they look fine.

---------

Co-authored-by: Carl Meyer <carl@astral.sh>	2025-05-16 11:59:21 +00:00
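The difference between the two snippets in the commit above comes from short-circuit evaluation, which can be observed directly at runtime. The sketch below uses made-up function names and plain Python; it illustrates why one case is safe and the other is not, and does not show how ty tracks the flow snapshots.

```python
def inside_if(flag, number):
    if flag and (x := number):
        # Reaching this branch means the walrus assignment has run.
        return x
    return None


def outside_if(flag, number):
    flag and (x := number)  # short-circuits when `flag` is falsy
    try:
        return x
    except UnboundLocalError:
        return "x is unbound"


results = (
    inside_if(True, 5),
    inside_if(False, 5),
    outside_if(True, 5),
    outside_if(False, 5),
)
```

Only `outside_if(False, 5)` reaches a use of `x` without a prior assignment, which is exactly the case that still deserves a `possibly-unresolved-reference` diagnostic.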
Micha Reiser	9ae698fe30	Switch to Rust 2024 edition (#18129 )	2025-05-16 13:25:28 +02:00
Micha Reiser	e7f97a3e4b	[ty] Reduce log level of 'symbol .. (via star import) not found' log message (#18087 )	2025-05-14 09:20:23 +02:00
omahs	882a1a702e	Fix typos (#17988 ) Fix typos --------- Co-authored-by: Brent Westbrook <36778786+ntBre@users.noreply.github.com> Co-authored-by: Brent Westbrook <brentrwestbrook@gmail.com>	2025-05-09 14:57:14 -04:00
Micha Reiser	6cd8a49638	[ty] Update salsa (#17964 )	2025-05-09 11:54:07 +02:00
Alex Waygood	f51f1f7153	[ty] Support extending `__all__` from an imported module even when the module is not an `ExprName` node (#17947 )	2025-05-08 23:54:19 +01:00
Brent Westbrook	57bf7dfbd9	[ty] Implement `global` handling and `load-before-global-declaration` syntax error (#17637 ) Summary -- This PR resolves both the typing-related and syntax error TODOs added in #17563 by tracking a set of `global` bindings for each scope. As discussed below, we avoid the additional AST traversal from ruff by collecting `Name`s from `global` statements while building the semantic index and emit a syntax error if the `Name` is already bound in the current scope at the point of the `global` statement. This has the downside of separating the error from the `SemanticSyntaxChecker`, but I plan to explore using this approach in the `SemanticSyntaxChecker` itself as a follow-up. It seems like this may be a better approach for ruff as well. Test Plan -- Updated all of the related mdtests to remove the TODOs (and add quotes I forgot on the messages). There is one remaining TODO, but it requires `nonlocal` support, which isn't even incorporated into the `SemanticSyntaxChecker` yet. --------- Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com> Co-authored-by: Carl Meyer <carl@astral.sh>	2025-05-08 10:30:04 -04:00
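The `load-before-global-declaration` error mentioned above corresponds to a check that CPython itself performs at compile time. As a small illustration (the source string and variable names are made up), compiling a function that reads a name before declaring it `global` raises a `SyntaxError`:

```python
# A function body that uses `x` before the `global x` statement.
SOURCE = """
x = 0

def f():
    print(x)
    global x
"""

try:
    compile(SOURCE, "<example>", "exec")
    error = None
except SyntaxError as exc:
    # CPython rejects this at compile time, before any code runs.
    error = exc.msg

print(error)
```

Because the check depends only on whether the name is already used or bound in the current scope at the point of the `global` statement, it fits naturally into a single pass over the scope, which is the approach the commit describes for the semantic index builder.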
Alex Waygood	51cef5a72b	[ty] Recognise functions containing `yield from` expressions as being generator functions (#17930 )	2025-05-07 23:29:44 +01:00
Dhruv Manilawala	78054824c0	[ty] Add support for `__all__` (#17856 )

## Summary

This PR adds support for the `__all__` module variable.

Reference spec: https://typing.python.org/en/latest/spec/distributing.html#library-interface-public-and-private-symbols

This PR adds a new `dunder_all_names` query that returns a set of `Name`s defined in the `__all__` variable of the given `File`. The query works by implementing the `StatementVisitor` and collects all the names by recognizing the supported idioms as mentioned in the spec. Any idiom that's not recognized is ignored.

The current implementation is the minimum required for us to remove all the false positives that this is causing. Refer to the "Follow-ups" section below to see what we can do next. I'll open a separate issue to keep track of them.

Closes: astral-sh/ty#106
Closes: astral-sh/ty#199

### Follow-ups

* Diagnostics:
  * Add warning diagnostics for unrecognized `__all__` idioms, `__all__` containing non-string element
  * Add an error diagnostic for elements that are present in `__all__` but not defined in the module. This could lead to runtime error
* Maybe we should return `<type>` instead of `Unknown | <type>` for `module.__all__`. For example: https://playknot.ruff.rs/2a6fe5d7-4e16-45b1-8ec3-d79f2d4ca894
* Mark a symbol that's mentioned in `__all__` as used otherwise it could raise (possibly in the future) "unused-name" diagnostic

Supporting diagnostics will require that we update the return type of the query to be something other than `Option<FxHashSet<Name>>`, something that behaves like a result and provides a way to check whether a name exists in `__all__`, loop over elements in `__all__`, loop over the invalid elements, etc.

## Ecosystem analysis

The following are the maximum number of diagnostics removed in the ecosystem:

* "Type <module '...'> has no attribute ..."
  * `collections.abc` - 14
  * `numpy` - 35534
  * `numpy.ma` - 296
  * `numpy.char` - 37
  * `numpy.testing` - 175
  * `hashlib` - 311
  * `scipy.fft` - 2
  * `scipy.stats` - 38
* "Module '...' has no member ..."
  * `collections.abc` - 85
  * `numpy` - 508
  * `numpy.testing` - 741
  * `hashlib` - 36
  * `scipy.stats` - 68
  * `scipy.interpolate` - 7
  * `scipy.signal` - 5

The following modules have dynamic `__all__` definition, so `ty` assumes that `__all__` doesn't exist in that module:

* `scipy.stats` (`95a5d6ea8b/scipy/stats/__init__.py (L665)`)
* `scipy.interpolate` (`95a5d6ea8b/scipy/interpolate/__init__.py (L221)`)
* `scipy.signal` (indirectly via `95a5d6ea8b/scipy/signal/_signal_api.py (L30)`)
* `numpy.testing` (`de784cd6ee/numpy/testing/__init__.py (L16-L18)`)

~There's this one category of false positives that have been added:~ Fixed the false positives by also ignoring `__all__` from a module that uses unrecognized idioms.

<details><summary>Details about the false positive:</summary>
<p>

The `scipy.stats` module has dynamic `__all__` and it imports a bunch of symbols via star imports. Some of those modules have a mix of valid and invalid `__all__` idioms. For example, in `95a5d6ea8b/scipy/stats/distributions.py (L18-L24)`, 2 out of 4 `__all__` idioms are invalid but currently `ty` recognizes two of them and says that the module has a `__all__` with 5 values. This leads to around 2055 newly added false positives of the form:

```
Type <module 'scipy.stats'> has no attribute ...
```

I think the fix here is to completely ignore `__all__`, not only if there are invalid elements in it, but also if there are unrecognized idioms used in the module.

</p>
</details>

## Test Plan

Add a bunch of test cases using the new `ty_extensions.dunder_all_names` function to extract a module's `__all__` names.

Update various test cases to remove false positives around `*` imports and re-export convention.

Add new test cases for named import behavior as `*` imports cover all of it already (thanks Alex!).	2025-05-07 21:42:42 +05:30
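The idiom-recognition approach described in this commit can be approximated in Python with the standard `ast` module. The sketch below is a rough illustration only: `collect_dunder_all` and its helpers are hypothetical names, it handles just two idioms (`__all__ = [...]` and `__all__ += [...]` with string literals at module top level), and it is not the `dunder_all_names` query from the commit. It does follow the commit's final decision to ignore `__all__` entirely once an unrecognized idiom is seen.

```python
import ast


def _is_all(node: ast.AST) -> bool:
    return isinstance(node, ast.Name) and node.id == "__all__"


def _strings(node: ast.AST):
    """Return the string elements of a list/tuple literal, or None if unsupported."""
    if not isinstance(node, (ast.List, ast.Tuple)):
        return None
    values = []
    for element in node.elts:
        if not (isinstance(element, ast.Constant) and isinstance(element.value, str)):
            return None
        values.append(element.value)
    return values


def collect_dunder_all(source: str):
    """Hypothetical approximation: the names in `__all__`, or None if the
    module has no `__all__` or defines it with an unrecognized idiom."""
    names = None
    for stmt in ast.parse(source).body:  # top-level statements only
        if isinstance(stmt, ast.Assign) and any(_is_all(t) for t in stmt.targets):
            items = _strings(stmt.value)
            if items is None:
                return None  # unrecognized idiom: ignore `__all__` entirely
            names = set(items)
        elif isinstance(stmt, ast.AugAssign) and _is_all(stmt.target):
            items = _strings(stmt.value) if isinstance(stmt.op, ast.Add) else None
            if items is None or names is None:
                return None
            names.update(items)
    return names


static = collect_dunder_all('__all__ = ["a", "b"]\n__all__ += ["c"]\n')
dynamic = collect_dunder_all("__all__ = [n for n in dir() if not n.startswith('_')]\n")
missing = collect_dunder_all("x = 1\n")
```

Returning `None` for both the missing and the dynamic case mirrors the `Option<FxHashSet<Name>>` return type mentioned in the commit; distinguishing the two would need a richer result type, as the Follow-ups section notes.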
Shunsuke Shibayama	fd76d70a31	[red-knot] fix narrowing in nested scopes (#17630 ) ## Summary This PR fixes #17595. ## Test Plan New test cases are added to `mdtest/narrow/conditionals/nested.md`. --------- Co-authored-by: Carl Meyer <carl@astral.sh>	2025-05-05 16:28:42 -07:00
Alex Waygood	bb6c7cad07	[ty] Fix false-positive `[invalid-return-type]` diagnostics on generator functions (#17871 )	2025-05-05 21:44:59 +00:00
renovate[bot]	2485afe640	Update pre-commit dependencies (#17840 ) Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com> Co-authored-by: Micha Reiser <micha@reiser.io>	2025-05-05 07:36:09 +00:00
Micha Reiser	fa628018b2	Use `#[expect(lint)]` over `#[allow(lint)]` where possible (#17822 )	2025-05-03 21:20:31 +02:00
Micha Reiser	b51c4f82ea	Rename Red Knot (#17820 )	2025-05-03 19:49:15 +02:00


135 Commits