Python/ruff

Commit Graph

Author	SHA1	Message	Date
Zanie Blue	cfd5d63917	Use operator specific messaging in division by zero diagnostics (#13588 ) Requested at https://github.com/astral-sh/ruff/pull/13576#discussion_r1782530971	2024-10-01 08:58:38 -05:00
Alex Waygood	2a36b47f13	[red-knot] Remove `Type::RevealType` (#13567 )	2024-10-01 10:01:03 +00:00
Zanie Blue	45f01e7872	Add diagnostic for integer division by zero (#13576 ) Adds a diagnostic for division by the integer zero in `//`, `/`, and `%`. Doesn't handle `<int> / 0.0` because we don't track the values of float literals.	2024-09-30 22:38:52 +00:00
Simon	6cdf996af6	[red-knot] feat: introduce a new `[Type::Todo]` variant (#13548 ) This variant shows inference that is not yet implemented. ## Summary PR #13500 reopened the idea of adding a new type variant to keep track of not-implemented features in Red Knot. It was based off of #12986 with a more generic approach of keeping track of different kinds of unknowns. Discussion in #13500 agreed that keeping track of different `Unknown` is complicated for now, and this feature is better achieved through a new variant of `Type`. ### Requirements Requirements for this implementation can be summed up with some extracts of comments from @carljm on the previous PR > So at the moment we are leaning towards simplifying this PR to just use a new top-level variant, which behaves like Any and Unknown but represents inference that is not yet implemented in red-knot. > I think the general rule should be that Todo should propagate only when the presence of the input Todo caused the output to be unknown. > > To take a specific example, the inferred result of addition must be Unknown if either operand is Unknown. That is, Unknown + X will always be Unknown regardless of what X is. (Same for X + Unknown.) In this case, I believe that Unknown + Todo (or Todo + Unknown) should result in Unknown, not result in Todo. If we fix the upstream source of the Todo, the result would still be Unknown, so it's not useful to propagate the Todo in this case: it wrongly suggests that the output is unknown because of a todo item. ## Test Plan This PR does not introduce new tests, but it did require editing some tests with the display of `[Type::Todo]` (currently `@Todo`), which suggests that those tests are placeholder requirements for features we don't support yet.	2024-09-30 14:28:06 -07:00
Zanie Blue	9d8a4c0057	Improve display of `assert_public_ty` assertion failures (#13577 ) While working on https://github.com/astral-sh/ruff/pull/13576 I noticed that it was really hard to tell which assertion failed in some of these test cases. This could be expanded to elsewhere, but I've heard this test suite format won't be around for long?	2024-09-30 16:12:26 -05:00
Charlie Marsh	c9c748a79e	Add some basic subscript type inference (#13562 ) ## Summary Just for tuples and strings -- the easiest cases. I think most of the rest require generic support?	2024-09-30 16:50:46 -04:00
Zanie Blue	32c746bd82	Fix inference when integers are divided (#13575 ) Fixes the `Operator::Div` case and adds `Operator::FloorDiv` support Closes https://github.com/astral-sh/ruff/issues/13570	2024-09-30 15:50:37 -05:00
Charlie Marsh	d86b73eb3d	Add unary inference for integer and boolean literals (#13559 ) ## Summary Just trying to familiarize myself with the general patterns, testing, etc. Part of https://github.com/astral-sh/ruff/issues/12701.	2024-09-30 16:29:06 +00:00
Alex Waygood	5f4b282327	[red-knot] Allow calling `bool()` with no arguments (#13568 )	2024-09-30 13:18:01 +00:00
aditya pillai	d9267132d6	Fix leftover references to `red_knot_python_semantic/vendor/` (#13561 ) Co-authored-by: Alex Waygood <alex.waygood@gmail.com>	2024-09-30 11:32:02 +00:00
TomerBin	ec72e675d9	Red Knot - Infer the return value of bool() (#13538 ) ## Summary Following #13449, this PR adds custom handling for the bool constructor, so when the input type has statically known truthiness value, it will be used as the return value of the bool function. For example, in the following snippet x will now be resolved to `Literal[True]` instead of `bool`. ```python x = bool(1) ``` ## Test Plan Some cargo tests were added.	2024-09-27 12:11:55 -07:00
Simon	1639488082	[red-knot] support fstring expressions (#13511 ) ## Summary Implement inference for `f-string`, contributes to #12701. ### First Implementation When looking at the way `mypy` handles things, I noticed the following: - No variables (e.g. `f"hello"`) ⇒ `LiteralString` - Any variable (e.g. `f"number {1}"`) ⇒ `str` My first commit (1ba5d0f13fdf70ed8b2b1a41433b32fc9085add2) implements exactly this logic, except that we deal with string literals just like `infer_string_literal_expression` (if below `MAX_STRING_LITERAL_SIZE`, show `Literal["exact string"]`) ### Second Implementation My second commit (90326ce9af5549af7b4efae89cd074ddf68ada14) pushes things a bit further to handle cases where the expressions within the `f-string` are all literal values (string representation known at static time). Here's an example of when this could happen in code: ```python BASE_URL = "https://httpbin.org" VERSION = "v1" endpoint = f"{BASE_URL}/{VERSION}/post" # Literal["https://httpbin.org/v1/post"] ``` As this can be slightly more costly (additional allocations), I don't know if we want this feature. ## Test Plan - Added a test `fstring_expression` covering all cases I can think of --------- Co-authored-by: Carl Meyer <carl@astral.sh>	2024-09-27 10:29:21 -07:00
haarisr	7c83af419c	red-knot: Implement the `not` operator for all `Type` variants (#13432 ) Signed-off-by: haaris <haarisrahman@gmail.com> Co-authored-by: Carl Meyer <carl@oddbird.net>	2024-09-25 13:44:19 -07:00
TomerBin	be1d5e3368	[red-knot] Add `Type::bool` and boolean expression inference (#13449 )	2024-09-25 00:02:26 +00:00
Micha Reiser	653c09001a	Use an empty vendored file system in Ruff (#13436 ) ## Summary This PR removes the typeshed stubs from the vendored file system shipped with ruff and instead ships an empty "typeshed". Making the typeshed files optional required extracting the typeshed files into a new `ruff_vendored` crate. I do like this even if all our builds always include typeshed because it means `red_knot_python_semantic` contains less code that needs compiling. This also allows us to use deflate because the compression algorithm doesn't matter for an archive containing a single, empty file. ## Test Plan `cargo test` I verified with ` cargo tree -f "{p} {f}" -p <package> ` that: * red_knot_wasm: enables `deflate` compression * red_knot: enables `zstd` compression * `ruff`: uses stored I'm not quite sure how to build the binary that maturin builds but comparing the release artifact size with `strip = true` shows a `1.5MB` size reduction --------- Co-authored-by: Charlie Marsh <charlie.r.marsh@gmail.com>	2024-09-21 16:31:42 +00:00
Micha Reiser	8921fbb54c	`vendored_typeshed_versions` should use `db.vendored` (#13434 )	2024-09-21 16:35:06 +02:00
haarisr	6c303b2445	red-knot: Add not unary operator for boolean literals (#13422 ) ## Summary Contributes to #12701 ## Test Plan Added test for boolean literals Signed-off-by: haaris <haarisrahman@gmail.com>	2024-09-20 15:24:38 -07:00
Charlie Marsh	ff11db61b4	Add Python version support to ruff analyze CLI (#13426 )	2024-09-20 15:40:47 -04:00
Carl Meyer	149fb2090e	[red-knot] more efficient UnionBuilder::add (#13411 ) Avoid quadratic time in subsumed elements when adding a super-type of existing union elements. Reserve space in advance when adding multiple elements (from another union) to a union. Make union elements a `Box<[Type]>` instead of an `FxOrderSet`; the set doesn't buy much since the rules of union uniqueness are defined in terms of supertype/subtype, not in terms of simple type identity. Move sealed-boolean handling out of a separate `UnionBuilder::simplify` method and into `UnionBuilder::add`; now that `add` is iterating existing elements anyway, this is more efficient. Remove `UnionType::contains`, since it's now `O(n)` and we shouldn't really need it, generally we care about subtype/supertype, not type identity. (Right now it's used for `Type::Unbound`, which shouldn't even be a type.) Add support for `is_subtype_of` for the `object` type. Addresses comments on https://github.com/astral-sh/ruff/pull/13401	2024-09-20 10:49:45 -07:00
Carl Meyer	40c65dcfa7	[red-knot] dedicated error message for all-union-elements not callable (#13412 ) This was mentioned in an earlier review, and seemed easy enough to just do it. No need to repeat all the types twice when it gives no additional information.	2024-09-20 08:08:43 -07:00
Charlie Marsh	4e935f7d7d	Add a subcommand to generate dependency graphs (#13402 ) ## Summary This PR adds an experimental Ruff subcommand to generate dependency graphs based on module resolution. A few highlights: - You can generate either dependency or dependent graphs via the `--direction` command-line argument. - Like Pants, we also provide an option to identify imports from string literals (`--detect-string-imports`). - Users can also provide additional dependency data via the `include-dependencies` key under `[tool.ruff.import-map]`. This map uses file paths as keys, and lists of strings as values. Those strings can be file paths or globs. The dependency resolution uses the red-knot module resolver which is intended to be fully spec compliant, so it's also a chance to expose the module resolver in a real-world setting. The CLI is, e.g., `ruff graph build ../autobot`, which will output a JSON map from file to files it depends on for the `autobot` project.	2024-09-19 21:06:32 -04:00
Carl Meyer	260c2ecd15	[red-knot] visit with-item vars even if not a Name (#13409 ) This fixes the last panic on checking pandas. (Match statement became an `if let` because clippy decided it wanted that once I added the additional line in the else case?) --------- Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2024-09-19 10:37:49 -07:00
Carl Meyer	a6d3d2fccd	[red-knot] support reveal_type as pseudo-builtin (#13403 ) Support using `reveal_type` without importing it, as implied by the type spec and supported by existing type checkers. We use `typing_extensions.reveal_type` for the implicit built-in; this way it exists on all Python versions. (It imports from `typing` on newer Python versions.) Emits an "undefined name" diagnostic whenever `reveal_type` is referenced in this way (in addition to the revealed-type diagnostic when it is called). This follows the mypy example (with `--enable-error-code unimported-reveal`) and I think provides a good (and easily understandable) balance for user experience. If you are using `reveal_type` for quick temporary debugging, the additional undefined-name diagnostic doesn't hinder that use case. If we make the revealed-type diagnostic a non-failing one, the undefined-name diagnostic can still be a failing diagnostic, helping prevent accidentally leaving it in place. For any use cases where you want to leave it in place, you can always import it to avoid the undefined-name diagnostic. In the future, we can easily provide configuration options to a) turn off builtin-reveal_type altogether, and/or b) silence the undefined-name diagnostic when using it, if we have users on either side (loving or hating pseudo-builtin `reveal_type`) who are dissatisfied with this compromise.	2024-09-19 07:58:08 -07:00
Simon	a8d9104fa3	Fix/#13070 defer annotations when future is active (#13395 )	2024-09-19 10:13:37 +02:00
Carl Meyer	cf1e91bb59	[red-knot] simplify subtypes from unions (#13401 ) Add `Type::is_subtype_of` method, and simplify subtypes out of unions.	2024-09-18 22:06:39 -07:00
Carl Meyer	125eaafae0	[red-knot] inferred type, not Unknown, for undeclared paths (#13400 ) After looking at more cases (for example, the case in the added test in this PR), I realized that our previous rule, "if a symbol has any declarations, use only declarations for its public type" is not adequate. Rather than using `Unknown` as fallback if the symbol is not declared in some paths, we need to use the inferred type as fallback in that case. For the paths where the symbol _was_ declared, we know that any bindings must be assignable to the declared type in that path, so this won't change the overall declared type in those paths. But for paths where the symbol wasn't declared, this will give us a better type in place of `Unknown`.	2024-09-18 21:47:49 -07:00
Carl Meyer	7aae80903c	[red-knot] add support for typing_extensions.reveal_type (#13397 ) Before `typing.reveal_type` existed, there was `typing_extensions.reveal_type`. We should support both. Also adds a test to verify that we can handle aliasing of `reveal_type` to a different name. Adds a bit of code to ensure that if we have a union of different `reveal_type` functions (e.g. a union containing both `typing_extensions.reveal_type` and `typing.reveal_type`) we still emit the reveal-type diagnostic only once. This is probably unlikely in practice, but it doesn't hurt to handle it smoothly. (It comes up now because we don't support `version_info` checks yet, so `typing_extensions.reveal_type` is actually that union.) --------- Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2024-09-18 21:39:03 -07:00
Carl Meyer	4aca9b91ba	[red-knot] consider imports to be declarations (#13398 ) I noticed that this pattern sometimes occurs in typeshed: ``` if ...: from foo import bar else: def bar(): ... ``` If we have the rule that symbols with declarations only use declarations for the public type, then this ends up resolving as `Unknown \| Literal[bar]`, because we didn't consider the import to be a declaration. I think the most straightforward thing here is to also consider imports as declarations. The same rationale applies as for function and class definitions: if you shadow an import, you should have to explicitly shadow with an annotation, rather than just doing it implicitly/accidentally. We may also ultimately need to re-evaluate the rule that public type considers only declarations, if there are declarations.	2024-09-18 20:59:03 -07:00
Hamir Mahal	8b3da1867e	refactor: remove unnecessary string hashes (#13250 )	2024-09-18 19:08:59 +02:00
Carl Meyer	c173ec5bc7	[red-knot] support for typing.reveal_type (#13384 ) Add support for the `typing.reveal_type` function, emitting a diagnostic revealing the type of its single argument. This is a necessary piece for the planned testing framework. This puts the cart slightly in front of the horse, in that we don't yet have proper support for validating call signatures / argument types. But it's easy to do just enough to make `reveal_type` work. This PR includes support for calling union types (this is necessary because we don't yet support `sys.version_info` checks, so `typing.reveal_type` itself is a union type), plus some nice consolidated error messages for calls to unions where some elements are not callable. This is mostly to demonstrate the flexibility in diagnostics that we get from the `CallOutcome` enum.	2024-09-18 09:59:51 -07:00
Carl Meyer	dcfebaa4a8	[red-knot] use declared types in inference/checking (#13335 ) Use declared types in inference and checking. This means several things: * Imports prefer declarations over inference, when declarations are available. * When we encounter a binding, we check that the bound value's inferred type is assignable to the live declarations of the bound symbol, if any. * When we encounter a declaration, we check that the declared type is assignable from the inferred type of the symbol from previous bindings, if any. * When we encounter a binding+declaration, we check that the inferred type of the bound value is assignable to the declared type.	2024-09-17 08:11:06 -07:00
github-actions[bot]	1365b0806d	Sync vendored typeshed stubs (#13355 ) Close and reopen this PR to trigger CI Co-authored-by: typeshedbot <>	2024-09-14 20:40:42 -04:00
Alex Waygood	f4de49ab37	[red-knot] Clarify how scopes are pushed and popped for comprehensions and generator expressions (#13353 )	2024-09-14 13:31:17 -04:00
Carl Meyer	d988204b1b	[red-knot] add Declarations support to semantic indexing (#13334 ) Add support for declared types to the semantic index. This involves a lot of renaming to clarify the distinction between bindings and declarations. The Definition (or more specifically, the DefinitionKind) becomes responsible for determining which definitions are bindings, which are declarations, and which are both, and the symbol table building is refactored a bit so that the `IS_BOUND` (renamed from `IS_DEFINED` for consistent terminology) flag is always set when a binding is added, rather than being set separately (and requiring us to ensure it is set properly). The `SymbolState` is split into two parts, `SymbolBindings` and `SymbolDeclarations`, because we need to store live bindings for every declaration and live declarations for every binding; the split lets us do this without storing more than we need. The massive doc comment in `use_def.rs` is updated to reflect bindings vs declarations. The `UseDefMap` gains some new APIs which are allow-unused for now, since this PR doesn't yet update type inference to take declarations into account.	2024-09-13 13:55:22 -04:00
Carl Meyer	43a5922f6f	[red-knot] add BitSet::is_empty and BitSet::union (#13333 ) Add `::is_empty` and `::union` methods to the `BitSet` implementation. Allowing unused for now, until these methods become used later with the declared-types implementation. --------- Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2024-09-12 14:25:45 -04:00
Carl Meyer	175d067250	[red-knot] add initial Type::is_equivalent_to and Type::is_assignable_to (#13332 ) These are quite incomplete, but I needed to start stubbing them out in order to build and test declared-types. Allowing unused for now, until they are used later in the declared-types PR. --------- Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2024-09-12 14:15:25 -04:00
Alex Waygood	4dc2c257ef	[red-knot] Fix type inference for `except*` definitions (#13320 )	2024-09-11 15:05:40 -04:00
Alex Waygood	a7b8cc08f0	[red-knot] Fix `.to_instance()` for union types (#13319 )	2024-09-10 22:41:45 +00:00
Alex Waygood	b93d0ab57c	[red-knot] Add control flow for `for` loops (#13318 )	2024-09-10 22:04:35 +00:00
Alex Waygood	e6b927a583	[red-knot] Add a convenience method for constructing a union from a list of elements (#13315 )	2024-09-10 17:38:56 -04:00
Alex Waygood	2ca78721e6	[red-knot] Improve type inference for iteration over heterogenous tuples (#13314 ) Followup to #13295	2024-09-10 15:13:50 -04:00
Dhruv Manilawala	b7cef6c999	[red-knot] Add heterogeneous tuple type variant (#13295 ) ## Summary This PR adds a new `Type` variant called `TupleType` which is used for heterogeneous elements. ### Display notes * For an empty tuple, I'm using `tuple[()]` as described in the docs: https://docs.python.org/3/library/typing.html#annotating-tuples * For nested elements, it'll use the literal type instead of builtin type unlike Pyright which does `tuple[Literal[1], tuple[int, int]]` instead of `tuple[Literal[1], tuple[Literal[2], Literal[3]]]`. Also, mypy would give `tuple[builtins.int, builtins.int]` instead of `tuple[Literal[1], Literal[2]]` ## Test Plan Update test case to account for the display change and add cases for multiple elements and nested tuple elements. --------- Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com> Co-authored-by: Carl Meyer <carl@astral.sh>	2024-09-10 17:54:19 +00:00
Dhruv Manilawala	62c7d8f6ba	[red-knot] Add control flow support for match statement (#13241 ) ## Summary This PR adds support for control flow for match statement. It also adds the necessary infrastructure required for narrowing constraints in case blocks and implements the logic for `PatternMatchSingleton` which is either `None` / `True` / `False`. Even after this the inferred type doesn't get simplified completely, there's a TODO for that in the test code. ## Test Plan Add test cases for control flow for (a) when there's a wildcard pattern and (b) when there isn't. There's also a test case to verify the narrowing logic. --------- Co-authored-by: Carl Meyer <carl@astral.sh>	2024-09-10 02:14:19 +05:30
Alex Waygood	6f53aaf931	[red-knot] Add type inference for loop variables inside comprehension scopes (#13251 )	2024-09-09 20:22:01 +00:00
Alex Waygood	1eb3e4057f	[red-knot] Add definitions and limited type inference for exception handlers (#13267 )	2024-09-09 07:35:15 -04:00
Dylan	e4aa479515	[red-knot] Handle StringLiteral truncation (#13276 ) When a type of the form `Literal["..."]` would be constructed with too large of a string, this PR converts it to `LiteralString` instead. We also extend inference for binary operations to include the case where one of the operands is `LiteralString`. Closes #13224	2024-09-07 20:25:09 -07:00
Simon	594dee1b0b	[red-knot] resolve source/stubs over namespace packages (#13254 )	2024-09-06 12:14:26 +01:00
Carl Meyer	a4ebe7d344	[red-knot] consolidate diagnostic and inference tests (#13248 ) Pull the tests from `types.rs` into `infer.rs`. All of these are integration tests with the same basic form: create a code sample, run type inference or check on it, and make some assertions about types and/or diagnostics. These are the sort of tests we will want to move into a test framework with a low-boilerplate custom textual format. In the meantime, having them together (and more importantly, their helper utilities together) means that it's easy to keep tests for related language features together (iterable tests with other iterable tests, callable tests with other callable tests), without an artificial split based on tests which test diagnostics vs tests which test inference. And it allows a single test to more easily test both diagnostics and inference. (Ultimately in the test framework, they will likely all test diagnostics, just in some cases the diagnostics will come from `reveal_type()`.)	2024-09-05 09:15:22 -07:00
Carl Meyer	2a3775e525	[red-knot] AnnAssign with no RHS is not a Definition (#13247 ) My plan for handling declared types is to introduce a `Declaration` in addition to `Definition`. A `Declaration` is an annotation of a name with a type; a `Definition` is an actual runtime assignment of a value to a name. A few things (an annotated function parameter, an annotated-assignment with an RHS) are both a `Definition` and a `Declaration`. This more cleanly separates type inference (only cares about `Definition`) from declared types (only impacted by a `Declaration`), and I think it will work out better than trying to squeeze everything into `Definition`. One of the tests in this PR (`annotation_only_assignment_transparent_to_local_inference`) demonstrates one reason why. The statement `x: int` should have no effect on local inference of the type of `x`; whatever the locally inferred type of `x` was before `x: int` should still be the inferred type after `x: int`. This is actually quite hard to do if `x: int` is considered a `Definition`, because a core assumption of the use-def map is that a `Definition` replaces the previous value. To achieve this would require some hackery to effectively treat `x: int` sort of as if it were `x: int = x`, but it's not really even equivalent to that, so this approach gets quite ugly. As a first step in this plan, this PR stops treating AnnAssign with no RHS as a `Definition`, which fixes behavior in a couple added tests. This actually makes things temporarily worse for the ellipsis-type test, since it is defined in typeshed only using annotated assignments with no RHS. This will be fixed properly by the upcoming addition of declarations, which should also treat a declared type as sufficient to import a name, at least from a stub.	2024-09-05 08:55:00 -07:00
Carl Meyer	66fe226608	[red-knot] fix lookup of nonlocal names in deferred annotations (#13236 ) Initially I had deferred annotation name lookups reuse the "public symbol type", since that gives the correct "from end of scope" view of reaching definitions that we want. But there is a key difference; public symbol types are based only on definitions in the queried scope (or "name in the given namespace" in runtime terms), they don't ever look up a name in nonlocal/global/builtin scopes. Deferred annotation resolution should do this lookup. Add a test, and fix deferred name resolution to support nonlocal/global/builtin names. Fixes #13176	2024-09-04 10:10:54 -07:00
Alex Waygood	e965f9cc0e	[red-knot] Infer `Unknown` for the loop var in `async for` loops (#13243 )	2024-09-04 14:24:58 +00:00
Alex Waygood	0512428a6f	[red-knot] Emit a diagnostic if the value of a starred expression or a `yield from` expression is not iterable (#13240 )	2024-09-04 14:19:11 +00:00
Alex Waygood	46a457318d	[red-knot] Add type inference for basic `for` loops (#13195 )	2024-09-04 10:19:50 +00:00
Dhruv Manilawala	862bd0c429	[red-knot] Add debug assert to check for duplicate definitions (#13214 ) ## Summary Closes: #13085 ## Test Plan `cargo insta test --workspace`	2024-09-04 05:53:32 +00:00
Dhruv Manilawala	e1e9143c47	[red-knot] Handle multiple comprehension targets (#13213 ) ## Summary Part of #13085, this PR updates the comprehension definition to handle multiple targets. ## Test Plan Update existing semantic index test case for comprehension with multiple targets. Running corpus tests shouldn't panic.	2024-09-04 11:18:58 +05:30
Carl Meyer	3c4ec82aee	[red-knot] support non-local name lookups (#13177 ) Add support for non-local name lookups. There's one TODO around annotated assignments without a RHS; these need a fair amount of attention, which they'll get in an upcoming PR about declared vs inferred types. Fixes #11663	2024-09-03 14:18:05 -07:00
Carl Meyer	29c36a56b2	[red-knot] fix scope inference with deferred types (#13204 ) Test coverage for #13131 wasn't as good as I thought it was, because although we infer a lot of types in stubs in typeshed, we don't check typeshed, and therefore we don't do scope-level inference and pull all types for a scope. So we didn't really have good test coverage for scope-level inference in a stub. And because of this, I got the code for supporting that wrong, meaning that if we did scope-level inference with deferred types, we'd end up never populating the deferred types in the scope's `TypeInference`, which causes panics like #13160. Here I both add test coverage by running the corpus tests both as `.py` and as `.pyi` (which reveals the panic), and I fix the code to support deferred types in scope inference. This also revealed a problem with deferred types in generic functions, which effectively span two scopes. That problem will require a bit more thought, and I don't want to block this PR on it, so for now I just don't defer annotations on generic functions. Fixes #13160.	2024-09-03 11:20:43 -07:00
Alex Waygood	dfee65882b	[red-knot] Inline `Type::is_literal` (#13230 )	2024-09-03 15:02:50 +01:00
Alex Waygood	9d517061f2	[red-knot] Reduce some repetitiveness in tests (#13135 )	2024-09-03 11:26:44 +01:00
Dhruv Manilawala	facf6febf0	[red-knot] Remove match pattern definition visitor (#13209 ) ## Summary This PR is based on this discussion: https://github.com/astral-sh/ruff/pull/13147#discussion_r1739408653. Todo - [x] Add documentation for `MatchPatternState` ## Test Plan `cargo insta test` and `cargo clippy`	2024-09-03 08:53:35 +00:00
Simon	46e687e8d1	[red-knot] Condense literals display by types (#13185 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2024-09-03 07:23:28 +00:00
Micha Reiser	599103c933	Add a few missing `#[return_ref]` attributes (#13223 )	2024-09-03 09:15:43 +02:00
Dhruv Manilawala	47f0b45be3	Implement `AstNode` for `Identifier` (#13207 ) ## Summary Follow-up to #13147, this PR implements the `AstNode` for `Identifier`. This makes it easier to create the `NodeKey` in red knot because it uses a generic method to construct the key from `AnyNodeRef` and is important for definitions that are created only on identifiers instead of `ExprName`. ## Test Plan `cargo test` and `cargo clippy`	2024-09-02 16:27:12 +05:30
Dhruv Manilawala	17eb65b26f	Add definitions for match statement (#13147 ) ## Summary This PR adds definition for match patterns. ## Test Plan Update the existing test case for match statement symbols to verify that the definitions are added as well.	2024-09-02 14:40:09 +05:30
Micha Reiser	9986397d56	Avoid allocating `OrderedSet` in `UnionBuilder::simplify` (#13206 ) Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2024-09-02 09:07:53 +00:00
Alex Waygood	2014cba87f	[red-knot] Fix call expression inference edge case for decorated functions (#13191 )	2024-09-01 16:19:40 +01:00
Dylan	52d8847b60	[red-knot] `Literal[True,False]` normalized to `builtins.bool` (#13178 ) The `UnionBuilder` builds `builtins.bool` when handed `Literal[True]` and `Literal[False]`. Caveat: If the builtins module is unfindable somehow, the builder falls back to the union type of these two literals. First task from #12694 --------- Co-authored-by: Carl Meyer <carl@astral.sh>	2024-08-31 22:57:50 -07:00
Alex Waygood	fae0573817	[red-knot] Fix async function edge case for inference of call expressions (#13187 )	2024-09-01 01:58:35 +01:00
github-actions[bot]	0c23b868dc	Sync vendored typeshed stubs (#13188 ) Co-authored-by: typeshedbot <>	2024-09-01 01:41:27 +01:00
Dylan	3ceedf76b8	[red-knot] Infer type of class constructor call expression (#13171 ) This tiny PR implements the following type inference: the type of `Foo(...)` will be `Foo`. --------- Co-authored-by: Carl Meyer <carl@astral.sh>	2024-08-30 16:48:06 -07:00
Chris Krycho	28ab5f4065	[red-knot] implement basic call expression inference (#13164 ) ## Summary Adds basic support for inferring the type resulting from a call expression. This only works for the result of call expressions; it performs no inference on parameters. It also intentionally does nothing with class instantiation, `__call__` implementors, or lambdas. ## Test Plan Adds a test that it infers the right thing! --------- Co-authored-by: Carl Meyer <carl@astral.sh>	2024-08-30 12:51:29 -07:00
Chris Krycho	f8656ff35e	[red-knot] infer basic (name-based) annotation expressions (#13130 ) ## Summary - Introduce methods for inferring annotation and type expressions. - Correctly infer explicit return types from functions where they are simple names that can be resolved in scope. Contributes to #12701 by way of helping unlock call expressions (this does not remotely finish that, as it stands, but it gets us moving that direction). ## Test Plan Added a test for function return types which use the name form of an annotation expression, since this is aiming toward call expressions. When we extend this to working for other annotation and type expression positions, we should add explicit tests for those as well. --------- Co-authored-by: Alex Waygood <alex.waygood@gmail.com> Co-authored-by: Carl Meyer <carl@astral.sh>	2024-08-30 08:24:36 -07:00
Carl Meyer	770ef2ab27	[red-knot] support deferred evaluation of type expressions (#13131 ) Prototype deferred evaluation of type expressions by deferring evaluation of class bases in a stub file. This allows self-referential class definitions, as occur with the definition of `str` in typeshed (which inherits `Sequence[str]`). --------- Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2024-08-28 11:41:01 -07:00
Alex Waygood	cfafaa7637	[red-knot] Remove very noisy tracing call when resolving `ImportFrom` statements (#13136 )	2024-08-28 10:05:00 +00:00
Chris Krycho	81cd438d88	red-knot: infer and display ellipsis type (#13124 ) ## Summary Just what it says on the tin: adds basic `EllipsisType` inference for any time `...` appears in the AST. ## Test Plan Test that `x = ...` produces exactly what we would expect. --------- Co-authored-by: Carl Meyer <carl@oddbird.net>	2024-08-27 20:52:53 +01:00
Chris Krycho	aba1802828	red-knot: infer multiplication for strings and integers (#13117 ) ## Summary The resulting type when multiplying a string literal by an integer literal is one of two types: - `StringLiteral`, in the case where it is a reasonably small resulting string (arbitrarily bounded here to 4096 bytes, roughly a page on many operating systems), including the fully expanded string. - `LiteralString`, matching Pyright etc., for strings larger than that. Additionally: - Switch to using `Box<str>` instead of `String` for the internal value of `StringLiteral`, saving some non-trivial byte overhead (and keeping the total number of allocations the same). - Be clearer and more accurate about which types we ought to defer to in `StringLiteral` and `LiteralString` member lookup. ## Test Plan Added a test case covering multiplication times integers: positive, negative, zero, and in and out of bounds. --------- Co-authored-by: Alex Waygood <alex.waygood@gmail.com> Co-authored-by: Carl Meyer <carl@astral.sh>	2024-08-27 09:00:36 -07:00
Alex Waygood	a5ef124201	[red-knot] Improve the accuracy of the unresolved-import check (#13055 )	2024-08-27 14:17:22 +01:00
Chris Krycho	390bb43276	red-knot: flatten match expression in `infer_binary_expression` (#13115 ) ## Summary This fixes the outstanding TODO and makes it easier to work with new cases. (Tidy first, then implement, basically!) ## Test Plan After making this change all the existing tests still pass. A classic refactor win. 🎉	2024-08-26 12:34:07 -07:00
Chris Krycho	fe8b15291f	red-knot: implement unary minus on integer literals (#13114 ) ## Summary Add support for the first unary operator: negating integer literals. The resulting type is another integer literal, with the value being the negated value of the literal. All other types continue to return `Type::Unknown` for the present, but this is designed to be easy to extend with other combinations of operator and operand. Contributes to #12701. ## Test Plan Add tests with basic negation, including of very large integers and double negation.	2024-08-26 12:08:18 -07:00
Chris Krycho	c4d628cc4c	red-knot: infer string literal types (#13113 ) ## Summary Introduce a `StringLiteralType` with corresponding `Display` type and a relatively basic test that the resulting representation is as expected. Note: we currently always allocate for `StringLiteral` types. This may end up being a perf issue later, at which point we may want to look at other ways of representing `value` here, i.e. with some kind of smarter string structure which can reuse types. That is most likely to show up with e.g. concatenation. Contributes to #12701. ## Test Plan Added a test for individual strings with both single and double quotes as well as concatenated strings with both forms.	2024-08-26 11:42:34 -07:00
Dylan	8c09496b07	[red-knot] Resolve function annotations before adding function symbol (#13084 ) This PR has the `SemanticIndexBuilder` visit function definition annotations before adding the function symbol/name to the builder. For example, the following snippet no longer causes a panic: ```python def bool(x) -> bool: return True ``` Note: This fix changes the ordering of the global symbol table. Closes #13069	2024-08-23 19:31:36 -07:00
Alex Waygood	d19fd1b91c	[red-knot] Add symbols for `for` loop variables (#13075 ) ## Summary This PR adds symbols introduced by `for` loops to red-knot: - `x` in `for x in range(10): pass` - `x` and `y` in `for x, y in d.items(): pass` - `a`, `b`, `c` and `d` in `for [((a,), b), (c, d)] in foo: pass` ## Test Plan Several tests added, and the assertion in the benchmarks has been updated. --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2024-08-23 23:40:27 +01:00
Teodoro Freund	b9c8113a8a	Added bytes type and some inference (#13061 ) ## Summary This PR adds the `bytes` type to red-knot: - Added the `bytes` type - Added support for bytes literals - Support for the `+` operator Improves on #12701 Big TODO on supporting and normalizing r-prefixed bytestrings (`rb"hello\n"`) ## Test Plan Added a test for bytes literals, concatenation, and corner values	2024-08-22 13:27:15 -07:00
Dylan	2edd32aa31	[red-knot] `SemanticIndexBuilder` visits value before target in named expressions (#13053 ) The `SemanticIndexBuilder` was causing a cycle in a salsa query by attempting to resolve the target before the value in a named expression (e.g. `x := x+1`). This PR swaps the order, avoiding a panic. Closes #13012.	2024-08-22 07:59:13 -07:00
Dhruv Manilawala	8144a11f98	[red-knot] Add definition for with items (#12920 ) ## Summary This PR adds symbols and definitions introduced by `with` statements. The symbols and definitions are introduced for each with item. The type inference is updated to call the definition region type inference instead. ## Test Plan Add test case to check for symbol table and definitions.	2024-08-22 08:00:19 +05:30
Micha Reiser	dce87c21fd	Eagerly validate typeshed versions (#12786 )	2024-08-21 15:49:53 +00:00
Alex Waygood	ecd9e6a650	[red-knot] Improve the `unresolved-import` check (#13007 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2024-08-21 13:44:49 +00:00
Micha Reiser	a35cdbb275	Fix various panics when linting black/src (#13033 )	2024-08-21 12:35:29 +00:00
Alex Waygood	37a60460ed	[red-knot] Improve various tracing logs (#13015 )	2024-08-20 18:34:51 +00:00
Micha Reiser	c65e3310d5	Add API to emit type-checking diagnostics (#12988 ) Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2024-08-20 07:22:30 +00:00
Dhruv Manilawala	1a8f29ea41	[red-knot] Add symbols defined by `match` statements (#12926 ) ## Summary This PR adds symbols introduced by `match` statements. There are three patterns that introduce new symbols: * `as` pattern * Sequence pattern * Mapping pattern The recursive nature of the visitor makes sure that all symbols are added. ## Test Plan Add test case for all types of patterns that introduce a symbol.	2024-08-20 05:16:27 +00:00
Dhruv Manilawala	aefaddeae7	[red-knot] Add definition for augmented assignment (#12892 ) ## Summary This PR adds definition for augmented assignment. This is similar to annotated assignment in terms of implementation. An augmented assignment should also record a use of the variable but that's a TODO for now. ## Test Plan Add test case to validate that a definition is added.	2024-08-20 10:33:55 +05:30
Micha Reiser	dd0a7ec73e	Pull all types in corpus tests (#12919 )	2024-08-17 11:59:55 +00:00
Carl Meyer	6359e55383	[red-knot] type narrowing (#12706 ) Extend the `UseDefMap` to also track which constraints (provided by e.g. `if` tests) apply to each visible definition. Uses a custom `BitSet` and `BitSetArray` to track which constraints apply to which definitions, while keeping data inline as much as possible.	2024-08-16 16:34:13 -07:00
Alex Waygood	a9847af6e8	[red-knot] Use `Unknown` rather than `Unbound` for unresolved imports (#12932 )	2024-08-16 20:10:33 +01:00
Alex Waygood	a87b27c075	[red-knot] Add support for relative imports (#12910 ) Co-authored-by: Carl Meyer <carl@astral.sh>	2024-08-16 12:35:27 +01:00
Dhruv Manilawala	bd4a947b29	[red-knot] Add symbol and definition for parameters (#12862 ) ## Summary This PR adds support for adding symbols and definitions for function and lambda parameters to the semantic index. ### Notes * The default expression of a parameter is evaluated in the enclosing scope (not the type parameter or function scope). * The annotation expression of a parameter is evaluated in the type parameter scope if one is present, otherwise in the enclosing scope. * The symbols and definitions are added in the function parameter scope. ### Type Inference There are two definitions, `Parameter` and `ParameterWithDefault`, and their respective `*_definition` methods on the type inference builder. These methods are preferred and are re-used when checking from a different region. ## Test Plan Add test case for validating that the parameters are defined in the function / lambda scope. ### Benchmark update Validated the difference in diagnostics for benchmark code between `main` and this branch. All of them are either directly or indirectly referencing one of the function parameters. The diff is in the PR description.	2024-08-16 10:59:59 +05:30
Carl Meyer	80efb865e9	[red-knot] fix lookups of possibly-shadowed builtins (#12898 ) If a builtin is conditionally shadowed by a global, we didn't correctly fall back to builtins for the not-defined-in-globals path (see added test for an example.)	2024-08-15 14:09:29 -07:00
github-actions[bot]	ac7b1770e2	Sync vendored typeshed stubs (#12899 ) Close and reopen this PR to trigger CI Co-authored-by: typeshedbot <>	2024-08-14 18:11:23 -07:00
Dhruv Manilawala	7027344dfc	Add scope and definitions for comprehensions (#12748 ) ## Summary This PR adds scope and definition for comprehension nodes. This includes the following nodes: * List comprehension * Dictionary comprehension * Set comprehension * Generator expression ### Scope Each expression here adds its own scope with one caveat - the `iter` expression of the first generator is part of the parent scope. For example, in the following code snippet the `iter1` variable is evaluated in the outer scope. ```py [x for x in iter1] ``` > The iterable expression in the leftmost for clause is evaluated directly in the enclosing scope and then passed as an argument to the implicitly nested scope. > > Reference: https://docs.python.org/3/reference/expressions.html#displays-for-lists-sets-and-dictionaries There's another special case for assignment expressions: > There is one special case: an assignment expression occurring in a list, set or dict comprehension or in a generator expression (below collectively referred to as “comprehensions”) binds the target in the containing scope, honoring a nonlocal or global declaration for the target in that scope, if one exists. > > Reference: https://peps.python.org/pep-0572/#scope-of-the-target For example, in the following code snippet, the variables `a` and `b` are available after the comprehension while `x` isn't: ```py [a := 1 for x in range(2) if (b := 2)] ``` ### Definition Each comprehension node adds a single definition, the "target" variable (`[_ for target in iter]`). This has been accounted for and a new variant has been added to `DefinitionKind`. ### Type Inference Currently, type inference is limited to a single scope. It doesn't _enter_ another scope to infer the types of the remaining expressions of a node. To accommodate this, the type inference for a scope requires new methods which _don't_ infer the type of the `iter` expression of the leftmost outer generator (that's defined in the enclosing scope). 
The type inference for the scope region is split into two parts: * `infer_generator_expression` (similarly for comprehensions) infers the type of the `iter` expression of the leftmost outer generator * `infer_generator_expression_scope` (similarly for comprehension) infers the type of the remaining expressions except for the one mentioned in the previous point The type inference for the definition also needs to account for this special case of leftmost generator. This is done by defining a `first` boolean parameter which indicates whether this comprehension definition occurs first in the enclosing expression. ## Test Plan New test cases were added to validate multiple scenarios. Refer to the documentation for each test case which explains what is being tested.	2024-08-13 07:00:33 +05:30
Carl Meyer	fb9f0c448f	[red-knot] cleanup doc comments and attributes (#12792 ) Make `cargo doc -p red_knot_python_semantic --document-private-items` run warning-free. I'd still like to do this for all of ruff and start enforcing it in CI (https://github.com/astral-sh/ruff/issues/12372) but haven't gotten to it yet. But in the meantime I'm trying to maintain it for at least `red_knot_python_semantic`, as it helps to ensure our doc comments stay up to date. A few of the comments I just removed or shortened, as their continued relevance wasn't clear to me; please object in review if you think some of them are important to keep! Also remove a no-longer-needed `allow` attribute.	2024-08-12 12:15:16 -07:00
Carl Meyer	75131c6f4a	[red-knot] add IntersectionBuilder (#12791 ) For type narrowing, we'll need intersections (since applying type narrowing is just a type intersection.) Add `IntersectionBuilder`, along with some tests for it and `UnionBuilder` (renamed from `UnionTypeBuilder`). We use smart builders to ensure that we always keep these types in disjunctive normal form (DNF). That means that we never have deeply nested trees of unions and intersections: unions flatten into unions, intersections flatten into intersections, and intersections distribute over unions, so the most complex tree we can ever have is a union of intersections. We also never have a single-element union or a single-positive-element intersection; these both just simplify to the contained type. Maintaining these invariants means that `UnionBuilder` doesn't necessarily end up building a `Type::Union` (e.g. if you only add a single type to the union, it'll just return that type instead), and `IntersectionBuilder` doesn't necessarily build a `Type::Intersection` (if you add a union to the intersection, we distribute the intersection over that union, and `IntersectionBuilder` will end up returning a `Type::Union` of intersections). We also simplify intersections by ensuring that if a type and its negation are both in an intersection, they simplify out. (In future this should also respect subtyping, not just type identity, but we don't have subtyping yet.) We do implement subtyping of `Never` as a special case for now. Most of this PR is unused for now until type narrowing lands; I'm just breaking it out to reduce the review fatigue of a single massive PR.	2024-08-12 11:56:04 -07:00
Dhruv Manilawala	99dc208b00	[red-knot] Add filename and source location for diagnostics (#12842 ) ## Summary I'm not sure if this is useful but this is a hacky implementation to add the filename and row / column numbers to the current Red Knot diagnostics.	2024-08-12 15:56:30 +00:00
Micha Reiser	a99a45868c	Eagerly validate search paths (#12783 ) Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2024-08-12 07:46:59 +00:00
Alex Waygood	cf1a57df5a	Remove `red_knot_python_semantic::python_version::TargetVersion` (#12790 )	2024-08-10 14:28:31 +01:00
Alex Waygood	37b9bac403	[red-knot] Add support for `--system-site-packages` virtual environments (#12759 )	2024-08-09 21:02:16 +01:00
Alex Waygood	c4e651921b	[red-knot] Move, rename and make public the `PyVersion` type (#12782 )	2024-08-09 16:49:17 +01:00
Micha Reiser	a176679b24	Log warnings when skipping editable installations (#12779 )	2024-08-09 16:29:43 +02:00
Micha Reiser	2abfab0f9b	Move Program and related structs to `red_knot_python_semantic` (#12777 )	2024-08-09 11:50:45 +02:00
Micha Reiser	ffaa35eafe	Add test helper to setup tracing (#12741 )	2024-08-09 07:04:04 +00:00
Carl Meyer	bc5b9b81dd	[red-knot] add dev dependency on ruff_db os feature from red_knot_pyt… (#12760 )	2024-08-08 18:10:30 +01:00
Alex Waygood	f1de08c2a0	[red-knot] Merge the semantic and module-resolver crates (#12751 )	2024-08-08 15:34:11 +01:00
Micha Reiser	dc6aafecc2	Setup tracing and document tracing usage (#12730 )	2024-08-08 06:28:40 +00:00
Micha Reiser	846f57fd15	Update salsa (#12711 )	2024-08-06 13:17:39 +00:00
Dhruv Manilawala	e91a0fe94a	[red-knot] Implement basic LSP server (#12624 ) ## Summary This PR adds basic LSP implementation for the Red Knot project. This is basically a fork of the existing `ruff_server` crate into a `red_knot_server` crate. The following are the main differences: 1. The `Session` stores a map from workspace root to the corresponding Red Knot database (`RootDatabase`). 2. The database is initialized with the newly implemented `LSPSystem` (implementation of `System` trait) 3. The `LSPSystem` contains the server index corresponding to each workspace and an underlying OS system implementation. For certain methods, the system first checks if there's an open document in LSP system and returns the information from that. Otherwise, it falls back to the OS system to get that information. These methods are `path_metadata`, `read_to_string` and `read_to_notebook` 4. Add `as_any_mut` method for `System` Why fork? Forking allows us to experiment with the functionalities that are specific to Red Knot. The architecture is completely different and so the requirements for an LSP implementation are different as well. For example, Red Knot only supports a single workspace, so the LSP system needs to map the multi-workspace support to each Red Knot instance. In the end, the server code isn't too big, it will be easier to implement Red Knot specific functionality without worrying about existing server limitations and it shouldn't be difficult to port the existing server. ## Review Most of the server files haven't been changed. I'm going to list the files that have been changed and highlight the specific part of each file that differs from the existing server code. 
Changed files: * Red Knot CLI implementation: https://github.com/astral-sh/ruff/pull/12624/files#diff-579596339a29d3212a641232e674778c339b446de33b890c7fdad905b5eb50e1 * In https://github.com/astral-sh/ruff/pull/12624/files#diff-b9a9041a8a2bace014bf3687c3ef0512f25e0541f112fad6131b14242f408db6, server capabilities have been updated, dynamic capability registration is removed * In https://github.com/astral-sh/ruff/pull/12624/files#diff-b9a9041a8a2bace014bf3687c3ef0512f25e0541f112fad6131b14242f408db6, the API for `clear_diagnostics` now take in a `Url` instead of `DocumentQuery` as the document version doesn't matter when clearing diagnostics after a document is closed * [`did_close`](https://github.com/astral-sh/ruff/pull/12624/files#diff-9271370102a6f3be8defaca40c82485b0048731942520b491a3bdd2ee0e25493), [`did_close_notebook`](https://github.com/astral-sh/ruff/pull/12624/files#diff-96fb53ffb12c1694356e17313e4bb37b3f0931e887878b5d7c896c19ff60283b), [`did_open`](https://github.com/astral-sh/ruff/pull/12624/files#diff-60e852cf1aa771e993131cabf98eb4c467963a8328f10eccdb43b3e8f0f1fb12), [`did_open_notebook`](https://github.com/astral-sh/ruff/pull/12624/files#diff-ac356eb5e36c3b2c1c135eda9dfbcab5c12574d1cb77c71f7da8dbcfcfb2d2f1) are updated to open / close file from the corresponding Red Knot workspace * The [diagnostic handler](https://github.com/astral-sh/ruff/pull/12624/files#diff-4475f318fd0290d0292834569a7df5699debdcc0a453b411b8c3d329f1b879d9) is updated to request diagnostics from Red Knot * The [`Session::new`] method in https://github.com/astral-sh/ruff/pull/12624/files#diff-55c96201296200c1cab37c8b0407b6c733381374b94be7ae50563bfe95264e4d is updated to construct the Red Knot databases for each workspace. 
It also contains the `index_mut` and `MutIndexGuard` implementation * And, `LSPSystem` implementation is in https://github.com/astral-sh/ruff/pull/12624/files#diff-4ed62bd359c43b0bf1a13f04349dcd954966934bb8d544de7813f974182b489e ## Test Plan First, configure VS Code to use the `red_knot` binary 1. Build the `red_knot` binary by `cargo build` 2. Update the VS Code extension to specify the path to this binary ```json { "ruff.path": ["/path/to/ruff/target/debug/red_knot"] } ``` 3. Restart VS Code Now, open a file containing red-knot specific diagnostics, close the file and validate that diagnostics disappear.	2024-08-06 11:27:30 +00:00
Dhruv Manilawala	5cc3fed9a8	[red-knot] Infer float and complex literal expressions (#12689 ) ## Summary This PR implements type inference for float and complex literal expressions. ## Test Plan Add test cases for both types.	2024-08-06 06:24:28 +00:00
Alex Waygood	5499821c67	[red-knot] Rename `workspace_root` variables in the module resolver to `src_root` (#12697 ) Fixes #12337	2024-08-05 23:07:18 +01:00
Carl Meyer	2393d19f91	[red-knot] infer instance types for builtins (#12695 ) Previously we wrongly inferred the type of the builtin type itself (e.g. `Literal[int]`); we need to infer the instance type instead.	2024-08-05 13:32:42 -07:00
Dhruv Manilawala	a8e2ba508e	[red-knot] Infer boolean literal expression (#12688 ) ## Summary This PR implements type inference for boolean literal expressions. ## Test Plan Add test cases for `True` and `False`.	2024-08-05 11:30:53 -07:00
Alex Waygood	fbab04fbe1	[red-knot] Allow multiple `site-packages` search paths (#12609 )	2024-08-02 13:33:19 +00:00
Carl Meyer	ee0518e8f7	[red-knot] implement attribute of union (#12601 ) I hit this `todo!` trying to run type inference over some real modules. Since it's a one-liner to implement it, I just did that rather than changing to `Type::Unknown`.	2024-07-31 19:45:24 -07:00
Micha Reiser	138e70bd5c	Upgrade to Rust 1.80 (#12586 )	2024-07-30 19:18:08 +00:00
Micha Reiser	e18b4e42d3	[red-knot] Upgrade to the new new salsa (#12406 )	2024-07-29 07:21:24 +00:00
Carl Meyer	4b69271809	[red-knot] resolve int/list/dict/set/tuple to builtin type (#12521 ) Now that we have builtins available, resolve some simple cases to the right builtin type. We should also adjust the display for types to include their module name; that's not done yet here.	2024-07-26 08:21:31 -07:00
Carl Meyer	2d3914296d	[red-knot] handle all syntax without panic (#12499 ) Extend red-knot type inference to cover all syntax, so that inferring types for a scope gives all expressions a type. This means we can run the red-knot semantic lint on all Python code without panics. It also means we can infer types for `builtins.pyi` without panics. To keep things simple, this PR intentionally doesn't add any new type inference capabilities: the expanded coverage is all achieved with `Type::Unknown`. But this puts the skeleton in place for adding better inference of all these language features. I also had to add basic Salsa cycle recovery (with just `Type::Unknown` for now), because some `builtins.pyi` definitions are cyclic. To test this, I added a comprehensive corpus of test snippets sourced from Cinder under [MIT license](https://github.com/facebookincubator/cinder/blob/cinder/3.10/cinderx/LICENSE), which matches Ruff's license. I also added to this corpus some additional snippets for newer language features: all the `27_func_generic_` and `73_class_generic_` files, as well as `20_lambda_default_arg.py`, and added a test which runs semantic-lint over all these files. (The test doesn't assert the test-corpus files are lint-free; just that they are able to lint without a panic.)	2024-07-25 17:38:08 -07:00
Micha Reiser	eac965ecaf	[red-knot] Watch search paths (#12407 )	2024-07-24 07:38:50 +00:00
Micha Reiser	40d9324f5a	[red-knot] Improved file watching (#12382 )	2024-07-23 08:18:59 +02:00
Carl Meyer	c7b13bb8fc	[red-knot] add cycle-free while-loop control flow (#12413 ) Add support for while-loop control flow. This doesn't yet include general support for terminals and reachability; that is wider than just while loops and belongs in its own PR. This also doesn't yet add support for cyclic definitions in loops; that comes with enough of its own complexity in Salsa that I want to handle it separately.	2024-07-22 14:27:33 -07:00
Carl Meyer	f22c8ab811	[red-knot] add maybe-undefined lint rule (#12414 ) Add a lint rule to detect if a name is definitely or possibly undefined at a given usage. If I create the file `undef/main.py` with contents: ```python x = int def foo(): z return x if flag: y = x y ``` And then run `cargo run --bin red_knot -- --current-directory ../ruff-examples/undef`, I get the output: ``` Name 'z' used when not defined. Name 'flag' used when not defined. Name 'y' used when possibly not defined. ``` If I modify the file to add `y = 0` at the top, red-knot re-checks it and I get the new output: ``` Name 'z' used when not defined. Name 'flag' used when not defined. ``` Note that `int` is not flagged, since it's a builtin, and `return x` in the function scope is not flagged, since it refers to the global `x`.	2024-07-22 13:53:59 -07:00
Alex Waygood	d8cf8ac2ef	[red-knot] Resolve symbols from `builtins.pyi` in the stdlib if they cannot be found in other scopes (#12390 ) Co-authored-by: Carl Meyer <carl@astral.sh>	2024-07-19 17:44:56 +01:00
Carl Meyer	f82bb67555	[red-knot] trace file when inferring types (#12401 ) When poring over traces, the ones that just include a definition or symbol or expression ID aren't very useful, because you don't know which file it comes from. This adds that information to the trace. I guess the downside here is that if calling `.file(db)` on a scope/definition/expression would execute other traced code, it would be marked as outside the span? I don't think that's a concern, because I don't think a simple field access on a tracked struct should ever execute our code. If I'm wrong and this is a problem, it seems like the tracing crate has this feature where you can record a field as `tracing::field::Empty` and then fill in its value later with `span.record(...)`, but when I tried this it wasn't working for me, not sure why. I think there's a lot more we can do to make our tracing output more useful for debugging (e.g. record an event whenever a definition/symbol/expression/use id is created with the details of that definition/symbol/expression/use), this is just dipping my toes in the water.	2024-07-19 07:13:51 -07:00
Carl Meyer	181e7b3c0d	[red-knot] rename module_global to global (#12385 ) Per comments in https://github.com/astral-sh/ruff/pull/12269, "module global" is kind of long, and arguably redundant. I tried just using "module" but there were too many cases where I felt this was ambiguous. I like the way "global" works out better, though it does require an understanding that in Python "global" generally means "module global" not "globally global" (though in a sense module globals are also globally global since modules are singletons).	2024-07-18 13:05:30 -07:00
Carl Meyer	519eca9fe7	[red-knot] support implicit global name lookups (#12374 ) Support falling back to a global name lookup if a name isn't defined in the local scope, in the cases where that is correct according to Python semantics. In class scopes, a name lookup checks the local namespace first, and if the name isn't found there, looks it up in globals. In function scopes (and type parameter scopes, which are function-like), if a name has any definitions in the local scope, it is a local, and accessing it when none of those definitions have executed yet just results in an `UnboundLocalError`, it does not fall back to a global. If the name does not have any definitions in the local scope, then it is an implicit global. Public symbol type lookups never include such a fall back. For example, if a name is not defined in a class scope, it is not available as a member on that class, even if a name lookup within the class scope would have fallen back to a global lookup. This PR makes the `@override` lint rule work again. Not yet included/supported in this PR: * Support for free variables / closures: a free symbol in a nested function-like scope referring to a symbol in an outer function-like scope. * Support for `global` and `nonlocal` statements, which force a symbol to be treated as global or nonlocal even if it has definitions in the local scope. * Module-global lookups should fall back to builtins if the name isn't found in the module scope. I would like to expose nicer APIs for the various kinds of symbols (explicit global, implicit global, free, etc), but this will also wait for a later PR, when more kinds of symbols are supported.	2024-07-18 10:50:43 -07:00
Carl Meyer	811f78d94d	[red-knot] small efficiency improvements and bugfixes to use-def map building (#12373 ) Adds inference tests sufficient to give full test coverage of the `UseDefMapBuilder::merge` method. In the process I realized that we could implement visiting of if statements in `SemanticBuilder` with fewer `snapshot`, `restore`, and `merge` operations, so I restructured that visit a bit. I also found one correctness bug in the `merge` method (it failed to extend the given snapshot with "unbound" for any missing symbols, meaning we would just lose the fact that the symbol could be unbound in the merged-in path), and two efficiency bugs (if one of the ranges to merge is empty, we can just use the other one, no need for copies, and if the ranges are overlapping -- which can occur with nested branches -- we can still just merge them with no copies), and fixed all three.	2024-07-18 09:24:58 -07:00
Carl Meyer	b2a49d8140	[red-knot] better docs for use-def maps (#12357 ) Add better doc comments and comments, as well as one debug assertion, to use-def map building.	2024-07-17 17:50:58 -07:00
Carl Meyer	985a999234	[red-knot] better docs for type inference (#12356 ) Add some docs for how type inference works. Also a couple minor code changes to rearrange or rename for better clarity.	2024-07-17 13:36:58 -07:00
Micha Reiser	91338ae902	[red-knot] Add basic workspace support (#12318 )	2024-07-17 11:34:21 +02:00
Carl Meyer	073588b48e	[red-knot] improve semantic index tests (#12355 ) Improve semantic index tests with better assertions than just `.len()`, and re-add use-definition test that was commented out in the switch to Salsa initially.	2024-07-16 23:46:49 -07:00
Carl Meyer	595b1aa4a1	[red-knot] per-definition inference, use-def maps (#12269 ) Implements definition-level type inference, with basic control flow (only if statements and if expressions so far) in Salsa. There are a couple key ideas here: 1) We can do type inference queries at any of three region granularities: an entire scope, a single definition, or a single expression. These are represented by the `InferenceRegion` enum, and the entry points are the salsa queries `infer_scope_types`, `infer_definition_types`, and `infer_expression_types`. Generally per-scope will be used for scopes that we are directly checking and per-definition will be used anytime we are looking up symbol types from another module/scope. Per-expression should be uncommon: used only for the RHS of an unpacking or multi-target assignment (to avoid re-inferring the RHS once per symbol defined in the assignment) and for test nodes in type narrowing (e.g. the `test` of an `If` node). All three queries return a `TypeInference` with a map of types for all definitions and expressions within their region. If you do e.g. scope-level inference, when it hits a definition, or an independently-inferable expression, it should use the relevant query (which may already be cached) to get all types within the smaller region. This avoids double-inferring smaller regions, even though larger regions encompass smaller ones. 2) Instead of building a control-flow graph and lazily traversing it to find definitions which reach a use of a name (which is O(n^2) in the worst case), instead semantic indexing builds a use-def map, where every use of a name knows which definitions can reach that use. We also no longer track all definitions of a symbol in the symbol itself; instead the use-def map also records which defs remain visible at the end of the scope, and considers these the publicly-visible definitions of the symbol (see below). 
Major items left as TODOs in this PR, to be done in follow-up PRs: 1) Free/global references aren't supported yet (only lookup based on definitions in current scope), which means the override-check example doesn't currently work. This is the first thing I'll fix as follow-up to this PR. 2) Control flow outside of if statements and expressions. 3) Type narrowing. There are also some smaller relevant changes here: 1) Eliminate `Option` in the return type of member lookups; instead always return `Type::Unbound` for a name we can't find. Also use `Type::Unbound` for modules we can't resolve (not 100% sure about this one yet.) 2) Eliminate the use of the terms "public" and "root" to refer to module-global scope or symbols. Instead consistently use the term "module-global". It's longer, but it's the clearest, and the most consistent with typical Python terminology. In particular I don't like "public" for this use because it has other implications around author intent (is an underscore-prefixed module-global symbol "public"?). And "root" is just not commonly used for this in Python. 3) Eliminate the `PublicSymbol` Salsa ingredient. Many non-module-global symbols can also be seen from other scopes (e.g. by a free var in a nested scope, or by class attribute access), and thus need to have a "public type" (that is, the type not as seen from a particular use in the control flow of the same scope, but the type as seen from some other scope.) So all symbols need to have a "public type" (here I want to keep the use of the term "public", unless someone has a better term to suggest -- since it's "public type of a symbol" and not "public symbol" the confusion with e.g. initial underscores is less of an issue.) At least initially, I would like to try not having special handling for module-global symbols vs other symbols. 4) Switch to using "definitions that reach end of scope" rather than "all definitions" in determining the public type of a symbol. 
I'm convinced that in general this is the right way to go. We may want to refine this further in future for some free-variable cases, but it can be changed purely by making changes to the building of the use-def map (the `public_definitions` index in it), without affecting any other code. One consequence of combining this with no control-flow support (just last-definition-wins) is that some inference tests now give more wrong-looking results; I left TODO comments on these tests to fix them when control flow is added. And some potential areas for consideration in the future: 1) Should `symbol_ty` be a Salsa query? This would require making all symbols a Salsa ingredient, and tracking even more dependencies. But it would save some repeated reconstruction of unions, for symbols with multiple public definitions. For now I'm not making it a query, but open to changing this in future with actual perf evidence that it's better.	2024-07-16 11:02:30 -07:00
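The use-def map idea in the commit message above (#12269) can be illustrated with a small sketch. This is not red-knot's Rust implementation: the `Assign`, `Use`, and `If` node classes, the `build_use_def_map` function, and the `UNBOUND` marker are hypothetical names invented for this example. It assumes a scope made only of assignments, name uses, and if/else blocks, and shows how a single pass can record, for each use, the definitions that reach it, plus the definitions still visible at the end of the scope (the "public" definitions).

```python
from dataclasses import dataclass, field

UNBOUND = "<unbound>"  # hypothetical marker: the name may have no definition here


@dataclass
class Assign:
    name: str
    def_id: str  # label identifying this definition


@dataclass
class Use:
    name: str
    use_id: str  # label identifying this use


@dataclass
class If:
    body: list
    orelse: list = field(default_factory=list)


def merge(a, b):
    """Join the live definitions of two branches that flow together."""
    unbound = frozenset({UNBOUND})
    return {
        name: a.get(name, unbound) | b.get(name, unbound)
        for name in a.keys() | b.keys()
    }


def build_use_def_map(stmts):
    """Return (use_defs, public_defs) for one scope in a single pass."""
    use_defs = {}

    def walk(block, live):
        # `live` maps each name to the set of definitions that can reach
        # the current program point.
        for stmt in block:
            if isinstance(stmt, Assign):
                # A new definition shadows every earlier one on this path.
                live = {**live, stmt.name: frozenset({stmt.def_id})}
            elif isinstance(stmt, Use):
                use_defs[stmt.use_id] = live.get(stmt.name, frozenset({UNBOUND}))
            elif isinstance(stmt, If):
                # Each branch starts from the same state; merge afterwards.
                live = merge(walk(stmt.body, live), walk(stmt.orelse, live))
        return live

    # Whatever is live at the end of the scope is "publicly visible".
    public_defs = walk(stmts, {})
    return use_defs, public_defs


# x = 1            (d1)
# if flag:
#     x = "a"      (d2)
#     use x        (u1)
#     y = 2        (d3)
# use x            (u2)
scope = [
    Assign("x", "d1"),
    If(body=[Assign("x", "d2"), Use("x", "u1"), Assign("y", "d3")]),
    Use("x", "u2"),
]
use_defs, public_defs = build_use_def_map(scope)
print(sorted(use_defs["u1"]))    # definitions reaching the use inside the branch
print(sorted(use_defs["u2"]))    # definitions reaching the use after the branch
print(sorted(public_defs["y"]))  # y is only defined on one path
```

In this sketch the use inside the branch sees only `d2`, the use after the branch sees both `d1` and `d2`, and `y` keeps the unbound marker alongside `d3` because one path never defines it. A type checker built this way would form the union of the types of the reaching definitions rather than walking a control-flow graph on every lookup.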
Alex Waygood	5b21922420	[red-knot] Add more stress tests for module resolver invalidation (#12272 )	2024-07-10 14:34:06 +00:00
Micha Reiser	ac04380f36	[red-knot] Rename `FileSystem` to `System` (#12214 )	2024-07-09 07:20:51 +00:00
Alex Waygood	a62a432a48	[red-knot] Respect typeshed's `VERSIONS` file when resolving stdlib modules (#12141 )	2024-07-05 22:43:31 +00:00
Carl Meyer	0e44235981	[red-knot] intern types using Salsa (#12061 ) Intern types using Salsa interning instead of in the `TypeInference` result. This eliminates the need for `TypingContext`, and also paves the way for finer-grained type inference queries.	2024-07-05 12:16:37 -07:00
Micha Reiser	4d385b60c8	[red-knot] Migrate CLI to Salsa (#11972 )	2024-07-04 07:23:45 +00:00
Micha Reiser	262053f85c	[red-knot]: Implement `HasTy` for `Alias` (#11971 )	2024-07-04 07:17:10 +00:00
Micha Reiser	3ce8b9fcae	Make `Definition` a salsa-ingredient (#12151 )	2024-07-04 06:46:08 +00:00
Micha Reiser	dcb9523b1e	Address review feedback from 11963 (#12145 )	2024-07-02 09:05:55 +02:00
Micha Reiser	25080acb7a	[red-knot] Introduce `ExpressionNodeKey` to improve typing of `expression_map` (#12142 )	2024-07-01 16:15:53 +02:00
Micha Reiser	228b1c4235	[red-knot] Remove `Scope::name` (#12137 )	2024-07-01 15:55:50 +02:00
Micha Reiser	955138b74a	Refactor `ast_ids` traits to take `ScopeId` instead of `VfsFile` plus `FileScopeId`. (#12139 )	2024-07-01 15:50:07 +02:00
Micha Reiser	37f260b5af	Introduce `HasTy` trait and `SemanticModel` facade (#11963 )	2024-07-01 14:48:27 +02:00
Micha Reiser	5109b50bb3	Use `CompactString` for `Identifier` (#12101 )	2024-07-01 10:06:02 +02:00
Alex Waygood	736a4ead14	[red-knot] Move module-resolution logic to its own crate (#11964 )	2024-06-21 13:25:44 +00:00
Micha Reiser	927069c12f	[red-knot] Upgrade to Salsa 3.0 (#11952 )	2024-06-20 20:19:16 +01:00
Micha Reiser	b456051be8	[red-knot] Add tracing to Salsa queries (#11949 )	2024-06-20 13:33:41 +02:00
Micha Reiser	2dfbf118d7	[red-knot] Extract `red_knot_python_semantic` crate (#11926 )	2024-06-20 13:24:24 +02:00

856 Commits