Python/ruff - ruff - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Micha Reiser	2a5ace6e55	[ty] Implement diagnostic caching (#19605 )	2025-07-30 11:04:34 +01:00
Brent Westbrook	a54061e757	[ty] Fix empty spans following a line terminator and unprintable character spans in diagnostics (#19535 ) ## Summary This was previously the last commit in #19415, split out to make it easier to review. This applies the fixes from `c9b99e4`, `5021f32`, and `2922490cb8` to the new rendering code in `ruff_db`. I initially intended only to fix the empty span after a line terminator (as you can see in the branch name), but the two fixes were tied pretty closely together, and my initial fix for the empty spans needed a big change after trying to handle unprintable characters too. I can still split this up if it would help with review. I would just start with the unprintable characters first. The implementation here is essentially copy-pasted from `ruff_linter::message::text.rs`, with the `SourceCode` struct renamed to `EscapedSourceCode` since there's already a `SourceCode` in scope in `render.rs`. It's also updated slightly to account for the multiple annotations for a single snippet. The original implementation used some types from the `line_width` module from `ruff_linter`. I copied over heavily stripped-down versions of these instead of trying to import them. We could inline the remaining code entirely, if we want, but I thought it was nice enough to keep. I also moved over `ceil_char_boundary`, which is unchanged except to make it a free function taking a `&str` instead of a `Locator` method. All of this code could be deleted from `ruff_linter` if we also move over the `grouped` output format, which will be the last user after #19415. ## Test Plan I added new tests in `ruff_linter` that call into the new rendering code to snapshot the diagnostics for the affected cases. These are copies of existing snapshots in Ruff, so it's helpful to compare them. These are a bit noisy because of the other rendering differences in the header, but all of the `^^^` indicators should be the same. <details><summary>`empty_span_after_line_terminator` diff</summary> ```diff diff --git a/crates/ruff_linter/src/rules/pycodestyle/snapshots/ruff_linter__rules__pycodestyle__tests__E112_E11.py.snap b/crates/ruff_linter/src/message/snapshots/ruff_linter__message__text__tests__empty_span_after_line_terminator.snap index 5ade4346e0..6df75c16f0 100644 --- a/crates/ruff_linter/src/rules/pycodestyle/snapshots/ruff_linter__rules__pycodestyle__tests__E112_E11.py.snap +++ b/crates/ruff_linter/src/message/snapshots/ruff_linter__message__text__tests__empty_span_after_line_terminator.snap @@ -1,17 +1,20 @@ --- -source: crates/ruff_linter/src/rules/pycodestyle/mod.rs +source: crates/ruff_linter/src/message/text.rs +expression: value.to_string() --- -E11.py:9:1: E112 Expected an indented block +error[no-indented-block]: Expected an indented block + --> E11.py:9:1 \| 7 \| #: E112 8 \| if False: 9 \| print() - \| ^ E112 + \| ^ 10 \| #: E113 11 \| print() \| -E11.py:9:1: SyntaxError: Expected an indented block after `if` statement +error[invalid-syntax]: SyntaxError: Expected an indented block after `if` statement + --> E11.py:9:1 \| 7 \| #: E112 8 \| if False: @@ -21,7 +24,8 @@ E11.py:9:1: SyntaxError: Expected an indented block after `if` statement 11 \| print() \| -E11.py:12:1: SyntaxError: Unexpected indentation +error[invalid-syntax]: SyntaxError: Unexpected indentation + --> E11.py:12:1 \| 10 \| #: E113 11 \| print() @@ -31,7 +35,8 @@ E11.py:12:1: SyntaxError: Unexpected indentation 14 \| mimetype = 'application/x-directory' \| -E11.py:14:1: SyntaxError: Expected a statement +error[invalid-syntax]: SyntaxError: Expected a statement + --> E11.py:14:1 \| 12 \| print() 13 \| #: E114 E116 @@ -41,17 +46,19 @@ E11.py:14:1: SyntaxError: Expected a statement 16 \| create_date = False \| -E11.py:45:1: E112 Expected an indented block +error[no-indented-block]: Expected an indented block + --> E11.py:45:1 \| 43 \| #: E112 44 \| if False: # 45 \| print() - \| ^ E112 + \| ^ 46 \| #: 47 \| if False: \| -E11.py:45:1: SyntaxError: Expected an indented block after `if` statement +error[invalid-syntax]: SyntaxError: Expected an indented block after `if` statement + --> E11.py:45:1 \| 43 \| #: E112 44 \| if False: # ``` </details> <details><summary>`unprintable_characters` diff</summary> ```diff diff --git a/crates/ruff_linter/src/rules/pylint/snapshots/ruff_linter__rules__pylint__tests__PLE2512_invalid_characters.py.snap b/crates/ruff_linter/src/message/snapshots/ruff_linter__message__text__tests__unprintable_characters.snap index 52cfdf9cce..fcfa1ac9f1 100644 --- a/crates/ruff_linter/src/rules/pylint/snapshots/ruff_linter__rules__pylint__tests__PLE2512_invalid_characters.py.snap +++ b/crates/ruff_linter/src/message/snapshots/ruff_linter__message__text__tests__unprintable_characters.snap @@ -1,161 +1,115 @@ --- -source: crates/ruff_linter/src/rules/pylint/mod.rs +source: crates/ruff_linter/src/message/text.rs +expression: value.to_string() --- -invalid_characters.py:24:12: PLE2512 [] Invalid unescaped character SUB, use "\x1A" instead +error[invalid-character-sub]: Invalid unescaped character SUB, use "\x1A" instead + --> invalid_characters.py:24:12 \| 22 \| cr_ok = f'\\r' 23 \| 24 \| sub = 'sub ' - \| ^ PLE2512 + \| ^ 25 \| sub = f'sub ' \| - = help: Replace with escape sequence +help: Replace with escape sequence -ℹ Safe fix -21 21 \| cr_ok = '\\r' -22 22 \| cr_ok = f'\\r' -23 23 \| -24 \|-sub = 'sub ' - 24 \|+sub = 'sub \x1A' -25 25 \| sub = f'sub ' -26 26 \| -27 27 \| sub_ok = '\x1a' - -invalid_characters.py:25:13: PLE2512 [] Invalid unescaped character SUB, use "\x1A" instead +error[invalid-character-sub]: Invalid unescaped character SUB, use "\x1A" instead + --> invalid_characters.py:25:13 \| 24 \| sub = 'sub ' 25 \| sub = f'sub ' - \| ^ PLE2512 + \| ^ 26 \| 27 \| sub_ok = '\x1a' \| - = help: Replace with escape sequence - -ℹ Safe fix -22 22 \| cr_ok = f'\\r' -23 23 \| -24 24 \| sub = 'sub ' -25 \|-sub = f'sub ' - 25 \|+sub = f'sub \x1A' -26 26 \| -27 27 \| sub_ok = '\x1a' -28 28 \| sub_ok = f'\x1a' +help: Replace with escape sequence -invalid_characters.py:55:25: PLE2512 [] Invalid unescaped character SUB, use "\x1A" instead +error[invalid-character-sub]: Invalid unescaped character SUB, use "\x1A" instead + --> invalid_characters.py:55:25 \| 53 \| zwsp_after_multicharacter_grapheme_cluster = f"ಫ್ರಾನ್ಸಿಸ್ಕೊ " 54 \| 55 \| nested_fstrings = f'␈{f'{f'␛'}'}' - \| ^ PLE2512 + \| ^ 56 \| 57 \| # https://github.com/astral-sh/ruff/issues/7455#issuecomment-1741998106 \| - = help: Replace with escape sequence - -ℹ Safe fix -52 52 \| zwsp_after_multicharacter_grapheme_cluster = "ಫ್ರಾನ್ಸಿಸ್ಕೊ " -53 53 \| zwsp_after_multicharacter_grapheme_cluster = f"ಫ್ರಾನ್ಸಿಸ್ಕೊ " -54 54 \| -55 \|-nested_fstrings = f'␈{f'{f'␛'}'}' - 55 \|+nested_fstrings = f'␈{f'\x1A{f'␛'}'}' -56 56 \| -57 57 \| # https://github.com/astral-sh/ruff/issues/7455#issuecomment-1741998106 -58 58 \| x = f"""}}ab""" +help: Replace with escape sequence -invalid_characters.py:58:12: PLE2512 [] Invalid unescaped character SUB, use "\x1A" instead +error[invalid-character-sub]: Invalid unescaped character SUB, use "\x1A" instead + --> invalid_characters.py:58:12 \| 57 \| # https://github.com/astral-sh/ruff/issues/7455#issuecomment-1741998106 58 \| x = f"""}}ab""" - \| ^ PLE2512 + \| ^ 59 \| # https://github.com/astral-sh/ruff/issues/7455#issuecomment-1741998256 60 \| x = f"""}}a␛b""" \| - = help: Replace with escape sequence +help: Replace with escape sequence -ℹ Safe fix -55 55 \| nested_fstrings = f'␈{f'{f'␛'}'}' -56 56 \| -57 57 \| # https://github.com/astral-sh/ruff/issues/7455#issuecomment-1741998106 -58 \|-x = f"""}}ab""" - 58 \|+x = f"""}}a\x1Ab""" -59 59 \| # https://github.com/astral-sh/ruff/issues/7455#issuecomment-1741998256 -60 60 \| x = f"""}}a␛b""" -61 61 \| - -invalid_characters.py:64:12: PLE2512 Invalid unescaped character SUB, use "\x1A" instead +error[invalid-character-sub]: Invalid unescaped character SUB, use "\x1A" instead + --> invalid_characters.py:64:12 \| 63 \| # https://github.com/astral-sh/ruff/issues/13294 64 \| print(r"""␈␛� - \| ^ PLE2512 + \| ^ 65 \| """) 66 \| print(fr"""␈␛� \| - = help: Replace with escape sequence +help: Replace with escape sequence -invalid_characters.py:66:13: PLE2512 Invalid unescaped character SUB, use "\x1A" instead +error[invalid-character-sub]: Invalid unescaped character SUB, use "\x1A" instead + --> invalid_characters.py:66:13 \| 64 \| print(r"""␈␛� 65 \| """) 66 \| print(fr"""␈␛� - \| ^ PLE2512 + \| ^ 67 \| """) 68 \| print(Rf"""␈␛� \| - = help: Replace with escape sequence +help: Replace with escape sequence -invalid_characters.py:68:13: PLE2512 Invalid unescaped character SUB, use "\x1A" instead +error[invalid-character-sub]: Invalid unescaped character SUB, use "\x1A" instead + --> invalid_characters.py:68:13 \| 66 \| print(fr"""␈␛� 67 \| """) 68 \| print(Rf"""␈␛� - \| ^ PLE2512 + \| ^ 69 \| """) \| - = help: Replace with escape sequence +help: Replace with escape sequence -invalid_characters.py:73:9: PLE2512 Invalid unescaped character SUB, use "\x1A" instead +error[invalid-character-sub]: Invalid unescaped character SUB, use "\x1A" instead + --> invalid_characters.py:73:9 \| 71 \| # https://github.com/astral-sh/ruff/issues/18815 72 \| b = "\␈" 73 \| sub = "\" - \| ^ PLE2512 + \| ^ 74 \| esc = "\␛" 75 \| zwsp = "\" \| - = help: Replace with escape sequence +help: Replace with escape sequence -invalid_characters.py:80:25: PLE2512 [] Invalid unescaped character SUB, use "\x1A" instead +error[invalid-character-sub]: Invalid unescaped character SUB, use "\x1A" instead + --> invalid_characters.py:80:25 \| 78 \| # tstrings 79 \| esc = t'esc esc ␛' 80 \| nested_tstrings = t'␈{t'{t'␛'}'}' - \| ^ PLE2512 + \| ^ 81 \| nested_ftstrings = t'␈{f'{t'␛'}'}' \| - = help: Replace with escape sequence - -ℹ Safe fix -77 77 \| -78 78 \| # tstrings -79 79 \| esc = t'esc esc ␛' -80 \|-nested_tstrings = t'␈{t'{t'␛'}'}' - 80 \|+nested_tstrings = t'␈{t'\x1A{t'␛'}'}' -81 81 \| nested_ftstrings = t'␈{f'{t'␛'}'}' -82 82 \| +help: Replace with escape sequence -invalid_characters.py:81:26: PLE2512 [] Invalid unescaped character SUB, use "\x1A" instead +error[invalid-character-sub]: Invalid unescaped character SUB, use "\x1A" instead + --> invalid_characters.py:81:26 \| 79 \| esc = t'esc esc ␛' 80 \| nested_tstrings = t'␈{t'{t'␛'}'}' 81 \| nested_ftstrings = t'␈{f'{t'␛'}'}' - \| ^ PLE2512 + \| ^ \| - = help: Replace with escape sequence - -ℹ Safe fix -78 78 \| # tstrings -79 79 \| esc = t'esc esc ␛' -80 80 \| nested_tstrings = t'␈{t'{t'␛'}'}' -81 \|-nested_ftstrings = t'␈{f'{t'␛'}'}' - 81 \|+nested_ftstrings = t'␈{f'\x1A{t'␛'}'}' -82 82 \| +help: Replace with escape sequence ``` </details>	2025-07-29 08:25:58 -04:00
Brent Westbrook	4daf59e5e7	Move concise diagnostic rendering to `ruff_db` (#19398 ) ## Summary This PR moves most of the work of rendering concise diagnostics in Ruff into `ruff_db`, where the code is shared with ty. To accomplish this without breaking backwards compatibility in Ruff, there are two main changes on the `ruff_db`/ty side: - Added the logic from Ruff for remapping notebook line numbers to cells - Reordered the fields in the diagnostic to match Ruff and rustc ```text # old error[invalid-assignment] try.py:3:1: Object of type `Literal[1]` is not assignable to `str` # new try.py:3:1: error[invalid-assignment]: Object of type `Literal[1]` is not assignable to `str` ``` I don't think the notebook change failed any tests on its own, and only a handful of snaphots changed in ty after reordering the fields, but this will obviously affect any other uses of the concise format, outside of tests, too. The other big change should only affect Ruff: - Added three new `DisplayDiagnosticConfig` options Micha and I hoped that we could get by with one option (`hide_severity`), but Ruff also toggles `show_fix_status` itself, independently (there are cases where we want neither severity nor the fix status), and during the implementation I realized we also needed access to an `Applicability`. The main goal here is to suppress the severity (`error` above) because ruff only uses the `error` severity and to use the secondary/noqa code instead of the line name (`invalid-assignment` above). ```text # ty - same as "new" above try.py:3:1: error[invalid-assignment]: Object of type `Literal[1]` is not assignable to `str` # ruff try.py:3:1: RUF123 [*] Object of type `Literal[1]` is not assignable to `str` ``` This part of the concise diagnostic is actually shared with the `full` output format in Ruff, but with the settings above, there are no snapshot changes to either format. ## Test Plan Existing tests with the handful of updates mentioned above, as well as some new tests in the `concise` module. Also this PR. Swapping the fields might have broken mypy_primer, unless it occasionally times out on its own. I also ran this script in the root of my Ruff checkout, which also has CPython in it: ```shell flags=(--isolated --no-cache --no-respect-gitignore --output-format concise .) diff <(target/release/ruff check ${flags[@]} 2> /dev/null) \ <(ruff check ${flags[@]} 2> /dev/null) ``` This yielded an expected diff due to some t-string error changes on main since 0.12.4: ```diff 33622c33622 < crates/ruff_python_parser/resources/inline/err/f_string_lambda_without_parentheses.py:1:15: SyntaxError: Expected an element of or the end of the f-string --- > crates/ruff_python_parser/resources/inline/err/f_string_lambda_without_parentheses.py:1:15: SyntaxError: Expected an f-string or t-string element or the end of the f-string or t-string 33742c33742 < crates/ruff_python_parser/resources/inline/err/implicitly_concatenated_unterminated_string_multiline.py:4:1: SyntaxError: Expected an element of or the end of the f-string --- > crates/ruff_python_parser/resources/inline/err/implicitly_concatenated_unterminated_string_multiline.py:4:1: SyntaxError: Expected an f-string or t-string element or the end of the f-string or t-string 34131c34131 < crates/ruff_python_parser/resources/inline/err/t_string_lambda_without_parentheses.py:2:15: SyntaxError: Expected an element of or the end of the t-string --- > crates/ruff_python_parser/resources/inline/err/t_string_lambda_without_parentheses.py:2:15: SyntaxError: Expected an f-string or t-string element or the end of the f-string or t-string ``` So modulo color, the results are identical on 38,186 errors in our test suite and CPython 3.10. --------- Co-authored-by: David Peter <mail@david-peter.de>	2025-07-23 11:43:32 -04:00
Brent Westbrook	fd335eb8b7	Move fix suggestion to subdiagnostic (#19464 ) Summary -- This PR tweaks Ruff's internal usage of the new diagnostic model to more closely match the intended use, as I understand it. Specifically, it moves the fix/help suggestion from the primary annotation's message to a subdiagnostic. In turn, it adds the secondary/noqa code as the new primary annotation message. As shown in the new `ruff_db` tests, this more closely mirrors Ruff's current diagnostic output. I also added `Severity::Help` to render the fix suggestion with a `help:` prefix instead of `info:`. These changes don't have any external impact now but should help a bit with #19415. Test Plan -- New full output format tests in `ruff_db` Rendered Diagnostics -- Full diagnostic output from `annotate-snippets` in this PR: ``` error[unused-import]: `os` imported but unused --> fib.py:1:8 \| 1 \| import os \| ^^ \| help: Remove unused import: `os` ``` Current Ruff output for the same code: ``` fib.py:1:8: F401 [] `os` imported but unused \| 1 \| import os \| ^^ F401 \| = help: Remove unused import: `os` ``` Proposed final output after #19415: ``` F401 [] `os` imported but unused --> fib.py:1:8 \| 1 \| import os \| ^^ \| help: Remove unused import: `os` ``` These are slightly updated from https://github.com/astral-sh/ruff/pull/19464#issuecomment-3097377634 below to remove the extra noqa codes in the primary annotation messages for the first and third cases.	2025-07-22 10:03:58 -04:00
Micha Reiser	5e29278aa2	[ty] Reduce size of `TypeInference` (#19435 )	2025-07-22 11:36:36 +02:00
Andrew Gallant	64f9481fd0	[ty] Add caching for submodule completion suggestions (#19408 ) This change makes it so we aren't doing a directory traversal every time we ask for completions from a module. Specifically, submodules that aren't attributes of their parent module can only be discovered by looking at the directory tree. But we want to avoid doing a directory scan unless we think there are changes. To make this work, this change does a little bit of surgery to `FileRoot`. Previously, a `FileRoot` was only used for library search paths. Its revision was bumped whenever a file in that tree was added, deleted or even modified (to support the discovery of `pth` files and changes to its contents). This generally seems fine since these are presumably dependency paths that shouldn't change frequently. In this change, we add a `FileRoot` for the project. But having the `FileRoot`'s revision bumped for every change in the project makes caching based on that `FileRoot` rather ineffective. That is, cache invalidation will occur too aggressively. To the point that there is little point in adding caching in the first place. To mitigate this, a `FileRoot`'s revision is only bumped on a change to a child file's contents when the `FileRoot` is a `LibrarySearchPath`. Otherwise, we only bump the revision when a file is created or added. The effect is that, at least in VS Code, when a new module is added or removed, this change is picked up and the cache is properly invalidated. Other LSP clients with worse support for file watching (which seems to be the case for the CoC vim plugin that I use) don't work as well. Here, the cache is less likely to be invalidated which might cause completions to have stale results. Unless there's an obvious way to fix or improve this, I propose punting on improvements here for now.	2025-07-18 11:54:27 -04:00
Dhruv Manilawala	99d0ac60b4	[ty] Track open files in the server (#19264 ) ## Summary This PR updates the server to keep track of open files both system and virtual files. This is done by updating the project by adding the file in the open file set in `didOpen` notification and removing it in `didClose` notification. This does mean that for workspace diagnostics, ty will only check open files because the behavior of different diagnostic builder is to first check `is_file_open` and only add diagnostics for open files. So, this required updating the `is_file_open` model to be `should_check_file` model which validates whether the file needs to be checked based on the `CheckMode`. If the check mode is open files only then it will check whether the file is open. If it's all files then it'll return `true` by default. Closes: astral-sh/ty#619 ## Test Plan ### Before There are two files in the project: `__init__.py` and `diagnostics.py`. In the video, I'm demonstrating the old behavior where making changes to the (open) `diagnostics.py` file results in re-parsing the file: https://github.com/user-attachments/assets/c2ac0ecd-9c77-42af-a924-c3744b146045 ### After Same setup as above. In the video, I'm demonstrating the new behavior where making changes to the (open) `diagnostics.py` file doesn't result in re-parting the file: https://github.com/user-attachments/assets/7b82fe92-f330-44c7-b527-c841c4545f8f	2025-07-18 19:33:35 +05:30
Andrew Gallant	ba7ed3a6f9	[ty] Use `…` as the "cut" indicator in diagnostic rendering (#19420 ) This makes ty match ruff's behavior. Specifically, we want to use `…` instead of the default `...` because `...` has special significance in Python.	2025-07-18 07:46:48 -04:00
Brent Westbrook	997dc2e7cc	Move JUnit rendering to `ruff_db` (#19370 ) Summary -- This PR moves the JUnit output format to the new rendering infrastructure. As I mention in a TODO in the code, there's some code that will be shared with the `grouped` output format. Hopefully I'll have that PR up too by the time this one is reviewed. Test Plan -- Existing tests moved to `ruff_db` --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2025-07-17 18:24:13 -04:00
Brent Westbrook	e9cac3684a	Move Pylint rendering to `ruff_db` (#19340 ) Summary -- This is a very simple output format, the only decision is what to do if the file is missing from the diagnostic. For now, I opted to `unwrap_or_default` both the path and the `OneIndexed` row number, giving `:1: main diagnostic message` in the test without a file. Another quirk here is that the path is relativized. I just pasted in the `relativize_path` and `get_cwd` implementations from `ruff_linter::fs` for now, but maybe there's a better place for them. I didn't see any details about why this needs to be relativized in the original [issue](https://github.com/astral-sh/ruff/issues/1953), [PR](https://github.com/astral-sh/ruff/pull/1995), or in the pylint [docs](https://flake8.pycqa.org/en/latest/internal/formatters.html#pylint-formatter), but it did change the results of the CLI integration test when I tried deleting it. I haven't been able to reproduce that in the CLI, though, so it may only happen with `Command::current_dir`. Test Plan -- Tests ported from `ruff_linter` and a new test for the case with no file --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2025-07-15 10:14:49 -04:00
Brent Westbrook	e9b0c33703	Move RDJSON rendering to `ruff_db` (#19293 ) ## Summary Another output format like #19133. This is the [reviewdog](https://github.com/reviewdog/reviewdog) output format, which is somewhat similar to regular JSON. Like #19270, in the first commit I converted from using `json!` to `Serialize` structs, then in the second commit I moved the module to `ruff_db`. The reviewdog [schema](`320a8e73a9/proto/rdf/jsonschema/DiagnosticResult.json`) seems a bit more flexible than our JSON schema, so I'm not sure if we need any preview checks here. I'll flag the places I wasn't sure about as review comments. ## Test Plan New tests in `rdjson.rs`, ported from the old `rjdson.rs` module, as well as the new CLI output tests. --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2025-07-15 12:39:21 +00:00
Micha Reiser	3da8b51dc1	[ty] Fix server version (#19284 )	2025-07-14 09:06:34 +02:00
Brent Westbrook	b5c5f710fc	Render Azure, JSON, and JSON lines output with the new diagnostics (#19133 ) ## Summary This was originally stacked on #19129, but some of the changes I made for JSON also impacted the Azure format, so I went ahead and combined them. The main changes here are: - Implementing `FileResolver` for Ruff's `EmitterContext` - Adding `FileResolver::notebook_index` and `FileResolver::is_notebook` methods - Adding a `DisplayDiagnostics` (with an "s") type for rendering a group of diagnostics at once - Adding `Azure`, `Json`, and `JsonLines` as new `DiagnosticFormat`s I tried a couple of alternatives to the `FileResolver::notebook` methods like passing down the `NotebookIndex` separately and trying to reparse a `Notebook` from Ruff's `SourceFile`. The latter seemed promising, but the `SourceFile` only stores the concatenated plain text of the notebook, not the re-parsable JSON. I guess the current version is just a variation on passing the `NotebookIndex`, but at least we can reuse the existing `resolver` argument. I think a lot of this can be cleaned up once Ruff has its own actual file resolver. As suggested, I also tried deleting the corresponding `Emitter` files in `ruff_linter`, but it doesn't look like git was able to follow this as a rename. It did, however, track that the tests were moved, so the snapshots should be easy to review. ## Test Plan Existing Ruff tests ported to tests in `ruff_db`. I think some other existing ruff tests also cover parts of this refactor. --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2025-07-11 15:04:46 -04:00
Andrew Gallant	100d765ddf	[ty] Document path separator usage in `VendoredFileSystem` Ref https://github.com/astral-sh/ruff/pull/19266#discussion_r2198530383	2025-07-11 10:06:35 -04:00
Andrew Gallant	948463aafa	[ty] Move `SystemOrVendoredPathRef` This moves the type and adds a few methods so that it can be used elsewhere.	2025-07-11 10:06:35 -04:00
Andrew Gallant	729fa12575	[ty] Add "readdir" for vendored file systems This is mostly just holding a zip file in the right way to simulate reading a directory. We want this to be able to discover sub-modules for completions.	2025-07-11 10:06:35 -04:00
Brent Westbrook	f14ee9edd5	Use structs for JSON serialization (#19270 ) ## Summary See https://github.com/astral-sh/ruff/pull/19133#discussion_r2198413586 for recent discussion. This PR moves to using structs for the types in our JSON output format instead of the `json!` macro. I didn't rename any of the `message` references because that should be handled when rebasing #19133 onto this. My plan for handling the `preview` behavior with the new diagnostics is to use a wrapper enum. Something like: ```rust #[derive(Serialize)] #[serde(untagged)] pub(crate) enum JsonDiagnostic<'a> { Old(OldJsonDiagnostic<'a>), } #[derive(Serialize)] pub(crate) struct OldJsonDiagnostic<'a> { // ... } ``` Initially I thought I could use a `&dyn Serialize` for the affected fields, but I see that `Serialize` isn't dyn-compatible in testing this now. ## Test Plan Existing tests. One quirk of the new types is that their fields are in alphabetical order. I guess `json!` sorts the fields alphabetically? The tests were failing before I sorted the struct fields. ## Other formats It looks like the `rdjson`, `sarif`, and `gitlab` formats also use `json!`, so if we decide to merge this, I can do something similar for those before moving them to the new diagnostic format.	2025-07-11 09:37:44 -04:00
Charlie Marsh	3ee3434187	Auto-generate environment variable references for ty (#19205 ) ## Summary This PR mirrors the environment variable implementation we have in uv: `efc361223c/crates/uv-static/src/env_vars.rs (L6-L7)`. See: https://github.com/astral-sh/ty/issues/773.	2025-07-08 10:48:31 -04:00
Brent Westbrook	2643dc5b7a	Rename `Diagnostic::syntax_error` methods, separate `Ord` implementation (#19179 ) ## Summary This PR addresses some additional feedback on #19053: - Renaming the `syntax_error` methods to `invalid_syntax` to match the lint id - Moving the standalone `diagnostic_from_violation` function to `Violation::into_diagnostic` - Removing the `Ord` and `PartialOrd` implementations from `Diagnostic` in favor of `Diagnostic::start_ordering` ## Test Plan Existing tests ## Additional Follow-ups Besides these, I also put the following comments on my todo list, but they seemed like they might be big enough to have their own PRs: - [Use `LintId::IOError` for IO errors](https://github.com/astral-sh/ruff/pull/19053#discussion_r2189425922) - [Move `Fix` and `Edit`](https://github.com/astral-sh/ruff/pull/19053#discussion_r2189448647) - [Avoid so many unwraps](https://github.com/astral-sh/ruff/pull/19053#discussion_r2189465980)	2025-07-08 09:54:19 -04:00
Brent Westbrook	77a5c5ac80	Combine `OldDiagnostic` and `Diagnostic` (#19053 ) ## Summary This PR is a collaboration with @AlexWaygood from our pairing session last Friday. The main goal here is removing `ruff_linter::message::OldDiagnostic` in favor of using `ruff_db::diagnostic::Diagnostic` directly. This involved a few major steps: - Transferring the fields - Transferring the methods and trait implementations, where possible - Converting some constructor methods to free functions - Moving the `SecondaryCode` struct - Updating the method names I'm hoping that some of the methods, especially those in the `expect_ruff_*` family, won't be necessary long-term, but I avoided trying to replace them entirely for now to keep the already-large diff a bit smaller. ### Related refactors Alex and I noticed a few refactoring opportunities while looking at the code, specifically the very similar implementations for `create_parse_diagnostic`, `create_unsupported_syntax_diagnostic`, and `create_semantic_syntax_diagnostic`. We combined these into a single generic function, which I then copied into `ruff_linter::message` with some small changes and a TODO to combine them in the future. I also deleted the `DisplayParseErrorType` and `TruncateAtNewline` types for reporting parse errors. These were added in #4124, I believe to work around the error messages from LALRPOP. Removing these didn't affect any tests, so I think they were unnecessary now that we fully control the error messages from the parser. On a more minor note, I factored out some calls to the `OldDiagnostic::filename` (now `Diagnostic::expect_ruff_filename`) function to avoid repeatedly allocating `String`s in some places. ### Snapshot changes The `show_statistics_syntax_errors` integration test changed because the `OldDiagnostic::name` method used `syntax-error` instead of `invalid-syntax` like in ty. I think this (`--statistics`) is one of the only places we actually use this name for syntax errors, so I hope this is okay. An alternative is to use `syntax-error` in ty too. The other snapshot changes are from removing this code, as discussed on [Discord](https://discord.com/channels/1039017663004942429/1228460843033821285/1388252408848847069): `34052a1185/crates/ruff_linter/src/message/mod.rs (L128-L135)` I think both of these are technically breaking changes, but they only affect syntax errors and are very narrow in scope, while also pretty substantially simplifying the refactor, so I hope they're okay to include in a patch release. ## Test plan Existing tests, with the adjustments mentioned above --------- Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2025-07-03 13:01:09 -04:00
Ibraheem Ahmed	ebc70a4002	[ty] Support LSP go-to with vendored typeshed stubs (#19057 ) ## Summary Extracts the vendored typeshed stubs lazily and caches them on the local filesystem to support go-to in the LSP. Resolves https://github.com/astral-sh/ty/issues/77.	2025-07-02 07:58:58 -04:00
Micha Reiser	f7fc8fb084	[ty] Request configuration from client (#18984 ) ## Summary This PR makes the necessary changes to the server that it can request configurations from the client using the `configuration` request. This PR doesn't make use of the request yet. It only sets up the foundation (mainly the coordination between client and server) so that future PRs could pull specific settings. I plan to use this for pulling the Python environment from the Python extension. Deno does something very similar to this. ## Test Plan Tested that diagnostics are still shown.	2025-07-02 14:31:41 +05:30
Micha Reiser	29927f2b59	Update Rust toolchain to 1.88 and MSRV to 1.86 (#19011 )	2025-06-28 20:24:00 +02:00
Ibraheem Ahmed	6f7b1c9bb3	[ty] Add environment variable to dump Salsa memory usage stats (#18928 ) ## Summary Setting `TY_MEMORY_REPORT=full` will generate and print a memory usage report to the CLI after a `ty check` run: ``` =======SALSA STRUCTS======= `Definition` metadata=7.24MB fields=17.38MB count=181062 `Expression` metadata=4.45MB fields=5.94MB count=92804 `member_lookup_with_policy_::interned_arguments` metadata=1.97MB fields=2.25MB count=35176 ... =======SALSA QUERIES======= `File -> ty_python_semantic::semantic_index::SemanticIndex` metadata=11.46MB fields=88.86MB count=1638 `Definition -> ty_python_semantic::types::infer::TypeInference` metadata=24.52MB fields=86.68MB count=146018 `File -> ruff_db::parsed::ParsedModule` metadata=0.12MB fields=69.06MB count=1642 ... =======SALSA SUMMARY======= TOTAL MEMORY USAGE: 577.61MB struct metadata = 29.00MB struct fields = 35.68MB memo metadata = 103.87MB memo fields = 409.06MB ``` Eventually, we should integrate these numbers into CI in some form. The one limitation currently is that heap allocations in salsa structs (e.g. interned values) are not tracked, but memoized values should have full coverage. We may also want a peak memory usage counter (that accounts for non-salsa memory), but that is relatively simple to profile manually (e.g. `time -v ty check`) and would require a compile-time option to avoid runtime overhead.	2025-06-26 21:27:51 +00:00
Micha Reiser	76387295a5	[ty] Move venv and conda env discovery to `SearchPath::from_settings` (#18938 ) Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2025-06-26 16:39:27 +02:00
Micha Reiser	0194452928	[ty] Rename `src.root` setting to `environment.root` (#18760 ) Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2025-06-24 14:40:44 +02:00
Micha Reiser	23261a38a0	[ty] Add more benchmarks (#18714 )	2025-06-18 13:41:38 +02:00
Micha Reiser	3a430fa6da	[ty] Allow overriding rules for specific files (#18648 )	2025-06-15 14:27:39 +01:00
Ibraheem Ahmed	c9dff5c7d5	[ty] AST garbage collection (#18482 ) ## Summary Garbage collect ASTs once we are done checking a given file. Queries with a cross-file dependency on the AST will reparse the file on demand. This reduces ty's peak memory usage by ~20-30%. The primary change of this PR is adding a `node_index` field to every AST node, that is assigned by the parser. `ParsedModule` can use this to create a flat index of AST nodes any time the file is parsed (or reparsed). This allows `AstNodeRef` to simply index into the current instance of the `ParsedModule`, instead of storing a pointer directly. The indices are somewhat hackily (using an atomic integer) assigned by the `parsed_module` query instead of by the parser directly. Assigning the indices in source-order in the (recursive) parser turns out to be difficult, and collecting the nodes during semantic indexing is impossible as `SemanticIndex` does not hold onto a specific `ParsedModuleRef`, which the pointers in the flat AST are tied to. This means that we have to do an extra AST traversal to assign and collect the nodes into a flat index, but the small performance impact (~3% on cold runs) seems worth it for the memory savings. Part of https://github.com/astral-sh/ty/issues/214.	2025-06-13 08:40:11 -04:00
Micha Reiser	1f27d53fd5	[ty] File inclusion and exclusion (#18498 )	2025-06-12 19:07:31 +02:00
renovate[bot]	475a02b725	Update pre-commit dependencies (#18581 )	2025-06-09 08:08:17 +02:00
Micha Reiser	86e5a311f0	[ty] Introduce and use `System::env_var` for better test isolation (#18538 )	2025-06-07 19:56:58 +02:00
Ibraheem Ahmed	8531f4b3ca	[ty] Add infrastructure for AST garbage collection (#18445 ) ## Summary https://github.com/astral-sh/ty/issues/214 will require a couple invasive changes that I would like to get merged even before garbage collection is fully implemented (to avoid rebasing): - `ParsedModule` can no longer be dereferenced directly. Instead you need to load a `ParsedModuleRef` to access the AST, which requires a reference to the salsa database (as it may require re-parsing the AST if it was collected). - `AstNodeRef` can only be dereferenced with the `node` method, which takes a reference to the `ParsedModuleRef`. This allows us to encode the fact that ASTs do not live as long as the database and may be collected as soon a given instance of a `ParsedModuleRef` is dropped. There are a number of places where we currently merge the `'db` and `'ast` lifetimes, so this requires giving some types/functions two separate lifetime parameters.	2025-06-05 11:43:18 -04:00
Micha Reiser	8005ebb405	Update salsa past generational id change (#18362 )	2025-05-30 15:31:33 +02:00
Alex Waygood	a5ebb3f3a2	[ty] Support ephemeral uv virtual environments (#18335 )	2025-05-28 14:54:59 +00:00
Micha Reiser	d8216fa328	[ty] Gracefully handle salsa cancellations and panics in background request handlers (#18254 )	2025-05-26 13:37:49 +01:00
Micha Reiser	3b56c7ca3d	Update salsa (#18212 )	2025-05-20 09:19:34 +02:00
Brent Westbrook	d6009eb942	Unify `Message` variants (#18051 ) ## Summary This PR unifies the ruff `Message` enum variants for syntax errors and rule violations into a single `Message` struct consisting of a shared `db::Diagnostic` and some additional, optional fields used for some rule violations. This version of `Message` is nearly a drop-in replacement for `ruff_diagnostics::Diagnostic`, which is the next step I have in mind for the refactor. I think this is also a useful checkpoint because we could possibly add some of these optional fields to the new `Diagnostic` type. I think we've previously discussed wanting support for `Fix`es, but the other fields seem less relevant, so we may just need to preserve the `Message` wrapper for a bit longer. ## Test plan Existing tests --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2025-05-19 13:34:04 -04:00
Micha Reiser	6985de4c40	[ty] Show related information in diagnostic (#17359 )	2025-05-19 18:52:12 +02:00
Micha Reiser	9ae698fe30	Switch to Rust 2024 edition (#18129 )	2025-05-16 13:25:28 +02:00
Micha Reiser	196e4befba	Update MSRV to 1.85 and toolchain to 1.87 (#18126 )	2025-05-16 09:19:55 +02:00
Micha Reiser	b6b7caa023	[ty] Change layout of extra verbose output and respect `--color` for verbose output (#18089 )	2025-05-15 09:57:59 +02:00
Andrew Gallant	b71ef8a26e	ruff_db: completely rip `lint:` prefix out This does a deeper removal of the `lint:` prefix by removing the `DiagnosticId::as_str` method and replacing it with `as_concise_str`. We remove the associated error type and simplify the `Display` impl for `DiagnosticId` as well. This turned out to catch a `lint:` that was still in the diagnostic output: the part that says why a lint is enabled.	2025-05-09 12:42:14 -04:00
Andrew Gallant	50c780fc8b	ty: switch to use `annotate-snippets` ID functionality We just set the ID on the `Message` and it just does what we want in this case. I think I didn't do this originally because I was trying to preserve the existing rendering? I'm not sure. I might have just missed this method.	2025-05-09 12:42:14 -04:00
Andrew Gallant	244ea27d5f	ruff_db: a small tweak to remove empty message case In a subsequent commit, we're going to start using `annotate-snippets`'s functionality for diagnostic IDs in the rendering. As part of doing that, I wanted to remove this special casing of an empty message. I did that independently to see what, if anything, would change. (The changes look fine to me. They'll be tweaked again in the next commit along with a bunch of others.)	2025-05-09 12:42:14 -04:00
Andrew Gallant	2c4cbb6e29	ty: get rid of `lint:` prefix in ID for diagnostic rendering In #289, we seem to have consensus that this prefix isn't really pulling its weight. Ref #289	2025-05-09 12:42:14 -04:00
Micha Reiser	6cd8a49638	[ty] Update salsa (#17964 )	2025-05-09 11:54:07 +02:00
Brent Westbrook	981bd70d39	Convert `Message::SyntaxError` to use `Diagnostic` internally (#17784 ) ## Summary This PR is a first step toward integration of the new `Diagnostic` type into ruff. There are two main changes: - A new `UnifiedFile` enum wrapping `File` for red-knot and a `SourceFile` for ruff - ruff's `Message::SyntaxError` variant is now a `Diagnostic` instead of a `SyntaxErrorMessage` The second of these changes was mostly just a proof of concept for the first, and it went pretty smoothly. Converting `DiagnosticMessage`s will be most of the work in replacing `Message` entirely. ## Test Plan Existing tests, which show no changes. --------- Co-authored-by: Carl Meyer <carl@astral.sh> Co-authored-by: Micha Reiser <micha@reiser.io>	2025-05-08 12:45:51 -04:00
Micha Reiser	067a8ac574	[ty] Default to latest supported python version (#17938 )	2025-05-08 16:58:35 +02:00
David Peter	4f890b2867	[ty] Update salsa (#17937 ) ## Summary * Update salsa to pull in https://github.com/salsa-rs/salsa/pull/850. * Some refactoring of salsa event callbacks in various `Db`'s due to https://github.com/salsa-rs/salsa/pull/849 closes https://github.com/astral-sh/ty/issues/108 ## Test Plan Ran `cargo run --bin ty -- -vvv` on a test file to make sure that salsa Events are still logged.	2025-05-08 12:02:53 +02:00
Alex Waygood	74fe7982ba	[ty] Sort collected diagnostics before snapshotting them in mdtest (#17926 )	2025-05-07 18:23:22 +01:00
Micha Reiser	6f821ac846	Show a warning at the end of the diagnostic list if there are any fatal warnings (#17855 )	2025-05-06 07:14:21 +00:00
Micha Reiser	e95130ad80	Introduce `TY_MAX_PARALLELISM` environment variable (#17830 )	2025-05-04 16:27:15 +02:00
Micha Reiser	fa628018b2	Use `#[expect(lint)]` over `#[allow(lint)]` where possible (#17822 )	2025-05-03 21:20:31 +02:00
Eric Botti	8535af8516	[red-knot] Add support for the LSP diagnostic tag (#17657 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2025-05-03 20:35:03 +02:00
Micha Reiser	b51c4f82ea	Rename Red Knot (#17820 )	2025-05-03 19:49:15 +02:00
Micha Reiser	b7e69ecbfc	[red-knot] Increase durability of read-only `File` fields (#17757 )	2025-05-01 09:25:48 +02:00
Micha Reiser	d94be0e780	[red-knot] Include salsa backtrace in check and mdtest panic messages (#17732 ) Co-authored-by: David Peter <sharkdp@users.noreply.github.com>	2025-04-30 10:26:40 +02:00
Micha Reiser	1d788981cd	[red-knot] Capture backtrace in "check-failed" diagnostic (#17641 ) Co-authored-by: David Peter <sharkdp@users.noreply.github.com>	2025-04-29 16:58:58 +00:00
Hans	9b9d16c3ba	[red-knot] colorize concise output diagnostics (#17232 ) (#17479 ) Co-authored-by: Micha Reiser <micha@reiser.io> Co-authored-by: Andrew Gallant <andrew@astral.sh>	2025-04-29 16:07:16 +02:00
Andrew Gallant	405878a128	ruff_db: render file paths in diagnostics as relative paths if possible This is done in what appears to be the same way as Ruff: we get the CWD, strip the prefix from the path if possible, and use that. If stripping the prefix fails, then we print the full path as-is. Fixes #17233	2025-04-28 14:32:34 -04:00
Micha Reiser	1c65e0ad25	Split `SourceLocation` into `LineColumn` and `SourceLocation` (#17587 )	2025-04-27 11:27:33 +01:00
Micha Reiser	cfa1505068	[red-knot] Fix CLI hang when a dependent query panics (#17631 )	2025-04-26 06:28:45 +00:00
Andrew Gallant	bc0a5aa409	ruff_db: add tests for annotations with no ranges ... and fix the case where an annotation with a `Span` but no `TextRange` or message gets completely dropped.	2025-04-25 13:25:20 -04:00
Andrew Gallant	43bd043755	ruff_db: add a `From` impl for `FileRange` to `Span` These types are almost equivalent. The only difference is that a `Span`'s range is optional.	2025-04-24 11:43:01 -04:00
Brent Westbrook	e7f38fe74b	[red-knot] Detect semantic syntax errors (#17463 ) Summary -- This PR extends semantic syntax error detection to red-knot. The main changes here are: 1. Adding `SemanticSyntaxChecker` and `Vec<SemanticSyntaxError>` fields to the `SemanticIndexBuilder` 2. Calling `SemanticSyntaxChecker::visit_stmt` and `visit_expr` in the `SemanticIndexBuilder`'s `visit_stmt` and `visit_expr` methods 3. Implementing `SemanticSyntaxContext` for `SemanticIndexBuilder` 4. Adding new mdtests to test the context implementation and show diagnostics (3) is definitely the trickiest and required (I think) a minor addition to the `SemanticIndexBuilder`. I tried to look around for existing code performing the necessary checks, but I definitely could have missed something or misused the existing code even when I found it. There's still one TODO around `global` statement handling. I don't think there's an existing way to look this up, but I'm happy to work on that here or in a separate PR. This currently only affects detection of one error (`LoadBeforeGlobalDeclaration` or [PLE0118](https://docs.astral.sh/ruff/rules/load-before-global-declaration/) in ruff), so it's not too big of a problem even if we leave the TODO. Test Plan -- New mdtests, as well as new errors for existing mdtests --------- Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2025-04-23 09:52:58 -04:00
Brent Westbrook	9c47b6dbb0	[red-knot] Detect version-related syntax errors (#16379 ) ## Summary This PR extends version-related syntax error detection to red-knot. The main changes here are: 1. Passing `ParseOptions` specifying a `PythonVersion` to parser calls 2. Adding a `python_version` method to the `Db` trait to make this possible 3. Converting `UnsupportedSyntaxError`s to `Diagnostic`s 4. Updating existing mdtests to avoid unrelated syntax errors My initial draft of (1) and (2) in #16090 instead tried passing a `PythonVersion` down to every parser call, but @MichaReiser suggested the `Db` approach instead [here](https://github.com/astral-sh/ruff/pull/16090#discussion_r1969198407), and I think it turned out much nicer. All of the new `python_version` methods look like this: ```rust fn python_version(&self) -> ruff_python_ast::PythonVersion { Program::get(self).python_version(self) } ``` with the exception of the `TestDb` in `ruff_db`, which hard-codes `PythonVersion::latest()`. ## Test Plan Existing mdtests, plus a new mdtest to see at least one of the new diagnostics.	2025-04-17 14:00:30 -04:00
Andrew Gallant	7d11ef1564	red_knot_python_semantic: make `TextRange` required for reporting a lint diagnostic This commit shuffles the reporting API around a little bit such that a range is required, up front, when reporting a lint diagnostic. This in turn enables us to make suppression checking eager. In order to avoid callers needing to provide the range twice, we create a primary annotation without a message inside the `Diagnostic` encapsulated by the guard. We do this instead of requiring the message up front because we're concerned about API complexity and the effort involved in creating the message. In order to provide a means of attaching a message to the primary annotation, we expose a convenience API on `LintDiagnosticGuard` for setting the message. This isn't generally possible for a `Diagnostic`, but since a `LintDiagnosticGuard` knows how the `Diagnostic` was constructed, we can offer this API correctly. It strikes me that it might be easy to forget to attach a primary annotation message, btu I think this the "least" bad failure mode. And in particular, it should be somewhat obvious that it's missing once one adds a snapshot test for how the diagnostic renders. Otherwise, this API gives us the ability to eagerly check whether a diagnostic should be reported with nearly minimal information. It also shouldn't have any footguns since it guarantees that the primary annotation is tied to the file in the typing context. And it keeps things pretty simple: callers only need to provide what is actually strictly necessary to make a diagnostic.	2025-04-11 12:36:36 -04:00
Andrew Gallant	b79d43a852	ruff_db: add primary annotation message mutators on `Diagnostic` This will enable us to provide an API on `LintDiagnosticGuard` for setting the primary annotation message. It will require an `unwrap()`, but due to how `LintDiagnosticGuard` will build a `Diagnostic`, this `unwrap()` will be guaranteed to succeed. (And it won't bubble out to every user of `LintDiagnosticGuard`.)	2025-04-11 12:36:36 -04:00
Andrew Gallant	bd5b8a9ec6	ruff_db: tweak APIs accepting a diagnostic message This expands the set of types accepted for diagnostic messages. Instead of only `std::fmt::Display` impls, we now also accept a concrete `DiagnosticMessage`. This will be useful to avoid unnecessary copies of every single diagnostic message created via `InferContext::report_lint`. (I'll call out how this helps in a subsequent commit.)	2025-04-11 12:36:36 -04:00
Andrew Gallant	81045758d3	ruff_db: use `Annotation::get_message` in more places I added this accessor because tests want it, but we can also use it in other places internally. It's a little nicer because it does the `as_deref()` for you.	2025-04-10 13:21:00 -04:00
Andrew Gallant	7d958a9ee5	red_knot_python_semantic: remove the "old" secondary message type This finally completes the deletion of all old diagnostic types. We do this by migrating the second (and last) use of secondary diagnostic messages: to highlight the return type of a function definition when its return value is inconsistent with the type. Like the last diagnostic, we do actually change the message here a bit. We don't need a sub-diagnostic here, and we can instead just add a secondary annotation to highlight the return type.	2025-04-10 13:21:00 -04:00
Andrew Gallant	28b64064f5	ruff_db: tweak how the revealed type diagnostic is rendered In the new diagnostic data model, we really should have a main diagnostic message and a primary span (with an optional message attached to it) for every diagnostic. In this commit, I try to make this true for the "revealed type" diagnostic. Instead of the annotation saying both "revealed type is" and also the revealed type itself, the annotation is now just the revealed type and the main diagnostic message is "Revealed type." I expect this may be controversial. I'm open to doing something different. I tried to avoid redundancy, but maybe this is a special case where we want the redundancy. I'm honestly not sure. I do like how it looks with this commit, but I'm not working with Red Knot's type checking daily, so my opinion doesn't count for much. This did also require some tweaking to concise diagnostic formatting in order to preserve the essential information. This commit doesn't update every relevant snapshot. Just a few. I split the rest out into the next commit.	2025-04-10 13:21:00 -04:00
Micha Reiser	3150812ac4	[red-knot] Add 'Format document' to playground (#17217 ) ## Summary This is more "because we can" than something we need. But since we're already building an "almost IDE" ## Test Plan https://github.com/user-attachments/assets/3a4bdad1-ba32-455a-9909-cfeb8caa1b28	2025-04-07 09:26:03 +02:00
Andrew Gallant	adeba3dca7	ruff_db: simplify lifetimes on `DiagnosticDisplay` I initially split the lifetime out into three distinct lifetimes on near-instinct because I moved the struct into the public API. But because they are all shared borrows, and because there are no other APIs on `DisplayDiagnostic` to access individual fields (and probably never will be), it's probably fine to just specify one lifetime. Because of subtyping, the one lifetime will be the shorter of the three. There's also the point that `ruff_db` isn't _really_ a public API, since it isn't a library that others depend on. So my instinct is probably a bit off there.	2025-04-02 12:47:02 -04:00
Andrew Gallant	718b0cadf4	ruff_db: switch diagnostic rendering over to `std::fmt::Display` It was already using this approach internally, so this is "just" a matter of rejiggering the public API of `Diagnostic`. We were previously writing directly to a `std::io::Write` since it was thought that this worked better with the linear typing fakery. Namely, it increased confidence that the diagnostic rendering was actually written somewhere useful, instead of just being converted to a string that could potentially get lost. For reasons discussed in #17130, the linear type fakery was removed. And so there is less of a reason to require a `std::io::Write` implementation for diagnostic rendering. Indeed, this would sometimes result in `unwrap()` calls when one wants to convert to a `String`.	2025-04-02 11:01:16 -04:00
Andrew Gallant	c30e80a3f4	ruff_db: delete most of the old diagnostic code We do keep around `OldSecondaryDiagnosticMessage`, since that's part of the Red Knot `InferContext` API. But it's a rather simple type, and we'll be able to delete it entirely once `InferContext` exposes the new `Diagnostic` type directly. Since we aren't consuming `OldSecondaryDiagnosticMessage` any more, we can now accept a slice instead of a vec. (Thanks Clippy.)	2025-04-02 10:10:01 -04:00
Andrew Gallant	4e169e5f6c	red_knot: use `Diagnostic` inside of red knot This replaces things like `TypeCheckDiagnostic` with the new Diagnostic` type. This is a "surgical" replacement where we retain the existing API of of diagnostic reporting such that _most_ of Red Knot doesn't need to be changed to support this update. But it will enable us to start using the new diagnostic renderer and to delete the old renderer. It also paves the path for exposing the new `Diagnostic` data model to the broader Red Knot codebase.	2025-04-02 10:10:01 -04:00
Andrew Gallant	883b8e3870	ruff_db: port concise diagnostic rendering to new renderer Previously, this was only available in the old renderer. To avoid regressions, we just copy it to the new renderer. We don't bother with DRY because the old renderer will be deleted very soon.	2025-04-02 10:10:01 -04:00
Andrew Gallant	2ca2f73ba8	ruff_db: tweak line terminators emitted by diagnostic rendering This change just brings diagnostic rendering into parity with the status quo.	2025-04-02 10:10:01 -04:00
Andrew Gallant	90f0766210	ruff_db: make `Diagnostic::print` use a non-mutable borrow Now that we don't need to update the `printed` flag, this can just be an immutable borrow. (Arguably this should have been an immutable borrow even initially, but I didn't want to introduce interior mutability without a more compelling justification.)	2025-04-02 10:10:01 -04:00
Andrew Gallant	a9527edbbe	ruff_db: switch `Diagnostic` to use `Arc`, drop linear type fakery The switch to `Arc` was done because Salsa sometimes requires cloning a `Diagnostic` (or something that contains a `Diagnostic`). And so it probably makes sense to make this cheap. Since `Diagnostic` exposes a mutable API, we adopt "clone on write" semantics. Although, it's more like, "clone on write when the `Arc` has more than one reference." In the common case of creating a `Diagnostic` and then immediately mutating it, no additional copies should be made over the status quo. We also drop the linear type fakery. Its interaction with Salsa is somewhat awkward, and it has been suggested that there will be points where diagnostics will be dropped unceremoniously without an opportunity to tag them as having been ignored. Moreover, this machinery was added out of "good sense" and isn't actually motivated by real world problems with accidentally ignoring diagnostics. So that makes it easier, I think, to just kick this out entirely instead of trying to find a way to make it work.	2025-04-02 10:10:01 -04:00
Andrew Gallant	57be814acb	ruff_db: add method to create sub-diagnostics from old secondary messages This is temporary to scaffold the refactor. The main idea is that we want to take the `InferContext` API, as it is, and migrate that to the new diagnostic data model internally. Then we can rip out the old stuff and iterate on the API.	2025-04-02 10:10:01 -04:00
Micha Reiser	2ae39edccf	[red-knot] Goto type definition (#16901 ) ## Summary Implement basic Goto type definition support for Red Knot's LSP. This PR also builds the foundation for other LSP operations. E.g., Goto definition, hover, etc., should be able to reuse some, if not most, logic introduced in this PR. The basic steps of resolving the type definitions are: 1. Find the closest token for the cursor offset. This is a bit more subtle than I first anticipated because the cursor could be positioned right between the callee and the `(` in `call(test)`, in which case we want to resolve the type for `call`. 2. Find the node with the minimal range that fully encloses the token found in 1. I somewhat suspect that 1 and 2 could be done at the same time but it complicated things because we also need to compute the spine (ancestor chain) for the node and there's no guarantee that the found nodes have the same ancestors 3. Reduce the node found in 2. to a node that is a valid goto target. This may require traversing upwards to e.g. find the closest expression. 4. Resolve the type for the goto target 5. Resolve the location for the type, return it to the LSP ## Design decisions The current implementation navigates to the inferred type. I think this is what we want because it means that it correctly accounts for narrowing (in which case we want to go to the narrowed type because that's the value's type at the given position). However, it does have the downside that Goto type definition doesn't work whenever we infer `T & Unknown` because intersection types aren't supported. I'm not sure what to do about this specific case, other than maybe ignoring `Unkown` in Goto type definition if the type is an intersection? ## Known limitations * Types defined in the vendored typeshed aren't supported because the client can't open files from the red knot binary (we can either implement our own file protocol and handler OR extract the typeshed files and point there). See https://github.com/astral-sh/ruff/issues/17041 * Red Knot only exposes an API to get types for expressions and definitions. However, there are many other nodes with identifiers that can have a type (e.g. go to type of a globals statement, match patterns, ...). We can add support for those in separate PRs (after we figure out how to query the types from the semantic model). See https://github.com/astral-sh/ruff/issues/17113 * We should have a higher-level API for the LSP that doesn't directly call semantic queries. I intentionally decided not to design that API just yet. ## Test plan https://github.com/user-attachments/assets/fa077297-a42d-4ec8-b71f-90c0802b4edb Goto type definition on a union <img width="1215" alt="Screenshot 2025-04-01 at 13 02 55" src="https://github.com/user-attachments/assets/689cabcc-4a86-4a18-b14a-c56f56868085" /> Note: I recorded this using a custom typeshed path so that navigating to builtins works.	2025-04-02 12:12:48 +00:00
Micha Reiser	8d16a5c8c9	[red-knot] Use `web-time` instead of `FileTime::now` (#16967 ) ## Summary `std::time::now` isn't available on `wasm32-unknown-unknown` but it is used by `FileTime::now`. This PR replaces the usages of `FileTime::now` with a target specific helper function that we already had in the memory file system. Fixes https://github.com/astral-sh/ruff/issues/16966 ## Test Plan Tested that the playground no longer crash when adding an extra-path	2025-03-25 13:03:30 +00:00
Andrew Gallant	6883c1dde7	ruff_db: delete old diagnostic renderer ... and switch to the new one. We do this switch by converting the old diagnostics to a `Diagnostic`, and then rendering that. This does not quite emit identical output. There are some changes. They could be fixed to remain the same, but the changes aren't obviously worse to me and I think the right way to improve them is to move Red Knot to the new `Diagnostic` API. The next commit will have the snapshot changes.	2025-03-17 12:46:49 -04:00
Andrew Gallant	9291074ba6	ruff_db: tweak main diagnostic message In our existing diagnostics, our message is just the diagnostic ID, and the message goes to the annotation. In reality, the diagnostic can have its own message distinct from the optional messages associated with an annotation. In order to make the outputs match, we do a small tweak here: when the main diagnostic message is empty, we drop the colon after the diagnostic ID. I expect that we'll want to rejigger this output format more in the future, but for now this was a very simple change to preserve the status quo.	2025-03-17 12:46:49 -04:00
Andrew Gallant	602a27c4e3	ruff_db: tweak number of line terminators emitted in new diagnostic renderer When moving over to the new renderer, I noticed that it was emitting an extra line terminator compared to the status quo. This removes it by turning the line terminator into a line delimiter between diagnostics.	2025-03-17 12:46:49 -04:00
Andrew Gallant	ff548b1272	ruff_db: clarify the error conditions of `Diagnostic::print`	2025-03-17 12:46:49 -04:00
Micha Reiser	c100d519e9	[internal]: Upgrade salsa (#16794 ) ## Summary Another salsa upgrade. The main motivation is to stay on a recent salsa version because there are still a lot of breaking changes happening. The most significant changes in this update: * Salsa no longer derives `Debug` by default. It now requires `interned(debug)` (or similar) * This version ships the foundation for garbage collecting interned values. However, this comes at the cost that queries now track which interned values they created (or read). The micro benchmarks in the salsa repo showed a significant perf regression. Will see if this also visible in our benchmarks. ## Test Plan `cargo test`	2025-03-17 11:05:54 +01:00
Micha Reiser	6f5a68608e	[ci]: Fixup codspeed upgrade (#16790 ) ## Summary Benchmark isn't a required build step. That's why https://github.com/astral-sh/ruff/pull/16784/ got merged with the step failing. This PR fixes up the benchmarking step	2025-03-17 09:14:22 +01:00
Micha Reiser	a467e7c8d3	[red-knot] Case sensitive module resolver (#16521 ) ## Summary This PR implements the first part of https://github.com/astral-sh/ruff/discussions/16440. It ensures that Red Knot's module resolver is case sensitive on all systems. This PR combines a few approaches: 1. It uses `canonicalize` on non-case-sensitive systems to get the real casing of a path. This works for as long as no symlinks or mapped network drives (the windows `E:\` is mapped to `\\server\share` thingy). This is the same as what Pyright does 2. If 1. fails, fall back to recursively list the parent directory and test if the path's file name matches the casing exactly as listed in by list dir. This is the same approach as CPython takes in its module resolver. The main downside is that it requires more syscalls because, unlike CPython, we Red Knot needs to invalidate its caches if a file name gets renamed (CPython assumes that the folders are immutable). It's worth noting that the file watching test that I added that renames `lib.py` to `Lib.py` currently doesn't pass on case-insensitive systems. Making it pass requires some more involved changes to `Files`. I plan to work on this next. There's the argument that landing this PR on its own isn't worth it without this issue being addressed. I think it's still a good step in the right direction even when some of the details on how and where the path case sensitive comparison is implemented. ## Test plan I added multiple integration tests (including a failing one). I tested that the `case-sensitivity` detection works as expected on Windows, MacOS and Linux and that the fast-paths are taken accordingly.	2025-03-14 19:16:44 +00:00
Micha Reiser	a128ca761f	[red-knot] Very minor simplification of the render tests (#16759 )	2025-03-14 19:13:07 +00:00
Andrew Gallant	b9d7c36a23	ruff_db: add a new diagnostic renderer We don't actually hook this up to anything in this PR, but we do go to some trouble to granularly unit test it. The unit tests caught plenty of bugs after I initially wrote down the implementation, so they were very much worth it. Closes #16506	2025-03-14 14:59:33 -04:00
Andrew Gallant	ef9a825827	ruff_db: add `context` configuration Instead of hard-coding a specific context window, it seemed prudent to make this configurable. That makes it easier to test different context window sizes as well. I am not totally convinced that this is the right place for this configuration. I could see the context window size being a property of `Diagnostic` instead, since we might want to change the context window size based not just on some end user configuration, but perhaps also the specific diagnostic. But for now, I think it's fine for it to live here, and all of the rendering logic doesn't care where it lives. So it should be relatively easy to change in the future.	2025-03-14 14:59:33 -04:00
Andrew Gallant	eb6871d209	ruff_db: add concise diagnostic mode This adds a new configuration knob to diagnostic rendering that, when enabled, will make diagnostic rendering much more terse. Specifically, it will guarantee that each diagnostic will only use one line. This doesn't actually hook the concise output option up to anything. We'll do that plumbing in the next commit.	2025-03-14 14:46:17 -04:00
Micha Reiser	ce0018c3cb	Add `OsSystem` support to mdtests (#16518 ) ## Summary This PR introduces a new mdtest option `system` that can either be `in-memory` or `os` where `in-memory` is the default. The motivation for supporting `os` is so that we can write OS/system specific tests with mdtests. Specifically, I want to write mdtests for the module resolver, testing that module resolution is case sensitive. ## Test Plan I tested that the case-sensitive module resolver test start failing when setting `system = "os"`	2025-03-06 10:41:40 +01:00
Andrew Gallant	cc324abcc2	ruff_db: add new `Diagnostic` type ... with supporting types. This is meant to give us a base to work with in terms of our new diagnostic data model. I expect the representations to be tweaked over time, but I think this is a decent start. I would also like to add doctest examples, but I think it's better if we wait until an initial version of the renderer is done for that.	2025-03-05 08:23:02 -05:00
Andrew Gallant	80be0a0115	ruff_db: move `ParseDiagnostic` to `old` submodule too This should have been with the previous two commits, but I missed it.	2025-03-05 08:23:02 -05:00
Andrew Gallant	b2e90c3f5c	ruff_db: rename `ParseDiagnostic` to `OldParseDiagnostic` I missed this in the previous commits.	2025-03-05 08:23:02 -05:00

1 2 3 4 5 ...

253 Commits