Python/ruff - ruff - Gitea: Git with a cup of tea

Commit Graph

Author	SHA1	Message	Date
Charlie Marsh	e7a2779402	Bump version to v0.0.289 (#7308 )	2023-09-12 12:00:11 -04:00
Dhruv Manilawala	ee0f1270cf	Add `NotebookIndex` to the cache (#6863 ) ## Summary This PR updates the `FileCache` to include an optional `NotebookIndex` to support caching for Jupyter Notebooks. We only require the index to compute the diagnostics and thus we don't really need to store the entire `Notebook` on the `Diagnostics` struct. This means we only need the index to be stored in the cache to reconstruct the `Diagnostics`. ## Test Plan Update an existing test case to run over the fixtures under `ruff_notebook` crate where there are multiple Jupyter Notebook. Locally, the following commands were run in order: 1. Remove the cache: `rm -rf .ruff_cache` 2. Run without cache: `cargo run --bin ruff -- check --isolated crates/ruff_notebook/resources/test/fixtures/jupyter/unused_variable.ipynb --no-cache` 3. Run with cache: `cargo run --bin ruff -- check --isolated crates/ruff_notebook/resources/test/fixtures/jupyter/unused_variable.ipynb` 4. Check whether the `.ruff_cache` directory was created or not 5. Run with cache again and verify: `cargo run --bin ruff -- check --isolated crates/ruff_notebook/resources/test/fixtures/jupyter/unused_variable.ipynb` ## Benchmarks https://github.com/astral-sh/ruff/pull/6863#issuecomment-1715675186 fixes: #6671	2023-09-12 18:29:03 +05:30
Dhruv Manilawala	f5701fcc63	Use snapshots for remaining lexer tests (#7264 ) ## Summary This PR updates the remaining lexer test cases to use the snapshots. This is mainly a mechanical refactor. ## Motivation The main motivation is so that when we add the token range values to the test case output, it's easier to update the test cases. The reason they were not using the snapshots before was because of the usage of `test_case` macro. The macros is mainly used for different EOL test cases. If we just generate the snapshots directly, then the snapshot name would be suffixed with `-1`, `-2`, etc. as the test function is still the same. So, we'll create the snapshot ourselves with the platform name for the respective EOL test cases. ## Test Plan `cargo test`	2023-09-12 00:16:38 +05:30
Micha Reiser	7c9bbcf4e2	Bump version to 0.0.288 (#7271 ) Co-authored-by: Zanie Blue <contact@zanie.dev>	2023-09-11 18:18:11 +02:00
Micha Reiser	47a253fb62	Add PreviewMode option to formatter ## Summary This PR adds the `--preview` and `--no-preview` options to the `format` command (hidden) and passes it through to the formatte. ## Test Plan I added the `dbg(f.options().preview())` statement in `FormatNodeRule::fmt` and verified that the option gets correctly passed to the formatter.	2023-09-08 12:04:28 +02:00
Micha Reiser	d9544a2d37	Disable default criterion features (#7241 )	2023-09-08 10:03:14 +00:00
Micha Reiser	e376c3ff7e	Split implicit concatenated strings before binary expressions (#7145 )	2023-09-08 06:51:26 +00:00
Micha Reiser	f1a4eb9c28	Use the unicode-ident crate (#7212 )	2023-09-07 08:19:25 +00:00
Victor Hugo Gomes	041cdb95e0	Update identifier Unicode character validation to match Python spec (#7209 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-09-07 07:08:42 +00:00
Manuel Martinez	2e58ad437e	build: add libcst from crates (#7179 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-09-06 08:24:28 +00:00
konsti	e02d76f070	Use insta_cmd (#6737 )	2023-09-05 12:21:27 +00:00
Micha Reiser	93ca8ebbc0	Formatter: Detect line endings (#7054 )	2023-09-04 08:09:31 +02:00
Dhruv Manilawala	4eaa412370	Update LibCST (#7062 ) ## Summary This PR updates the revision of `LibCST` dependency to `9c263aa897` inorder to fix https://github.com/astral-sh/ruff/issues/4899 ## Test Plan The test case including the carriage return (`\r`) character was added for `F504` and then `cargo test`. fixes: #4899	2023-09-03 09:11:24 +05:30
Charlie Marsh	2f3a950f6f	Bump version to 0.0.287 (#7038 )	2023-09-01 17:32:26 +01:00
Charlie Marsh	afcd00da56	Create `ruff_notebook` crate (#7039 ) ## Summary This PR moves `ruff/jupyter` into its own `ruff_notebook` crate. Beyond the move itself, there were a few challenges: 1. `ruff_notebook` relies on the source map abstraction. I've moved the source map into `ruff_diagnostics`, since it doesn't have any dependencies on its own and is used alongside diagnostics. 2. `ruff_notebook` has a couple tests for end-to-end linting and autofixing. I had to leave these tests in `ruff` itself. 3. We had code in `ruff/jupyter` that relied on Python lexing, in order to provide a more targeted error message in the event that a user saves a `.py` file with a `.ipynb` extension. I removed this in order to avoid a dependency on the parser, it felt like it wasn't worth retaining just for that dependency. ## Test Plan `cargo test`	2023-09-01 13:56:44 +00:00
Charlie Marsh	7c1aa98f43	Run cargo update (#6964 )	2023-08-31 13:11:11 -05:00
Micha Reiser	edfd888bd6	Add unicode benchmark (#7002 )	2023-08-30 09:57:57 +02:00
Charlie Marsh	fad23bbe60	Add a --check flag to the formatter CLI (#6982 ) ## Summary Returns an exit code of 1 if any files would be reformatted: ``` ruff on  charlie/format-check:main [$?⇡] is 📦 v0.0.286 via 🐍 v3.11.2 via 🦀 v1.72.0 ❯ cargo run -p ruff_cli -- format foo.py --check Compiling ruff_cli v0.0.286 (/Users/crmarsh/workspace/ruff/crates/ruff_cli) Finished dev [unoptimized + debuginfo] target(s) in 1.69s Running `target/debug/ruff format foo.py --check` warning: `ruff format` is a work-in-progress, subject to change at any time, and intended only for experimentation. 1 file would be reformatted ruff on  charlie/format-check:main [$?⇡] is 📦 v0.0.286 via 🐍 v3.11.2 via 🦀 v1.72.0 took 2s ❯ echo $? 1 ``` Closes #6966.	2023-08-29 12:40:00 -04:00
Chris Pryer	fa25dabf17	Add comments option to playground (#6911 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-08-28 07:26:23 +00:00
Micha Reiser	a6aa16630d	Move `Configuration` to `ruff_workspace` crate (#6920 )	2023-08-28 06:21:35 +00:00
konsti	c2413dcd2c	Add prototype of `ruff format` for projects (#6871 ) Summary Add recursive formatting based on `ruff check` file discovery for `ruff format`, as a prototype for the formatter alpha. This allows e.g. `format ../projects/django/`. It's still lacking support for any settings except line length. Note just like the existing `ruff format` this will become part of the production build, i.e. you'll be able to use it - hidden by default and with a prominent warning - with `ruff format .` after the next release. Error handling works in my manual tests (the colors do also work): ``` $ target/debug/ruff format scripts/ warning: `ruff format` is a work-in-progress, subject to change at any time, and intended for internal use only. ``` (the above changes `add_rule.py` where we have the wrong bin op breaking) ``` $ target/debug/ruff format ../projects/django/ warning: `ruff format` is a work-in-progress, subject to change at any time, and intended for internal use only. Failed to format /home/konsti/projects/django/tests/test_runner_apps/tagged/tests_syntax_error.py: source contains syntax errors: ParseError { error: UnrecognizedToken(Name { name: "syntax_error" }, None), offset: 131, source_path: "<filename>" } ``` ``` $ target/debug/ruff format a warning: `ruff format` is a work-in-progress, subject to change at any time, and intended for internal use only. Failed to read /home/konsti/ruff/a/d.py: Permission denied (os error 13) ``` Test Plan Missing! I'm not sure if it's worth building tests at this stage or how they should look like.	2023-08-27 19:12:18 +00:00
Charlie Marsh	059757a8c8	Implement `Ranged` on more structs (#6921 ) Now that it's in `ruff_text_size`, we can use it in a few places that we couldn't before.	2023-08-27 19:03:08 +00:00
Micha Reiser	ed1b4122d0	Use Codspeed for continous benchmarking (#6896 )	2023-08-26 16:34:35 +02:00
Charlie Marsh	91880b8273	Bump version to 0.0.286 (#6876 )	2023-08-25 14:59:26 -04:00
konsti	0b6dab5e3f	Add jupyter notebook cell ids in 4.5+ if missing (#6853 ) Summary See https://github.com/astral-sh/ruff/issues/6834#issuecomment-1691202417 Test Plan Added a new notebook	2023-08-25 08:34:42 +00:00
Micha Reiser	04a9a8dd03	Maybe parenthesize long constants and names (#6816 )	2023-08-24 09:47:57 +00:00
Dhruv Manilawala	fb7caf43c8	Update lexer tests to use snapshots (#6658 ) ## Summary This PR updates the lexer tests to use the snapshot testing framework. It also makes the following changes: * Remove the use of macros in the lexer tests * Use `test_case` for EOL tests ## Test Plan ``` cargo test --package ruff_python_parser --lib --all-features -- lexer::tests --no-capture ```	2023-08-22 18:23:19 +00:00
Zanie Blue	5892c691ea	Bump version to 0.0.285 (#6660 ) Requires - https://github.com/astral-sh/ruff/pull/6655 - https://github.com/astral-sh/ruff/pull/6657	2023-08-17 15:46:28 -05:00
Zanie Blue	6253d8e2c8	Remove unused runtime string formatting logic (#6624 ) In https://github.com/astral-sh/ruff/pull/6616 we are adding support for nested replacements in format specifiers which makes actually formatting strings infeasible without a great deal of complexity. Since we're not using these functions (they just exist for runtime use in RustPython), we can just remove them.	2023-08-16 17:38:33 +00:00
Micha Reiser	232b44a8ca	Indent statements in suppressed ranges (#6507 )	2023-08-15 08:00:35 +02:00
Micha Reiser	51ae47ad56	Remove lex and parsing from formatter benchmark (#6547 )	2023-08-14 10:25:37 +02:00
Charlie Marsh	768686148f	Add support for unions to our Python builtins type system (#6541 ) ## Summary Fixes some TODOs introduced in https://github.com/astral-sh/ruff/pull/6538. In short, given an expression like `1 if x > 0 else "Hello, world!"`, we now return a union type that says the expression can resolve to either an `int` or a `str`. The system remains very limited, it only works for obvious primitive types, and there's no attempt to do inference on any more complex variables. (If any expression yields `Unknown` or `TypeError`, we propagate that result throughout and abort on the client's end.)	2023-08-13 18:00:50 -04:00
konsti	0c9ded9d84	Use a faster diffing library for the formatter ecosystem checks (#6497 ) Summary Some files seems notoriously slow in the formatter (secons in debug mode). This time was however almost exclusively spent in the diff algorithm to collect the similarity index, so i replaced that. I kept `similar` for printing actual diff to avoid rewriting that too, with the disadvantage that we now have to diff libraries in format_dev. I used this PR to remove the spinner from tracing-indicatif and changed `flamegraph --perfdata perf.data` to `flamegraph --perfdata perf.data --no-inline` as the former wouldn't finish for me on release builds with debug info.	2023-08-11 15:51:54 +02:00
Zanie Blue	3ecd263b4d	Bump version to 0.0.284 (#6453 ) ## What's Changed This release fixes a few bugs, notably the previous release announced a breaking change where the default target Python version changed from 3.10 to 3.8 but it was not applied. Thanks to @rco-ableton for fixing this in https://github.com/astral-sh/ruff/pull/6444 ### Bug Fixes * Do not trigger `S108` if path is inside `tempfile.` call by @dhruvmanila in https://github.com/astral-sh/ruff/pull/6416 Do not allow on zero tab width by @tjkuson in https://github.com/astral-sh/ruff/pull/6429 * Fix false-positive in submodule resolution by @charliermarsh in https://github.com/astral-sh/ruff/pull/6435 ## New Contributors * @rco-ableton made their first contribution in https://github.com/astral-sh/ruff/pull/6444 Full Changelog: https://github.com/astral-sh/ruff/compare/v0.0.283...v0.0.284	2023-08-09 13:32:33 -05:00
Zanie Blue	fe9590f39f	Bump version number to 0.0.283 (#6407 )	2023-08-08 12:31:30 -05:00
Charlie Marsh	743118ae9a	Bump version to 0.0.282 (#6241 )	2023-08-01 13:21:33 +00:00
Micha Reiser	ecfdd8d58b	Add static assertions to nodes (#6228 )	2023-08-01 11:54:49 +02:00
konsti	0fddb31235	Use tracing for format_dev (#6177 ) ## Summary [tracing](https://github.com/tokio-rs/tracing) is library for logging, tracing and related features that has a large ecosystem. Using [tracing-subscriber](https://docs.rs/tracing-subscriber) and [tracing-indicatif](https://github.com/emersonford/tracing-indicatif), we get a nice logging output that you can configure with `RUST_LOG` (e.g. `RUST_LOG=debug`) and a live look into the formatter progress. Default: ![Screenshot from 2023-07-30 13-59-53](https://github.com/astral-sh/ruff/assets/6826232/6432f835-9ff1-4771-955b-398e54c406dc) `RUST_LOG=debug`: ![Screenshot from 2023-07-30 14-01-32](https://github.com/astral-sh/ruff/assets/6826232/5f2c87da-0867-4159-82e7-b5757eebb8eb) It's easy to see in this output which files take a disproportionate amount of time. [Peek 2023-07-30 14-35.webm](https://github.com/astral-sh/ruff/assets/6826232/2c92db5c-1354-465b-a6bc-ddfb281d6f9d) It opens up further integration with the tracing ecosystem, [tracing-timing](https://docs.rs/tracing-timing/latest/tracing_timing/) and [tokio-console](https://github.com/tokio-rs/console) can e.g. show histograms and the json output allows us building better pipelines than grepping a log file. One caveat is using `parent: None` for the logging statements because tracing subscriber does not allow deactivating the span without reimplementing all the other log message formatting, too, and we don't need span information, esp. since it would currently show the progress bar span. ## Test Plan n/a	2023-07-31 19:14:01 +00:00
Charlie Marsh	dbd60b2cf5	Bump version to 0.0.281 (#6195 )	2023-07-31 13:21:43 -04:00
Micha Reiser	40f54375cb	Pull in RustPython parser (#6099 )	2023-07-27 09:29:11 +00:00
Micha Reiser	2cf00fee96	Remove parser dependency from ruff-python-ast (#6096 )	2023-07-26 17:47:22 +02:00
Micha Reiser	16e1737d1b	Use cursor based lexer (#6012 )	2023-07-26 11:32:26 +02:00
Dhruv Manilawala	025fa4eba8	Integrate the new Jupyter AST nodes in Ruff (#6086 ) ## Summary This PR adds the implementation for the new Jupyter AST nodes i.e., `ExprLineMagic` and `StmtLineMagic`. ## Test Plan Add test cases for `unparse` containing magic commands resolves: #6087	2023-07-26 08:20:30 +00:00
Zanie Blue	3000a47fe8	Include file permissions in key for cached files (#5901 ) Reimplements https://github.com/astral-sh/ruff/pull/3104 Closes https://github.com/astral-sh/ruff/issues/5726 Note that we will generate the hash for a cache key twice in normal operation. Once to check for the cached item and again to update the cache. We could optimize this by generating the hash once in `diagnostics::lint_file` and passing the `u64` into `get` and `update`. We'd probably want to wrap it in a `CacheKeyHash` enum for type safety. ## Test plan Unit tests for Windows and Unix. Manual test with case from issue ``` ❯ touch fake.py ❯ chmod +x fake.py ❯ ./target/debug/ruff --select EXE fake.py fake.py:1:1: EXE002 The file is executable but no shebang is present Found 1 error. ❯ chmod -x fake.py ❯ ./target/debug/ruff --select EXE fake.py ```	2023-07-25 17:06:47 +00:00
Charlie Marsh	ed72c027a3	Replace `NoHashHasher` usages with `FxHashMap` (#6049 ) ## Summary I had always assumed that `NoHashHasher` would be faster when using integer keys, but benchmarking shows otherwise: ``` linter/default-rules/numpy/globals.py time: [66.544 µs 66.606 µs 66.678 µs] thrpt: [44.253 MiB/s 44.300 MiB/s 44.342 MiB/s] change: time: [-0.1843% +0.1087% +0.3718%] (p = 0.46 > 0.05) thrpt: [-0.3704% -0.1086% +0.1847%] No change in performance detected. Found 1 outliers among 100 measurements (1.00%) 1 (1.00%) high mild linter/default-rules/pydantic/types.py time: [1.3787 ms 1.3811 ms 1.3837 ms] thrpt: [18.431 MiB/s 18.466 MiB/s 18.498 MiB/s] change: time: [-0.4827% -0.1074% +0.1927%] (p = 0.56 > 0.05) thrpt: [-0.1924% +0.1075% +0.4850%] No change in performance detected. linter/default-rules/numpy/ctypeslib.py time: [624.82 µs 625.96 µs 627.17 µs] thrpt: [26.550 MiB/s 26.601 MiB/s 26.650 MiB/s] change: time: [-0.7071% -0.4908% -0.2736%] (p = 0.00 < 0.05) thrpt: [+0.2744% +0.4932% +0.7122%] Change within noise threshold. linter/default-rules/large/dataset.py time: [3.1585 ms 3.1634 ms 3.1685 ms] thrpt: [12.840 MiB/s 12.861 MiB/s 12.880 MiB/s] change: time: [-1.5338% -1.3463% -1.1476%] (p = 0.00 < 0.05) thrpt: [+1.1610% +1.3647% +1.5577%] Performance has improved. linter/all-rules/numpy/globals.py time: [140.17 µs 140.37 µs 140.58 µs] thrpt: [20.989 MiB/s 21.020 MiB/s 21.051 MiB/s] change: time: [-0.1066% +0.3140% +0.7479%] (p = 0.14 > 0.05) thrpt: [-0.7423% -0.3130% +0.1067%] No change in performance detected. Found 3 outliers among 100 measurements (3.00%) 2 (2.00%) high mild 1 (1.00%) high severe linter/all-rules/pydantic/types.py time: [2.7030 ms 2.7069 ms 2.7112 ms] thrpt: [9.4064 MiB/s 9.4216 MiB/s 9.4351 MiB/s] change: time: [-0.6721% -0.4874% -0.2974%] (p = 0.00 < 0.05) thrpt: [+0.2982% +0.4898% +0.6766%] Change within noise threshold. Found 14 outliers among 100 measurements (14.00%) 12 (12.00%) high mild 2 (2.00%) high severe linter/all-rules/numpy/ctypeslib.py time: [1.4709 ms 1.4727 ms 1.4749 ms] thrpt: [11.290 MiB/s 11.306 MiB/s 11.320 MiB/s] change: time: [-1.1617% -0.9766% -0.8094%] (p = 0.00 < 0.05) thrpt: [+0.8160% +0.9862% +1.1754%] Change within noise threshold. Found 12 outliers among 100 measurements (12.00%) 9 (9.00%) high mild 3 (3.00%) high severe linter/all-rules/large/dataset.py time: [5.8086 ms 5.8163 ms 5.8240 ms] thrpt: [6.9854 MiB/s 6.9946 MiB/s 7.0038 MiB/s] change: time: [-1.5651% -1.3536% -1.1584%] (p = 0.00 < 0.05) thrpt: [+1.1720% +1.3721% +1.5900%] Performance has improved. ``` My guess is that `NoHashHasher` underperforms because the keys are not randomly distributed... Anyway, it's a ~1% (significant) performance gain on some of the above, plus we get to remove a dependency.	2023-07-24 23:41:57 +00:00
Charlie Marsh	574c0e0105	Use `match` instead of `phf` for confusable lookup (#5953 ) I don't know whether we want to make this change but here's some data... Binary size: - `main`: 30,384 - `charlie/match-phf`: 30,416 llvm-lines: - `main`: 1,784,148 - `charlie/match-phf`: 1,789,877 llvm-lines and binary size are both unchanged (or, by < 5) when moving from `u8` to `u32` return types, and even when moving to `char` keys and values. I didn't expect this, but I'm not very knowledgable on this topic. Performance: ``` Confusables/match/src time: [4.9102 µs 4.9352 µs 4.9777 µs] change: [+1.7469% +2.2421% +2.8710%] (p = 0.00 < 0.05) Performance has regressed. Found 12 outliers among 100 measurements (12.00%) 2 (2.00%) low mild 4 (4.00%) high mild 6 (6.00%) high severe Confusables/match-with-skip/src time: [2.0676 µs 2.0945 µs 2.1317 µs] change: [+0.9384% +1.6000% +2.3920%] (p = 0.00 < 0.05) Change within noise threshold. Found 8 outliers among 100 measurements (8.00%) 3 (3.00%) high mild 5 (5.00%) high severe Confusables/phf/src time: [31.087 µs 31.188 µs 31.305 µs] change: [+1.9262% +2.2188% +2.5496%] (p = 0.00 < 0.05) Performance has regressed. Found 15 outliers among 100 measurements (15.00%) 3 (3.00%) low mild 6 (6.00%) high mild 6 (6.00%) high severe Confusables/phf-with-skip/src time: [2.0470 µs 2.0486 µs 2.0502 µs] change: [-0.3093% -0.1446% +0.0106%] (p = 0.08 > 0.05) No change in performance detected. Found 4 outliers among 100 measurements (4.00%) 2 (2.00%) high mild 2 (2.00%) high severe ``` The `-with-skip` variants add our optimization which first checks whether the character is ASCII. So `match` is way, way faster than PHF, but it tends not to matter since almost all source code is ASCII anyway.	2023-07-24 02:23:36 +00:00
Charlie Marsh	f5a2fb5b5d	Bump version to 0.0.280 (#5965 )	2023-07-21 22:36:13 -04:00
Charlie Marsh	f1f89f2a7e	Bump version to 0.0.279 (#5949 )	2023-07-21 15:46:53 -04:00
konsti	196cc9b655	Fix RustPython rev to main branch (#5950 ) Summary I accidentally merged earlier while the RustPython parser rev was still pointing to the feature branch instead of to the merged main. This make the rev point to the RustPython parser repo main again	2023-07-21 15:53:14 +00:00
konsti	972f9a9c15	Fix formatting lambda with empty arguments (#5944 ) Summary Fix implemented in https://github.com/astral-sh/RustPython-Parser/pull/35: Previously, empty lambda arguments (e.g. `lambda: 1`) would get the range of the entire expression, which leads to incorrect comment placement. Now empty lambda arguments get an empty range between the `lambda` and the `:` tokens. Test Plan Added a regression test. 149 instances of unstable formatting remaining. ``` $ cargo run --bin ruff_dev --release -- format-dev --stability-check --error-file formatter-ecosystem-errors.txt --multi-project target/checkouts > formatter-ecosystem-progress.txt $ rg "Unstable formatting" target/formatter-ecosystem-errors.txt \| wc -l 149 ```	2023-07-21 15:48:45 +02:00
Micha Reiser	76e9ce6dc0	Fix `SimpleTokenizer`'s backward lexing of `# ` (#5878 )	2023-07-20 11:54:18 +02:00
konsti	92f471a666	Handle io errors gracefully (#5611 ) ## Summary It can happen that we can't read a file (a python file, a jupyter notebook or pyproject.toml), which needs to be handled and handled consistently for all file types. Instead of using `Err` or `error!`, we emit E602 with the io error as message and continue. This PR makes sure we handle all three cases consistently, emit E602. I'm not convinced that it should be possible to disable io errors, but we now handle the regular case consistently and at least print warning consistently. I went with `warn!` but i can change them all to `error!`, too. It also checks the error case when a pyproject.toml is not readable. The error message is not very helpful, but it's now a bit clearer that actually ruff itself failed instead vs this being a diagnostic. ## Examples This is how an Err of `run` looks now: ![image](https://github.com/astral-sh/ruff/assets/6826232/890f7ab2-2309-4b6f-a4b3-67161947cc83) With an unreadable file and `IOError` disabled: ![image](https://github.com/astral-sh/ruff/assets/6826232/fd3d6959-fa23-4ddf-b2e5-8d6022df54b1) (we lint zero files but count files before linting not during so we exit 0) I'm not sure if it should (or if we should take a different path with manual ExitStatus), but this currently also triggers when `files` is empty: ![image](https://github.com/astral-sh/ruff/assets/6826232/f7ede301-41b5-4743-97fd-49149f750337) ## Test Plan Unix only: Create a temporary directory with files with permissions `000` (not readable by the owner) and run on that directory. Since this breaks the assumptions of most of the test code (single file, `ruff` instead of `ruff_cli`), the test code is rather cumbersome and looks a bit misplaced; i'm happy about suggestions to fit it in closer with the other tests or streamline it in other ways. I added another test for when the entire directory is not readable.	2023-07-20 11:30:14 +02:00
Charlie Marsh	5f3da9955a	Rename `ruff_python_whitespace` to `ruff_python_trivia` (#5886 ) ## Summary This crate now contains utilities for dealing with trivia more broadly: whitespace, newlines, "simple" trivia lexing, etc. So renaming it to reflect its increased responsibilities. To avoid conflicts, I've also renamed `Token` and `TokenKind` to `SimpleToken` and `SimpleTokenKind`.	2023-07-19 11:48:27 -04:00
Micha Reiser	46a17d11f3	playground: Add AST/Tokens/Formatter panels (#5859 )	2023-07-19 14:46:08 +00:00
Micha Reiser	cda90d071c	Upgrade cargo insta (#5872 )	2023-07-19 12:56:32 +02:00
Charlie Marsh	4204fc002d	Remove exception-handler lexing from `unused-bound-exception` fix (#5851 ) ## Summary The motivation here is that it will make this rule easier to rewrite as a deferred check. Right now, we can't run this rule in the deferred phase, because it depends on the `except_handler` to power its autofix. Instead of lexing the `except_handler`, we can use the `SimpleTokenizer` from the formatter, and just lex forwards and backwards. For context, this rule detects the unused `e` in: ```python try: pass except ValueError as e: pass ```	2023-07-18 18:27:46 +00:00
konsti	730e6b2b4c	Refactor `StmtIf`: Formatter and Linter (#5459 ) ## Summary Previously, `StmtIf` was defined recursively as ```rust pub struct StmtIf { pub range: TextRange, pub test: Box<Expr>, pub body: Vec<Stmt>, pub orelse: Vec<Stmt>, } ``` Every `elif` was represented as an `orelse` with a single `StmtIf`. This means that this representation couldn't differentiate between ```python if cond1: x = 1 else: if cond2: x = 2 ``` and ```python if cond1: x = 1 elif cond2: x = 2 ``` It also makes many checks harder than they need to be because we have to recurse just to iterate over an entire if-elif-else and because we're lacking nodes and ranges on the `elif` and `else` branches. We change the representation to a flat ```rust pub struct StmtIf { pub range: TextRange, pub test: Box<Expr>, pub body: Vec<Stmt>, pub elif_else_clauses: Vec<ElifElseClause>, } pub struct ElifElseClause { pub range: TextRange, pub test: Option<Expr>, pub body: Vec<Stmt>, } ``` where `test: Some(_)` represents an `elif` and `test: None` an else. This representation is different tradeoff, e.g. we need to allocate the `Vec<ElifElseClause>`, the `elif`s are now different than the `if`s (which matters in rules where want to check both `if`s and `elif`s) and the type system doesn't guarantee that the `test: None` else is actually last. We're also now a bit more inconsistent since all other `else`, those from `for`, `while` and `try`, still don't have nodes. With the new representation some things became easier, e.g. finding the `elif` token (we can use the start of the `ElifElseClause`) and formatting comments for if-elif-else (no more dangling comments splitting, we only have to insert the dangling comment after the colon manually and set `leading_alternate_branch_comments`, everything else is taken of by having nodes for each branch and the usual placement.rs fixups). ## Merge Plan This PR requires coordination between the parser repo and the main ruff repo. I've split the ruff part, into two stacked PRs which have to be merged together (only the second one fixes all tests), the first for the formatter to be reviewed by @michareiser and the second for the linter to be reviewed by @charliermarsh. * MH: Review and merge https://github.com/astral-sh/RustPython-Parser/pull/20 * MH: Review and merge or move later in stack https://github.com/astral-sh/RustPython-Parser/pull/21 * MH: Review and approve https://github.com/astral-sh/RustPython-Parser/pull/22 * MH: Review and approve formatter PR https://github.com/astral-sh/ruff/pull/5459 * CM: Review and approve linter PR https://github.com/astral-sh/ruff/pull/5460 * Merge linter PR in formatter PR, fix ecosystem checks (ecosystem checks can't run on the formatter PR and won't run on the linter PR, so we need to merge them first) * Merge https://github.com/astral-sh/RustPython-Parser/pull/22 * Create tag in the parser, update linter+formatter PR * Merge linter+formatter PR https://github.com/astral-sh/ruff/pull/5459 --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-07-18 13:40:15 +02:00
konsti	d098256c96	Add a tool for shrinking failing examples (#5731 ) ## Summary For formatter instabilities, the message we get look something like this: ```text Unstable formatting /home/konsti/ruff/target/checkouts/deepmodeling:dpdispatcher/dpdispatcher/slurm.py @@ -47,9 +47,9 @@ - script_header_dict["slurm_partition_line"] = ( - NOT_YET_IMPLEMENTED_ExprJoinedStr - ) + script_header_dict[ + "slurm_partition_line" + ] = NOT_YET_IMPLEMENTED_ExprJoinedStr Unstable formatting /home/konsti/ruff/target/checkouts/deepmodeling:dpdispatcher/dpdispatcher/pbs.py @@ -26,9 +26,9 @@ - pbs_script_header_dict["select_node_line"] += ( - NOT_YET_IMPLEMENTED_ExprJoinedStr - ) + pbs_script_header_dict[ + "select_node_line" + ] += NOT_YET_IMPLEMENTED_ExprJoinedStr ``` For ruff crashes. you don't even get that but just the file that crashed it. To extract the actual bug, you'd need to manually remove parts of the file, rerun to see if the bug still occurs (and revert if it doesn't) until you have a minimal example. With this script, you run ```shell cargo run --bin ruff_shrinking -- target/checkouts/deepmodeling:dpdispatcher/dpdispatcher/slurm.py target/minirepo/code.py "Unstable formatting" "target/debug/ruff_dev format-dev --stability-check target/minirepo" ``` and get ```python class Slurm(): def gen_script_header(self, job): if resources.queue_name != "": script_header_dict["slurm_partition_line"] = f"#SBATCH --partition {resources.queue_name}" ``` which is an nice minimal example. I've been using this script and it would be easier for me if this were part of main. The main disadvantage to merging is that it adds additional dependencies. ## Test Plan I've been using this for a number of minimization. This is an internal helper script you only run manually. I could add a test that minimizes a rule violation if required. --------- Co-authored-by: Micha Reiser <micha@reiser.io>	2023-07-18 08:03:35 +00:00
David Szotten	52aa2fc875	upgrade rustpython to remove tuple-constants (#5840 ) c.f. https://github.com/astral-sh/RustPython-Parser/pull/28 Tests: No snapshots changed --------- Co-authored-by: Zanie <contact@zanie.dev>	2023-07-17 22:50:31 +00:00
konsti	7dd30f0270	Read black options in format_dev script (#5827 ) ## Summary Comparing repos with black requires that we use the settings as black, notably line length and magic trailing comma behaviour. Excludes and preserving quotes (vs. a preference for either quote style) is not yet implemented because they weren't needed for the test projects. In the other two commits i fixed the output when the progress bar is hidden (this way is recommonded in the indicatif docs), added a `scratch.pyi` file to gitignore because black formats stub files differently and also updated the ecosystem readme with the projects json without forks. ## Test Plan I added a `line-length` vs `line_length` test. Otherwise only my personal usage atm, a PR to integrate the script into the CI to check some projects will follow.	2023-07-17 13:29:43 +00:00
Tom Kuson	8420008e79	Avoid checking `EXE001` and `EXE002` on WSL (#5735 ) ## Summary Do not raise `EXE001` and `EXE002` if WSL is detected. Uses the [`wsl`](https://crates.io/crates/wsl) crate. Closes #5445. ## Test Plan `cargo test` I don't use Windows, so was unable to test on a WSL environment. It would be good if someone who runs Windows could check the functionality.	2023-07-13 07:36:07 -04:00
Charlie Marsh	0ead9a16ac	Bump version to 0.0.278 (#5714 )	2023-07-12 12:39:56 -04:00
David Szotten	1e894f328c	formatter: multi char tokens in SimpleTokenizer (#5610 )	2023-07-10 09:00:59 +01:00
Charlie Marsh	401d172e47	Use a simple match statement for case-insensitive noqa lookup (#5633 ) ## Summary It turns out that just doing this match directly without `AhoCorasick` is much faster, like 2x (and removes one dependency, though we likely already rely on this transitively).	2023-07-09 22:15:23 -04:00
Charlie Marsh	072358e26b	Use Instagram's LibCST rather than our fork (#5593 ) ## Summary Historically, we only used a fork to enable building without pyo3. But pyo3 is an optional feature. I may've just not understood how to accomplish this way back when.	2023-07-07 10:00:44 -04:00
konsti	b22e6c3d38	Extend ruff_dev formatter script to compute statistics and format a project (#5492 ) ## Summary This extends the `ruff_dev` formatter script util. Instead of only doing stability checks, you can now choose different compatible options on the CLI and get statistics. * It adds an option the formats all files that ruff would check to allow looking at an entire black-formatted repository with `git diff` * It computes the [Jaccard index](https://en.wikipedia.org/wiki/Jaccard_index) as a measure of deviation between input and output, which is useful as single number metric for assessing our current deviations from black. * It adds progress bars to both the single projects as well as the multi-project mode. * It adds an option to write the multi-project output to a file Sample usage: ``` $ cargo run --bin ruff_dev -- format-dev --stability-check crates/ruff/resources/test/cpython $ cargo run --bin ruff_dev -- format-dev --stability-check /home/konsti/projects/django Syntax error in /home/konsti/projects/django/tests/test_runner_apps/tagged/tests_syntax_error.py: source contains syntax errors (parser error): BaseError { error: UnrecognizedToken(Name { name: "syntax_error" }, None), offset: 131, source_path: "<filename>" } Found 0 stability errors in 2755 files (jaccard index 0.911) in 9.75s $ cargo run --bin ruff_dev -- format-dev --write /home/konsti/projects/django ``` Options: ``` Several utils related to the formatter which can be run on one or more repositories. The selected set of files in a repository is the same as for `ruff check`. * Check formatter stability: Format a repository twice and ensure that it looks that the first and second formatting look the same. * Format: Format the files in a repository to be able to check them with `git diff` * Statistics: The subcommand the Jaccard index between the (assumed to be black formatted) input and the ruff formatted output Usage: ruff_dev format-dev [OPTIONS] [FILES]... Arguments: [FILES]... Like `ruff check`'s files. See `--multi-project` if you want to format an ecosystem checkout Options: --stability-check Check stability We want to ensure that once formatted content stays the same when formatted again, which is known as formatter stability or formatter idempotency, and that the formatter prints syntactically valid code. As our test cases cover only a limited amount of code, this allows checking entire repositories. --write Format the files. Without this flag, the python files are not modified --format <FORMAT> Control the verbosity of the output [default: default] Possible values: - minimal: Filenames only - default: Filenames and reduced diff - full: Full diff and invalid code -x, --exit-first-error Print only the first error and exit, `-x` is same as pytest --multi-project Checks each project inside a directory, useful e.g. if you want to check all of the ecosystem checkouts --error-file <ERROR_FILE> Write all errors to this file in addition to stdout. Only used in multi-project mode ``` ## Test Plan I ran this on django (2755 files, jaccard index 0.911) and discovered a magic trailing comma problem and that we really needed to implement import formatting. I ran the script on cpython to identify https://github.com/astral-sh/ruff/pull/5558.	2023-07-07 11:30:12 +00:00
Charlie Marsh	cc822082a7	Refactor `noqa` directive parsing away from regex-based implementation (#5554 ) ## Summary I'll write up a more detailed description tomorrow, but in short, this PR removes our regex-based implementation in favor of "manual" parsing. I tried a couple different implementations. In the benchmarks below: - `Directive/Regex` is our implementation on `main`. - `Directive/Find` just uses `text.find("noqa")`, which is insufficient, since it doesn't cover case-insensitive variants like `NOQA`, and doesn't handle multiple `noqa` matches in a single like, like ` # Here's a noqa comment # noqa: F401`. But it's kind of a baseline. - `Directive/Memchr` uses three `memchr` iterative finders (one for `noqa`, `NOQA`, and `NoQA`). - `Directive/AhoCorasick` is roughly the variant checked-in here. The raw results: ``` Directive/Regex/# noqa: F401 time: [273.69 ns 274.71 ns 276.03 ns] change: [+1.4467% +1.8979% +2.4243%] (p = 0.00 < 0.05) Performance has regressed. Found 15 outliers among 100 measurements (15.00%) 3 (3.00%) low mild 8 (8.00%) high mild 4 (4.00%) high severe Directive/Find/# noqa: F401 time: [66.972 ns 67.048 ns 67.132 ns] change: [+2.8292% +2.9377% +3.0540%] (p = 0.00 < 0.05) Performance has regressed. Found 15 outliers among 100 measurements (15.00%) 1 (1.00%) low severe 3 (3.00%) low mild 8 (8.00%) high mild 3 (3.00%) high severe Directive/AhoCorasick/# noqa: F401 time: [76.922 ns 77.189 ns 77.536 ns] change: [+0.4265% +0.6862% +0.9871%] (p = 0.00 < 0.05) Change within noise threshold. Found 8 outliers among 100 measurements (8.00%) 1 (1.00%) low mild 3 (3.00%) high mild 4 (4.00%) high severe Directive/Memchr/# noqa: F401 time: [62.627 ns 62.654 ns 62.679 ns] change: [-0.1780% -0.0887% -0.0120%] (p = 0.03 < 0.05) Change within noise threshold. Found 11 outliers among 100 measurements (11.00%) 1 (1.00%) low severe 5 (5.00%) low mild 3 (3.00%) high mild 2 (2.00%) high severe Directive/Regex/# noqa: F401, F841 time: [321.83 ns 322.39 ns 322.93 ns] change: [+8602.4% +8623.5% +8644.5%] (p = 0.00 < 0.05) Performance has regressed. Found 5 outliers among 100 measurements (5.00%) 1 (1.00%) low severe 2 (2.00%) low mild 1 (1.00%) high mild 1 (1.00%) high severe Directive/Find/# noqa: F401, F841 time: [78.618 ns 78.758 ns 78.896 ns] change: [+1.6909% +1.8771% +2.0628%] (p = 0.00 < 0.05) Performance has regressed. Found 3 outliers among 100 measurements (3.00%) 3 (3.00%) high mild Directive/AhoCorasick/# noqa: F401, F841 time: [87.739 ns 88.057 ns 88.468 ns] change: [+0.1843% +0.4685% +0.7854%] (p = 0.00 < 0.05) Change within noise threshold. Found 11 outliers among 100 measurements (11.00%) 5 (5.00%) low mild 3 (3.00%) high mild 3 (3.00%) high severe Directive/Memchr/# noqa: F401, F841 time: [80.674 ns 80.774 ns 80.860 ns] change: [-0.7343% -0.5633% -0.4031%] (p = 0.00 < 0.05) Change within noise threshold. Found 14 outliers among 100 measurements (14.00%) 4 (4.00%) low severe 9 (9.00%) low mild 1 (1.00%) high mild Directive/Regex/# noqa time: [194.86 ns 195.93 ns 196.97 ns] change: [+11973% +12039% +12103%] (p = 0.00 < 0.05) Performance has regressed. Found 6 outliers among 100 measurements (6.00%) 5 (5.00%) low mild 1 (1.00%) high mild Directive/Find/# noqa time: [25.327 ns 25.354 ns 25.383 ns] change: [+3.8524% +4.0267% +4.1845%] (p = 0.00 < 0.05) Performance has regressed. Found 9 outliers among 100 measurements (9.00%) 6 (6.00%) high mild 3 (3.00%) high severe Directive/AhoCorasick/# noqa time: [34.267 ns 34.368 ns 34.481 ns] change: [+0.5646% +0.8505% +1.1281%] (p = 0.00 < 0.05) Change within noise threshold. Found 5 outliers among 100 measurements (5.00%) 5 (5.00%) high mild Directive/Memchr/# noqa time: [21.770 ns 21.818 ns 21.874 ns] change: [-0.0990% +0.1464% +0.4046%] (p = 0.26 > 0.05) No change in performance detected. Found 10 outliers among 100 measurements (10.00%) 4 (4.00%) low mild 4 (4.00%) high mild 2 (2.00%) high severe Directive/Regex/# type: ignore # noqa: E501 time: [278.76 ns 279.69 ns 280.72 ns] change: [+7449.4% +7469.8% +7490.5%] (p = 0.00 < 0.05) Performance has regressed. Found 3 outliers among 100 measurements (3.00%) 1 (1.00%) low mild 1 (1.00%) high mild 1 (1.00%) high severe Directive/Find/# type: ignore # noqa: E501 time: [67.791 ns 67.976 ns 68.184 ns] change: [+2.8321% +3.1735% +3.5418%] (p = 0.00 < 0.05) Performance has regressed. Found 6 outliers among 100 measurements (6.00%) 5 (5.00%) high mild 1 (1.00%) high severe Directive/AhoCorasick/# type: ignore # noqa: E501 time: [75.908 ns 76.055 ns 76.210 ns] change: [+0.9269% +1.1427% +1.3955%] (p = 0.00 < 0.05) Change within noise threshold. Found 1 outliers among 100 measurements (1.00%) 1 (1.00%) high severe Directive/Memchr/# type: ignore # noqa: E501 time: [72.549 ns 72.723 ns 72.957 ns] change: [+1.5881% +1.9660% +2.3974%] (p = 0.00 < 0.05) Performance has regressed. Found 15 outliers among 100 measurements (15.00%) 10 (10.00%) high mild 5 (5.00%) high severe Directive/Regex/# type: ignore # nosec time: [66.967 ns 67.075 ns 67.207 ns] change: [+1713.0% +1715.8% +1718.9%] (p = 0.00 < 0.05) Performance has regressed. Found 10 outliers among 100 measurements (10.00%) 1 (1.00%) low severe 3 (3.00%) low mild 2 (2.00%) high mild 4 (4.00%) high severe Directive/Find/# type: ignore # nosec time: [18.505 ns 18.548 ns 18.597 ns] change: [+1.3520% +1.6976% +2.0333%] (p = 0.00 < 0.05) Performance has regressed. Found 4 outliers among 100 measurements (4.00%) 4 (4.00%) high mild Directive/AhoCorasick/# type: ignore # nosec time: [16.162 ns 16.206 ns 16.252 ns] change: [+1.2919% +1.5587% +1.8430%] (p = 0.00 < 0.05) Performance has regressed. Found 4 outliers among 100 measurements (4.00%) 3 (3.00%) high mild 1 (1.00%) high severe Directive/Memchr/# type: ignore # nosec time: [39.192 ns 39.233 ns 39.276 ns] change: [+0.5164% +0.7456% +0.9790%] (p = 0.00 < 0.05) Change within noise threshold. Found 13 outliers among 100 measurements (13.00%) 2 (2.00%) low severe 4 (4.00%) low mild 3 (3.00%) high mild 4 (4.00%) high severe Directive/Regex/# some very long comment that # is interspersed with characters but # no directive time: [81.460 ns 81.578 ns 81.703 ns] change: [+2093.3% +2098.8% +2104.2%] (p = 0.00 < 0.05) Performance has regressed. Found 4 outliers among 100 measurements (4.00%) 2 (2.00%) low mild 2 (2.00%) high mild Directive/Find/# some very long comment that # is interspersed with characters but # no directive time: [26.284 ns 26.331 ns 26.387 ns] change: [+0.7554% +1.1027% +1.3832%] (p = 0.00 < 0.05) Change within noise threshold. Found 6 outliers among 100 measurements (6.00%) 5 (5.00%) high mild 1 (1.00%) high severe Directive/AhoCorasick/# some very long comment that # is interspersed with characters but # no direc... time: [28.643 ns 28.714 ns 28.787 ns] change: [+1.3774% +1.6780% +2.0028%] (p = 0.00 < 0.05) Performance has regressed. Found 2 outliers among 100 measurements (2.00%) 2 (2.00%) high mild Directive/Memchr/# some very long comment that # is interspersed with characters but # no directive time: [55.766 ns 55.831 ns 55.897 ns] change: [+1.5802% +1.7476% +1.9021%] (p = 0.00 < 0.05) Performance has regressed. Found 2 outliers among 100 measurements (2.00%) 2 (2.00%) low mild ``` While memchr is faster than aho-corasick in some of the common cases (like `# noqa: F401`), the latter is way, way faster when there _isn't_ a match (like 2x faster -- see the last two cases). Since most comments _aren't_ `noqa` comments, this felt like the right tradeoff. Note that all implementations are significantly faster than the regex version. (I know I originally reported a 10x speedup, but I ended up improving the regex version a bit in some prior PRs, so it got unintentionally faster via some refactors.) There's also one behavior change in here, which is that we now allow variable spaces, e.g., `#noqa` or `# noqa`. Previously, we required exactly one space. This thus closes #5177.	2023-07-06 16:03:10 +00:00
Charlie Marsh	9a8e5f7877	Run `cargo update` (#5534 ) ```console ❯ cargo update Updating crates.io index Updating git repository `https://github.com/charliermarsh/LibCST` Updating git repository `https://github.com/astral-sh/RustPython-Parser.git` Updating git repository `https://github.com/youknowone/unicode_names2.git` Updating bitflags v2.3.2 -> v2.3.3 Updating bstr v1.5.0 -> v1.6.0 Updating clap v4.3.8 -> v4.3.11 Updating clap_builder v4.3.8 -> v4.3.11 Updating clap_complete v4.3.1 -> v4.3.2 Updating colored v2.0.0 -> v2.0.4 Removing hermit-abi v0.2.6 Removing hermit-abi v0.3.1 Adding hermit-abi v0.3.2 Updating is-terminal v0.4.7 -> v0.4.8 Updating itoa v1.0.6 -> v1.0.8 Adding linux-raw-sys v0.4.3 Updating num_cpus v1.15.0 -> v1.16.0 Updating paste v1.0.12 -> v1.0.13 Updating pin-project-lite v0.2.9 -> v0.2.10 Updating quote v1.0.28 -> v1.0.29 Updating regex v1.8.4 -> v1.9.0 Updating regex-automata v0.1.10 -> v0.3.0 Updating regex-syntax v0.7.2 -> v0.7.3 Removing rustix v0.37.20 Adding rustix v0.37.23 Adding rustix v0.38.3 Updating rustversion v1.0.12 -> v1.0.13 Updating ryu v1.0.13 -> v1.0.14 Updating serde v1.0.164 -> v1.0.166 Updating serde_derive v1.0.164 -> v1.0.166 Updating serde_json v1.0.99 -> v1.0.100 Updating syn v2.0.22 -> v2.0.23 Updating thiserror v1.0.40 -> v1.0.41 Updating thiserror-impl v1.0.40 -> v1.0.41 Updating unicode-ident v1.0.9 -> v1.0.10 Updating uuid v1.3.4 -> v1.4.0 Updating windows-targets v0.48.0 -> v0.48.1 ```	2023-07-05 12:34:15 -04:00
Charlie Marsh	324455f580	Bump version to 0.0.277 (#5515 )	2023-07-04 17:31:32 -04:00
Thomas de Zeeuw	0b963ddcfa	Add unreachable code rule (#5384 ) Co-authored-by: Thomas de Zeeuw <thomas@astral.sh> Co-authored-by: Micha Reiser <micha@reiser.io>	2023-07-04 14:27:23 +00:00
Charlie Marsh	3992c47c00	Bump version to 0.0.276 (#5488 )	2023-07-03 18:02:49 +00:00
Micha Reiser	dc65007fe9	Use rayon to parallelize the stability check <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR uses rayon to parallelize the stability check by scheduling each project as its own task. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan I ran the ecosystem check. It now makes use of all cores (except at the end, there are some large projects). ## Performance The check now completes in minutes where it took about 30 minutes before. <!-- How was it tested? -->	2023-06-30 10:05:25 +02:00
Micha Reiser	955e9ef821	Fix invalid syntax for binary expression in unary op (#5370 )	2023-06-29 08:09:26 +02:00
Charlie Marsh	6587fb844a	Add snapshot tests for resolver (#5404 ) ## Summary This PR adds some snapshot tests for the resolver based on executing resolutions within a "mock" of the Airflow repo (that is: a folder that contains a subset of the repo's files, but all empty, and with an only-partially-complete virtual environment). It's intended to act as a lightweight integration test, to enable us to test resolutions on a "real" project without adding a dependency on Airflow itself.	2023-06-28 13:38:51 +00:00
Charlie Marsh	1ed227a1e0	Port Pyright's import resolver to Rust (#5381 ) ## Summary This PR contains the first step towards enabling robust first-party, third-party, and standard library import resolution in Ruff (including support for `typeshed`, stub files, native modules, etc.) by porting Pyright's import resolver to Rust. The strategy taken here was to start with a more-or-less direct port of the Pyright's TypeScript resolver. The code is intentionally similar, and the test suite is effectively a superset of Pyright's test suite for its own resolver. Due to the nature of the port, the code is very, very non-idiomatic for Rust. The code is also entirely unused outside of the test suite, and no effort has been made to integrate it with the rest of the codebase. Future work will include: - Refactoring the code (now that it works) to match Rust and Ruff idioms. - Further testing, in practice, to ensure that the resolver can resolve imports in a complex project, when provided with a virtual environment path. - Caching, to minimize filesystem lookups and redundant resolutions. - Integration into Ruff itself (use Ruff's existing settings, find rules that can make use of robust resolution, etc.)	2023-06-27 16:15:07 +00:00
konstin	7f6cb9dfb5	Format call expressions (without call chaining) (#5341 ) ## Summary This formats call expressions with magic trailing comma and parentheses behaviour but without call chaining ## Test Plan Lots of new test fixtures, including some that don't work yet	2023-06-27 09:29:40 +00:00
Dhruv Manilawala	2fc38d81e6	Experimental release for Jupyter notebook integration (#5363 ) ## Summary Experimental release for Jupyter Notebook integration. Currently, this requires a user to explicitly opt-in using the [include](https://beta.ruff.rs/docs/settings/#include) configuration: ```toml [tool.ruff] include = [".py", ".pyi", "*/pyproject.toml", ".ipynb"] ``` Or, a user can pass in the file directly: ```sh ruff check path/to/notebook.ipynb ``` For known limitations, please refer #5188 ## Test Plan Following command should work without the `--all-features` flag: ```sh cargo dev round-trip /path/to/notebook.ipynb ``` Following command should work with the above config file along with `select = ["ALL"]`: ```sh cargo run --bin ruff -- check --no-cache --config=../test-repos/openai-cookbook/pyproject.toml --fix ../test-repos/openai-cookbook/ ``` Passing the Jupyter notebook directly: ```sh cargo run --bin ruff -- check --no-cache --isolated --select=ALL --fix ../test-repos/openai-cookbook/examples/Classification_using_embeddings.ipynb ```	2023-06-26 21:22:42 +05:30
Micha Reiser	f18a1f70de	Add tests for skip magic trailing comma <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR adds tests that verify that the magic trailing comma is not respected if disabled in the formatter options. Our test setup now allows to create a `<fixture-name>.options.json` file that contains an array of configurations that should be tested. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan It's all about tests :) <!-- How was it tested? -->	2023-06-26 14:15:55 +02:00
Micha Reiser	8879927b9a	Use `insta::glob` instead of `fixture` macro (#5364 )	2023-06-26 08:46:18 +00:00
Charlie Marsh	18c73c1f9b	Improve backslash-detection rule for docstrings (#5360 )	2023-06-26 01:58:20 +00:00
Charlie Marsh	b233763156	Run `cargo update` (#5357 )	2023-06-25 18:16:59 -04:00
Micha Reiser	6ba9d5d5a4	Upgrade RustPython (#5334 )	2023-06-23 20:39:47 +00:00
Charlie Marsh	f45d1c2b84	Remove HashMap and HashSet for known-standard-library detection (#5345 ) ## Summary This is a lot more concise and probably much more performant (with fewer instructions).	2023-06-23 19:59:03 +00:00
Micha Reiser	c52aa8f065	Basic string formatting <!-- Thank you for contributing to Ruff! To help us out with reviewing, please consider the following: - Does this pull request include a summary of the change? (See below.) - Does this pull request include a descriptive title? - Does this pull request include references to any relevant issues? --> ## Summary This PR implements formatting for non-f-string Strings that do not use implicit concatenation. Docstring formatting is out of the scope of this PR. <!-- What's the purpose of the change? What does it do, and why? --> ## Test Plan I added a few tests for simple string literals. ## Performance Ouch. This is hitting performance somewhat hard. This is probably because we now iterate each string a couple of times: 1. To detect if it is an implicit string continuation 2. To detect if the string contains any new lines 3. To detect the preferred quote 4. To normalize the string Edit: I integrated the detection of newlines into the preferred quote detection so that we only iterate the string three time. We can probably do better by merging the implicit string continuation with the quote detection and new line detection by iterating till the end of the string part and returning the offset. We then use our simple tokenizer to skip over any comments or whitespace until we find the first non trivia token. From there we keep continue doing this in a loop until we reach the end o the string. I'll leave this improvement for later.	2023-06-23 09:46:05 +02:00
Micha Reiser	f7e1cf4b51	Format `class` definitions (#5289 )	2023-06-22 09:09:43 +00:00
Charlie Marsh	1c0a3a467f	Bump version to 0.0.275 (#5276 )	2023-06-21 21:53:37 -04:00
Charlie Marsh	e0339b538b	Bump version to 0.0.274 (#5230 )	2023-06-20 22:12:32 -04:00
Micha Reiser	e520a3a721	Fix ArgWithDefault comments handling (#5204 )	2023-06-20 20:48:07 +00:00
Charlie Marsh	fde5dbc9aa	Bump version to 0.0.273 (#5218 )	2023-06-20 14:37:28 -04:00
Charlie Marsh	6331598511	Upgrade `RustPython` to access ranged names (#5194 ) ## Summary In https://github.com/astral-sh/RustPython-Parser/pull/8, we modified RustPython to include ranges for any identifiers that aren't `Expr::Name` (which already has an identifier). For example, the `e` in `except ValueError as e` was previously un-ranged. To extract its range, we had to do some lexing of our own. This change should improve performance and let us remove a bunch of code. ## Test Plan `cargo test`	2023-06-20 15:43:38 +00:00
Charlie Marsh	36e01ad6eb	Upgrade RustPython (#5192 ) ## Summary This PR upgrade RustPython to pull in the changes to `Arguments` (zip defaults with their identifiers) and all the renames to `CmpOp` and friends.	2023-06-19 21:09:53 +00:00
Dhruv Manilawala	48f4f2d63d	Maintain consistency when deserializing to JSON (#5114 ) ## Summary Maintain consistency while deserializing Jupyter notebook to JSON. The following changes were made: 1. Use string array to store the source value as that's the default (`5781720423/nbformat/v4/nbjson.py (L56-L57)`) 2. Remove unused structs and enums 3. Reorder the keys in alphabetical order as that's the default. (`5781720423/nbformat/v4/nbjson.py (L51)`) ### Side effect Removing the `preserve_order` feature means that the order of keys in JSON output (`--format json`) will be in alphabetical order. This is because the value is represented using `serde_json::Value` which internally is a `BTreeMap`, thus sorting it as per the string key. For posterity if this turns out to be not ideal, then we could define a struct representing the JSON object and the order of struct fields will determine the order in the JSON string. ## Test Plan Add a test case to assert the raw JSON string.	2023-06-19 23:47:56 +05:30
Thomas de Zeeuw	e3c12764f8	Only use a single cache file per Python package (#5117 ) ## Summary This changes the caching design from one cache file per source file, to one cache file per package. This greatly reduces the amount of cache files that are opened and written, while maintaining roughly the same (combined) size as bincode is very compact. Below are some very much not scientific performance tests. It uses projects/sources to check: * small.py: single, 31 bytes Python file with 2 errors. * test.py: single, 43k Python file with 8 errors. * fastapi: FastAPI repo, 1134 files checked, 0 errors. Source \| Before # files \| After # files \| Before size \| After size -------\|-------\|-------\|-------\|------- small.py \| 1 \| 1 \| 20 K \| 20 K test.py \| 1 \| 1 \| 60 K \| 60 K fastapi \| 1134 \| 518 \| 4.5 M \| 2.3 M One question that might come up is why fastapi still has 518 cache files and not 1? That is because this is using the existing package resolution, which sees examples, docs, etc. as separate from the "main" source code (in the fastapi directory in the repo). In this future it might be worth consider switching to a one cache file per repo strategy. This new design is not perfect and does have a number of known issues. First, like the old design it doesn't remove the cache for a source file that has been (re)moved until `ruff clean` is called. Second, this currently uses a large mutex around the mutation of the package cache (e.g. inserting result). This could be (or become) a bottleneck. It's future work to test and improve this (if needed). Third, currently the packages and opened and stored in a sequential loop, this could be done parallel. This is also future work. ## Test Plan Run `ruff check` (with caching enabled) twice on any Python source code and it should produce the same results.	2023-06-19 17:46:13 +02:00
konstin	b8d378b0a3	Add a script that tests formatter stability on repositories (#5055 ) ## Summary We want to ensure that once formatted content stays the same when formatted again, which is known as formatter stability or formatter idempotency, and that the formatter prints syntactically valid code. As our test cases cover only a limited amount of code, this allows checking entire repositories. This adds a new subcommand to `ruff_dev` which can be invoked as `cargo run --bin ruff_dev -- check-formatter-stability <repo>`. While initially only intended to check stability, it has also found cases where the formatter printed invalid syntax or panicked. ## Test Plan Running this on cpython is already identifying bugs (https://github.com/astral-sh/ruff/pull/5089)	2023-06-19 14:13:38 +00:00
Thomas de Zeeuw	b0f89fa814	Support glob patterns in pep8_naming ignore-names (#5024 ) ## Summary Support glob patterns in pep8_naming ignore-names. Closes #2787 ## Test Plan Added new tests.	2023-06-13 17:37:13 +02:00
Dhruv Manilawala	d8f5d2d767	Add support for auto-fix in Jupyter notebooks (#4665 ) ## Summary Add support for applying auto-fixes in Jupyter Notebook. ### Solution Cell offsets are the boundaries for each cell in the concatenated source code. They are represented using `TextSize`. It includes the start and end offset as well, thus creating a range for each cell. These offsets are updated using the `SourceMap` markers. ### SourceMap `SourceMap` contains markers constructed from each edits which tracks the original source code position to the transformed positions. The following drawing might make it clear: ![SourceMap visualization](https://github.com/astral-sh/ruff/assets/67177269/3c94e591-70a7-4b57-bd32-0baa91cc7858) The center column where the dotted lines are present are the markers included in the `SourceMap`. The `Notebook` looks at these markers and updates the cell offsets after each linter loop. If you notice closely, the destination takes into account all of the markers before it. The index is constructed only when required as it's only used to render the diagnostics. So, a `OnceCell` is used for this purpose. The cell offsets, cell content and the index will be updated after each iteration of linting in the mentioned order. The order is important here as the content is updated as per the new offsets and index is updated as per the new content. ## Limitations ### 1 Styling rules such as the ones in `pycodestyle` will not be applicable everywhere in Jupyter notebook, especially at the cell boundaries. Let's take an example where a rule suggests to have 2 blank lines before a function and the cells contains the following code: ```python import something # --- def first(): pass def second(): pass ``` (Again, the comment is only to visualize cell boundaries.) In the concatenated source code, the 2 blank lines will be added but it shouldn't actually be added when we look in terms of Jupyter notebook. It's as if the function `first` is at the start of a file. `nbqa` solves this by recording newlines before and after running `autopep8`, then running the tool and restoring the newlines at the end (refer https://github.com/nbQA-dev/nbQA/pull/807). ## Test Plan Three commands were run in order with common flags (`--select=ALL --no-cache --isolated`) to isolate which stage the problem is occurring: 1. Only diagnostics 2. Fix with diff (`--fix --diff`) 3. Fix (`--fix`) ### https://github.com/facebookresearch/segment-anything ``` ------------------------------------------------------------------------------- Jupyter Notebooks 3 0 0 0 0 \|- Markdown 3 98 0 94 4 \|- Python 3 513 468 4 41 (Total) 611 468 98 45 ------------------------------------------------------------------------------- ``` ```console $ cargo run --all-features --bin ruff -- check --no-cache --isolated --select=ALL /path/to/segment-anything/*/.ipynb --fix ... Found 180 errors (89 fixed, 91 remaining). ``` ### https://github.com/openai/openai-cookbook ``` ------------------------------------------------------------------------------- Jupyter Notebooks 65 0 0 0 0 \|- Markdown 64 3475 12 2507 956 \|- Python 65 9700 7362 1101 1237 (Total) 13175 7374 3608 2193 =============================================================================== ``` ```console $ cargo run --all-features --bin ruff -- check --no-cache --isolated --select=ALL /path/to/openai-cookbook/*/.ipynb --fix error: Failed to parse /path/to/openai-cookbook/examples/vector_databases/Using_vector_databases_for_embeddings_search.ipynb:cell 4:29:18: unexpected token '-' ... Found 4227 errors (2165 fixed, 2062 remaining). ``` ### https://github.com/tensorflow/docs ``` ------------------------------------------------------------------------------- Jupyter Notebooks 150 0 0 0 0 \|- Markdown 1 55 0 46 9 \|- Python 1 402 289 60 53 (Total) 457 289 106 62 ------------------------------------------------------------------------------- ``` ```console $ cargo run --all-features --bin ruff -- check --no-cache --isolated --select=ALL /path/to/tensorflow-docs/*/.ipynb --fix error: Failed to parse /path/to/tensorflow-docs/site/en/guide/extension_type.ipynb:cell 80:1:1: unexpected token Indent error: Failed to parse /path/to/tensorflow-docs/site/en/r1/tutorials/eager/custom_layers.ipynb:cell 20:1:1: unexpected token Indent error: Failed to parse /path/to/tensorflow-docs/site/en/guide/data.ipynb:cell 175:5:14: unindent does not match any outer indentation level error: Failed to parse /path/to/tensorflow-docs/site/en/r1/tutorials/representation/unicode.ipynb:cell 30:1:1: unexpected token Indent ... Found 12726 errors (5140 fixed, 7586 remaining). ``` ### https://github.com/tensorflow/models ``` ------------------------------------------------------------------------------- Jupyter Notebooks 46 0 0 0 0 \|- Markdown 1 11 0 6 5 \|- Python 1 328 249 19 60 (Total) 339 249 25 65 ------------------------------------------------------------------------------- ``` ```console $ cargo run --all-features --bin ruff -- check --no-cache --isolated --select=ALL /path/to/tensorflow-models/*/.ipynb --fix ... Found 4856 errors (2690 fixed, 2166 remaining). ``` resolves: #1218 fixes: #4556	2023-06-12 14:14:15 +00:00
Charlie Marsh	1d756dc3a7	Move Python whitespace utilities into new `ruff_python_whitespace` crate (#4993 ) ## Summary `ruff_newlines` becomes `ruff_python_whitespace`, and includes the existing "universal newline" handlers alongside the Python whitespace-specific utilities.	2023-06-10 00:59:57 +00:00
konstin	651d89794c	Use phf for confusables to reduce llvm lines (#4926 ) * Use phf for confusables to reduce llvm lines ## Summary This replaces FxHashMap for the confusables with a perfect hash map from the [phf crate](https://github.com/rust-phf/rust-phf) to reduce the generated llvm instructions. A perfect hash function is one that doesn't have any collisions. We can build one because we know all keys at compile time. This improves hashmap efficiency, even though this is likely not noticeable in our case (except someone has a large non-english crate to test on). The original hashmap contained a lot of duplicates, which i had to remove when phf_map complained, i did so by sorting the keys. The important part that it reduces the llvm instructions generated (#3808, `RUSTFLAGS="-Csymbol-mangling-version=v0" cargo llvm-lines -p ruff --lib \| head -20`): ``` Lines Copies Function name ----- ------ ------------- 1740502 38973 (TOTAL) 27423 (1.6%, 1.6%) 1 (0.0%, 0.0%) ruff[cef4c65d96248843]::rules::ruff::rules::confusables::CONFUSABLES::{closure#0} 10193 (0.6%, 2.2%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::codes::RuleCodePrefix>::iter 8107 (0.5%, 2.6%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::codes::Rule>::noqa_code 7345 (0.4%, 3.0%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::checkers::ast::Checker as ruff_python_ast[3778b140caf21545]::visitor::Visitor>::visit_stmt 6412 (0.4%, 3.4%) 1 (0.0%, 0.0%) <<ruff[cef4c65d96248843]::settings::options::Options as serde[d89b1b632568f5a3]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]:🇩🇪:Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]:🇩🇪:spanned::SpannedDeserializer<toml_edit[7e3a6c5e67260672]:🇩🇪:value::ValueDeserializer>> 6412 (0.4%, 3.8%) 1 (0.0%, 0.0%) <<ruff[cef4c65d96248843]::settings::options::Options as serde[d89b1b632568f5a3]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]:🇩🇪:Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]:🇩🇪:table::TableMapAccess> 6409 (0.4%, 4.2%) 1 (0.0%, 0.0%) <<ruff[cef4c65d96248843]::settings::options::Options as serde[d89b1b632568f5a3]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]:🇩🇪:Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]:🇩🇪:datetime::DatetimeDeserializer> 5696 (0.3%, 4.5%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::checkers::ast::Checker as ruff_python_ast[3778b140caf21545]::visitor::Visitor>::visit_expr 4448 (0.3%, 4.7%) 1 (0.0%, 0.0%) ruff[cef4c65d96248843]::flake8_to_ruff::converter::convert 3702 (0.2%, 4.9%) 1 (0.0%, 0.0%) <&ruff[cef4c65d96248843]::registry::Linter as core[da82827a87f140f9]::iter::traits::collect::IntoIterator>::into_iter 3349 (0.2%, 5.1%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::registry::Linter>::code_for_rule 3132 (0.2%, 5.3%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::codes::Rule as core[da82827a87f140f9]::fmt::Debug>::fmt 3130 (0.2%, 5.5%) 1 (0.0%, 0.0%) <&str as core[da82827a87f140f9]::convert::From<&ruff[cef4c65d96248843]::codes::Rule>>::from 3130 (0.2%, 5.7%) 1 (0.0%, 0.0%) <&str as core[da82827a87f140f9]::convert::From<ruff[cef4c65d96248843]::codes::Rule>>::from 3130 (0.2%, 5.9%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::codes::Rule as core[da82827a87f140f9]::convert::AsRef<str>>::as_ref 3128 (0.2%, 6.0%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::codes::RuleIter>::get 2669 (0.2%, 6.2%) 1 (0.0%, 0.0%) <<ruff[cef4c65d96248843]::settings::options::Options as serde[d89b1b632568f5a3]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]:🇩🇪:Visitor>::visit_seq::<toml_edit[7e3a6c5e67260672]:🇩🇪:array::ArraySeqAccess> ``` After: ``` Lines Copies Function name ----- ------ ------------- 1710487 38900 (TOTAL) 10193 (0.6%, 0.6%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::codes::RuleCodePrefix>::iter 8107 (0.5%, 1.1%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::codes::Rule>::noqa_code 7345 (0.4%, 1.5%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::checkers::ast::Checker as ruff_python_ast[5588cd60041c8605]::visitor::Visitor>::visit_stmt 6412 (0.4%, 1.9%) 1 (0.0%, 0.0%) <<ruff[52408f46d2058296]::settings::options::Options as serde[d89b1b632568f5a3]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]:🇩🇪:Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]:🇩🇪:spanned::SpannedDeserializer<toml_edit[7e3a6c5e67260672]:🇩🇪:value::ValueDeserializer>> 6412 (0.4%, 2.2%) 1 (0.0%, 0.0%) <<ruff[52408f46d2058296]::settings::options::Options as serde[d89b1b632568f5a3]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]:🇩🇪:Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]:🇩🇪:table::TableMapAccess> 6409 (0.4%, 2.6%) 1 (0.0%, 0.0%) <<ruff[52408f46d2058296]::settings::options::Options as serde[d89b1b632568f5a3]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]:🇩🇪:Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]:🇩🇪:datetime::DatetimeDeserializer> 5696 (0.3%, 3.0%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::checkers::ast::Checker as ruff_python_ast[5588cd60041c8605]::visitor::Visitor>::visit_expr 4448 (0.3%, 3.2%) 1 (0.0%, 0.0%) ruff[52408f46d2058296]::flake8_to_ruff::converter::convert 3702 (0.2%, 3.4%) 1 (0.0%, 0.0%) <&ruff[52408f46d2058296]::registry::Linter as core[da82827a87f140f9]::iter::traits::collect::IntoIterator>::into_iter 3349 (0.2%, 3.6%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::registry::Linter>::code_for_rule 3132 (0.2%, 3.8%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::codes::Rule as core[da82827a87f140f9]::fmt::Debug>::fmt 3130 (0.2%, 4.0%) 1 (0.0%, 0.0%) <&str as core[da82827a87f140f9]::convert::From<&ruff[52408f46d2058296]::codes::Rule>>::from 3130 (0.2%, 4.2%) 1 (0.0%, 0.0%) <&str as core[da82827a87f140f9]::convert::From<ruff[52408f46d2058296]::codes::Rule>>::from 3130 (0.2%, 4.4%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::codes::Rule as core[da82827a87f140f9]::convert::AsRef<str>>::as_ref 3128 (0.2%, 4.5%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::codes::RuleIter>::get 2669 (0.2%, 4.7%) 1 (0.0%, 0.0%) <<ruff[52408f46d2058296]::settings::options::Options as serde[d89b1b632568f5a3]:🇩🇪:Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]:🇩🇪:Visitor>::visit_seq::<toml_edit[7e3a6c5e67260672]:🇩🇪:array::ArraySeqAccess> 2659 (0.2%, 4.9%) 1 (0.0%, 0.0%) <&ruff[52408f46d2058296]::codes::Pylint as core[da82827a87f140f9]::iter::traits::collect::IntoIterator>::into_iter ``` I'd assume this has a positive effect both on compile time and on runtime, but i don't know the actual effect on compile times and can't really measure. ## Test plan Check CI for any performance regressions. This should fix #3808 if we merge it. * clippy * Update update_ambiguous_characters.py	2023-06-08 08:13:20 +02:00
Micha Reiser	39a1f3980f	Upgrade RustPython (#4900 )	2023-06-08 05:53:14 +00:00
Charlie Marsh	5235977abc	Bump version to 0.0.272 (#4948 )	2023-06-08 02:17:29 +00:00
Charlie Marsh	2b5fb70482	Bump version to 0.0.271 (#4890 )	2023-06-06 15:11:48 -04:00
Micha Reiser	33434fcb9c	Add Formatter benchmark (#4860 )	2023-06-05 21:05:42 +02:00
Zanie Adkins	5ae4667fd5	Upgrade `criterion` to `0.5.1` (#4838 )	2023-06-03 21:33:44 +01:00
Charlie Marsh	b8f45c93b4	Use a separate fix-isolation group for every parent node (#4774 )	2023-06-02 03:07:55 +00:00
Charlie Marsh	621718784a	Replace deletion-tracking with enforced isolation levels (#4766 )	2023-06-02 02:45:56 +00:00
konstin	d4027d8b65	Use new formatter infrastructure in CLI and test (#4767 ) * Use dummy verbatim formatter for all nodes * Use new formatter infrastructure in CLI and test * Expose the new formatter in the CLI * Merge import blocks	2023-06-01 11:55:04 +02:00
Charlie Marsh	399eb84d5e	Add a `ruff_textwrap` crate (#4731 )	2023-05-31 16:35:23 +00:00
Charlie Marsh	9d0ffd33ca	Move universal newline handling into its own crate (#4729 )	2023-05-31 12:00:47 -04:00
Micha Reiser	6c1ff6a85f	Upgrade RustPython (#4747 )	2023-05-31 08:26:35 +00:00
Charlie Marsh	a4f73ea8c7	Remove unused `getrandom` dependency (#4734 )	2023-05-30 14:34:20 -04:00
Charlie Marsh	ea31229be0	Track `TYPE_CHECKING` blocks in `Importer` (#4593 )	2023-05-30 16:18:10 +00:00
Micha Reiser	0cd453bdf0	Generic "comment to node" association logic (#4642 )	2023-05-30 09:28:01 +00:00
Micha Reiser	6146b75dd0	Add `MultiMap` implementation for storing comments (#4639 )	2023-05-30 09:51:25 +02:00
Micha Reiser	33a7ed058f	Create `PreorderVisitor` trait (#4658 )	2023-05-26 06:14:08 +00:00
konstin	b6a382eeaf	Lint pyproject.toml (#4496 ) This adds a new rule `InvalidPyprojectToml` that lints pyproject.toml by checking if https://github.com/PyO3/pyproject-toml-rs can parse it. This means the linting is currently very basic, e.g. we don't check whether the name is actually a valid python project name or appropriately normalized. It does catch errors e.g. with invalid dependency requirements or problems withs the license specifications. It is open to be extended in the future (validate name, SPDX expressions, classifiers, ...), either in ruff or in pyproject-toml-rs. Test plan: ``` scripts/ecosystem_all_check.sh check --select RUF200 ``` This lead to a bunch of ``` RUF200 Failed to parse pyproject.toml: missing field `name` ``` (e.g. https://github.com/amitsk/fastapi-todos/blob/main/pyproject.toml) which is indeed invalid (https://packaging.python.org/en/latest/specifications/declaring-project-metadata/#specification). Filtering those out, the following other problems were found by `cd target/ecosystem_all_results/ && rg RUF200`: ``` UCL-ARC:rred-reports.stdout.txt 1:pyproject.toml:27:16: RUF200 Failed to parse pyproject.toml: Version specifier `>='3.9'` doesn't match PEP 440 rules EndlessTrax:python-start-project.stdout.txt 1:pyproject.toml:14:16: RUF200 Failed to parse pyproject.toml: Expected package name starting with an alphanumeric character, found '#' redjax:gardening-api.stdout.txt 1:pyproject.toml:7:11: RUF200 Failed to parse pyproject.toml: Version `` doesn't match PEP 440 rules ajslater:codex.stdout.txt 2: 3:17 RUF200 Failed to parse pyproject.toml: invalid type: sequence, expected a string LDmitriy7:404_AvatarsBot.stdout.txt 1:pyproject.toml:3:11: RUF200 Failed to parse pyproject.toml: Version `` doesn't match PEP 440 rules ajslater:comicbox.stdout.txt 1:pyproject.toml:3:17: RUF200 Failed to parse pyproject.toml: invalid type: sequence, expected a string manueldevillena:forecast-earnings.stdout.txt 1:pyproject.toml:24:12: RUF200 Failed to parse pyproject.toml: Expected one of `@`, `(`, `<`, `=`, `>`, `~`, `!`, `;`, found `^` redjax:ohio_utility_scraper.stdout.txt 1:pyproject.toml:11:11: RUF200 Failed to parse pyproject.toml: Version `` doesn't match PEP 440 rules agronholm:typeguard.stdout.txt 1:pyproject.toml:40:8: RUF200 Failed to parse pyproject.toml: Expected a valid marker name, found 'python_implementation' cyuss:decathlon-turnover.stdout.txt 1:pyproject.toml:7:12: RUF200 Failed to parse pyproject.toml: invalid type: string "Youcef", expected a table with 'name' and 'email' keys ajslater:boilerplate.stdout.txt 1:pyproject.toml:3:17: RUF200 Failed to parse pyproject.toml: invalid type: sequence, expected a string kaparoo:lightning-project-template.stdout.txt 1:pyproject.toml:56:16: RUF200 Failed to parse pyproject.toml: You can't mix a >= operator with a local version (`+cu117`) dijital20:pytexas2023-decorators.stdout.txt 1:pyproject.toml:5:11: RUF200 Failed to parse pyproject.toml: Version `` doesn't match PEP 440 rules pfouque:django-anymail-history.stdout.txt 1:pyproject.toml:137:12: RUF200 Failed to parse pyproject.toml: Version specifier `> = 1.2.0` doesn't match PEP 440 rules pfouque:django-fakemessages.stdout.txt 1:pyproject.toml:130:12: RUF200 Failed to parse pyproject.toml: Version specifier `> = 1.2.0` doesn't match PEP 440 rules pypa:build.stdout.txt 1:tests/packages/test-invalid-requirements/pyproject.toml:2:12: RUF200 Failed to parse pyproject.toml: Expected one of `@`, `(`, `<`, `=`, `>`, `~`, `!`, `;`, found `i` 4:tests/packages/test-no-requires/pyproject.toml:1:1: RUF200 Failed to parse pyproject.toml: missing field `requires` UnoYakshi:FRAAND.stdout.txt 2: 3:11 RUF200 Failed to parse pyproject.toml: Version `` doesn't match PEP 440 rules DHolmanCoding:python-template.stdout.txt 1:pyproject.toml:22:1: RUF200 Failed to parse pyproject.toml: missing field `requires` ``` Overall, this emitted errors in 43 out of 3408 projects (`rg -c RUF200 target/ecosystem_all_results/ \| wc -l`) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-05-25 12:05:28 +00:00
Charlie Marsh	f4572fe40b	Bump version to 0.0.270 (#4637 )	2023-05-24 16:34:29 +00:00
Micha Reiser	6943beee66	Remove source position from `FormatElement::DynamicText` (#4619 )	2023-05-24 16:36:14 +02:00
Micha Reiser	652c644c2a	Introduce `ruff_index` crate (#4597 )	2023-05-23 17:40:35 +02:00
Micha Reiser	154439728a	Add `AnyNode` and `AnyNodeRef` unions (#4578 )	2023-05-23 08:53:22 +02:00
Christopher Covington	3b8121379d	Name ambiguous characters (#4448 )	2023-05-22 17:16:57 +00:00
Micha Reiser	daadd24bde	Include decorators in `Function` and `Class` definition ranges (#4467 )	2023-05-22 17:50:42 +02:00
Micha Reiser	063431cb0f	Upgrade RustPython (#4576 )	2023-05-22 14:50:49 +02:00
Micha Reiser	2f35099f81	Remove `regex` dependency from `ruff_python_ast` (#4518 )	2023-05-19 06:44:18 +00:00
Charlie Marsh	d4c0a41b00	Bump version to 0.0.269 (#4506 )	2023-05-18 19:45:20 +00:00
Charlie Marsh	8702b5a40a	Bump version to 0.0.268 (#4501 )	2023-05-18 15:35:46 -04:00
figsoda	bab818e801	Update RustPython dependencies (#4503 )	2023-05-18 15:28:13 -04:00
Jeong, YunWon	4b05ca1198	Specialize ConversionFlag (#4450 )	2023-05-16 18:00:13 +02:00
Charlie Marsh	f0465bf106	Emit non-logical newlines for "empty" lines (#4444 )	2023-05-16 14:58:56 +00:00
Jeong, YunWon	6049aabe27	Update RustPyhon and enable full-lexer feature (#4442 )	2023-05-16 07:19:57 +00:00
Micha Reiser	fa26860296	Refactor range from `Attributed` to `Node`s (#4422 )	2023-05-16 06:36:32 +00:00
Micha Reiser	7e7be05ddf	Upgrade dependencies (#4389 )	2023-05-13 13:00:25 +00:00
Micha Reiser	f5afa8198c	Use new `rustpython_format` crate over `rustpython-common` (#4388 )	2023-05-13 12:35:02 +00:00
Charlie Marsh	dcedd5cd9d	Bump version to 0.0.267 (#4400 )	2023-05-12 19:04:56 +00:00
Charlie Marsh	67076b2dcb	Bump version to 0.0.266 (#4391 )	2023-05-12 13:11:03 -04:00
Jeong, YunWon	bbadbb5de5	Refactor code to use the new RustPython `is` method (#4369 )	2023-05-11 16:16:36 +02:00
Jeong, YunWon	be6e00ef6e	Re-integrate RustPython parser repository (#4359 ) Co-authored-by: Micha Reiser <micha@reiser.io>	2023-05-11 07:47:17 +00:00
Charlie Marsh	11e1380df4	Bump version to 0.0.265 (#4248 )	2023-05-05 13:16:05 -04:00
Charlie Marsh	8cb76f85eb	Bump version to 0.0.264 (#4179 )	2023-05-01 23:33:38 -07:00
Micha Reiser	e04ef42334	Use `memchr` to speedup newline search on x86 (#3985 )	2023-04-26 20:15:47 +01:00
Micha Reiser	cab65b25da	Replace row/column based `Location` with byte-offsets. (#3931 )	2023-04-26 18:11:02 +00:00
Charlie Marsh	7266eb0d69	Add support for providing command-line arguments via `argfile` (#4087 )	2023-04-25 17:58:21 -06:00
Charlie Marsh	fd7ccb4c9e	Bump version to 0.0.263 (#4086 )	2023-04-24 23:32:29 -06:00
Micha Reiser	ba4f4f4672	Upgrade dependencies (#4064 )	2023-04-22 18:04:01 +01:00
Charlie Marsh	25a6bfa9ee	Bump version to 0.0.262 (#4032 )	2023-04-19 15:49:28 -04:00
Micha Reiser	280dffb5e1	Add parser benchmark (#3990 )	2023-04-17 16:43:59 +02:00
Micha Reiser	e8aebee3f6	Pretty print `Diagnostic`s in snapshot tests (#3906 )	2023-04-11 09:03:00 +00:00
Micha Reiser	381203c084	Store source code on message (#3897 )	2023-04-11 07:57:36 +00:00
Micha Reiser	76c47a9a43	Cheap cloneable LineIndex (#3896 )	2023-04-11 07:33:40 +00:00
Micha Reiser	9209e57c5a	Extract message emitters from Printer (#3895 )	2023-04-11 07:24:25 +00:00
Charlie Marsh	255b094b33	Bump version to 0.0.261 (#3881 )	2023-04-04 22:31:01 -04:00
Charlie Marsh	d919adc13c	Introduce a `ruff_python_semantic` crate (#3865 )	2023-04-04 16:50:47 +00:00
Chris Chan	10504eb9ed	Generate `ImportMap` from module path to imported dependencies (#3243 )	2023-04-04 03:31:37 +00:00
konstin	f4cda31708	Use crates.io version of pep440_rs (#3812 ) * Use crates.io version of pep440_rs * Update Cargo.lock	2023-03-30 12:47:07 +00:00
Charlie Marsh	9d3b8eb67b	Bump version to v0.0.260 (#3799 )	2023-03-29 14:51:50 -04:00
Micha Reiser	113a8b8fda	perf(pycodestyle): Reduce allocations when computing logical lines (#3715 )	2023-03-28 09:09:27 +02:00
Charlie Marsh	e603382cf0	Allow diagnostics to generate multi-edit fixes (#3709 )	2023-03-26 16:45:19 -04:00
Agriya Khetarpal	c0befb4670	Use `wild::args()` and add `wild` as a dependency (#3739 )	2023-03-26 14:32:45 +00:00
Charlie Marsh	5c7898124f	Traverse over nested string type annotations (#3724 )	2023-03-25 21:56:09 -04:00
Micha Reiser	7af83460ce	Use unicode-width to determine line-length instead of character count (#3714 )	2023-03-24 17:17:05 -04:00
Jonathan Plasse	dc4d7619ee	Add `Diagnostic.try_amend()` to simplify error handling (#3701 )	2023-03-24 17:10:11 -04:00
Charlie Marsh	f58345dee3	Bump version to v0.0.259 (#3691 )	2023-03-23 14:52:42 -04:00
Charlie Marsh	615887a7fe	Bump version to v0.0.258 (#3671 )	2023-03-22 15:02:57 -04:00
Micha Reiser	f59a22b6e5	Remove unused dependencies (#3644 )	2023-03-21 11:02:41 +01:00
konstin	81d0884974	Add basic jupyter notebook support (#3440 ) * Add basic jupyter notebook support behind a feature flag * Address review comments * Rename in separate commit to make both git and clippy happy * cfg(feature = "jupyter_notebook") another test * Address more review comments * Address more review comments * and clippy and windows * More review comment	2023-03-20 12:06:01 +01:00
Charlie Marsh	0c4926ff7b	Bump version to v0.0.257 (#3591 )	2023-03-17 22:34:10 -04:00
Micha Reiser	87fab4a2e1	Benchmark all rules (#3570 )	2023-03-17 19:29:39 +01:00
Charlie Marsh	12dfd57211	Bump version to v0.0.256 (#3531 )	2023-03-14 22:52:21 -04:00
Charlie Marsh	106a93eab0	Make Clap an optional feature for ruff crate (#3498 )	2023-03-14 11:02:05 -04:00
Micha Reiser	d5700d7c69	Add Micro Benchmark (#3466 )	2023-03-14 08:35:07 +01:00
Charlie Marsh	aa97a092bd	Bump version to v0.0.255 (#3485 )	2023-03-13 14:06:51 -04:00
Jonathan Plasse	b540407b74	Infer target-version from project metadata (#3470 ) * Infer target-version from project metadata * Fix requires-python with ">=3.8.16" * Load requires-python at runtime * Use upstream VersionSpecifiers * Add debug information when parsing ruff.toml * Display debug only if target_version is not set * Bump pep440-rs to add impl Error for Pep440Error	2023-03-13 18:16:01 +01:00
Charlie Marsh	3a5fbd6d74	Upgrade RustPython to fix Serde dependency (#3481 )	2023-03-13 12:29:31 -04:00
Charlie Marsh	c2750a59ab	Implement an iterator for universal newlines (#3454 ) # Summary We need to support CR line endings (as opposed to LF and CRLF line endings, which are already supported). They're rare, but they do appear in Python code, and we tend to panic on any file that uses them. Our `Locator` abstraction now supports CR line endings. However, Rust's `str#lines` implementation does _not_. This PR adds a `UniversalNewlineIterator` implementation that respects all of CR, LF, and CRLF line endings, and plugs it into most of the `.lines()` call sites. As an alternative design, it could be nice if we could leverage `Locator` for this. We've already computed all of the line endings, so we could probably iterate much more efficiently? # Test Plan Largely relying on automated testing, however, also ran over some known failure cases, like #3404.	2023-03-13 00:01:29 -04:00
Charlie Marsh	7a80bcec58	Output GitLab paths relative to `CI_PROJECT_DIR` (#3475 )	2023-03-13 03:03:37 +00:00
Charlie Marsh	024caca233	Introduce a `ruff_diagnostics` crate (#3409 ) ## Summary This PR moves `Diagnostic`, `DiagnosticKind`, and `Fix` into their own crate, which will enable us to further split up Ruff, since sub-linter crates (which need to implement functions that return `Diagnostic`) can now depend on `ruff_diagnostics` rather than Ruff.	2023-03-09 20:48:57 +00:00
Micha Reiser	229f1c34cb	refactor: Extract `ruff_wasm` (#3401 )	2023-03-09 10:07:39 +00:00
Charlie Marsh	ff2c0dd491	Use shared `leading_quote` implementation in ruff_python_formatter (#3396 )	2023-03-08 18:21:59 +00:00
Charlie Marsh	bad6bdda1f	Create a `rust_python_ast` crate (#3370 ) This PR productionizes @MichaReiser's suggestion in https://github.com/charliermarsh/ruff/issues/1820#issuecomment-1440204423, by creating a separate crate for the `ast` module (`rust_python_ast`). This will enable us to further split up the `ruff` crate, as we'll be able to create (e.g.) separate sub-linter crates that have access to these common AST utilities. This was mostly a straightforward copy (with adjustments to module imports), as the few dependencies that _did_ require modifications were handled in #3366, #3367, and #3368.	2023-03-07 15:18:40 +00:00
Charlie Marsh	d1c48016eb	Rename `ruff_python` crate to `ruff_python_stdlib` (#3354 ) In hindsight, `ruff_python` is too general. A good giveaway is that it's actually a prefix of some other crates. The intent of this crate is to reimplement pieces of the Python standard library and CPython itself, so `ruff_python_stdlib` feels appropriate.	2023-03-06 13:43:22 +00:00
konstin	22e6778e17	Add `cargo dev generate-all --check` and catch outdated docs in `cargo test` (#3320 )	2023-03-06 11:28:38 +01:00
Charlie Marsh	376ef929b1	Upgrade RustPython (#3341 )	2023-03-04 14:01:03 -05:00
Jonathan Plasse	8828e12283	Bump dependencies and move more shared dependencies into workspace (#3340 )	2023-03-04 12:36:26 -05:00
Charlie Marsh	bbbc44336e	Bump version to 0.0.254 (#3331 )	2023-03-03 19:11:07 -05:00
Charlie Marsh	dedf8aa5cc	Use presence of convention-specific sections during docstring inference (#3325 )	2023-03-03 17:13:11 -05:00
Micha Reiser	cdbe2ee496	refactor: Introduce `CacheKey` trait (#3323 ) This PR introduces a new `CacheKey` trait for types that can be used as a cache key. I'm not entirely sure if this is worth the "overhead", but I was surprised to find `HashableHashSet` and got scared when I looked at the time complexity of the `hash` function. These implementations must be extremely slow in hashed collections. I then searched for usages and quickly realized that only the cache uses these `Hash` implementations, where performance is less sensitive. This PR introduces a new `CacheKey` trait to communicate the difference between a hash and computing a key for the cache. The new trait can be implemented for types that don't implement `Hash` for performance reasons, and we can define additional constraints on the implementation: For example, we'll want to enforce portability when we add remote caching support. Using a different trait further allows us not to implement it for types without stable identities (e.g. pointers) or use other implementations than the standard hash function.	2023-03-03 18:29:49 +00:00
Charlie Marsh	4de3882088	Upgrade RustPython (#3316 )	2023-03-02 22:59:29 -05:00
Andy Freeland	0ed9fccce9	Upgrade RustPython (#3277 ) Fixes #3207.	2023-02-28 12:21:28 -05:00
Charlie Marsh	061495a9eb	Make BoolOp its own located token (#3265 )	2023-02-28 03:43:28 +00:00
Charlie Marsh	d2a6ed7be6	Upgrade RustPython (#3252 )	2023-02-27 18:21:06 +00:00
Charlie Marsh	386ca7c9a1	Bump version to 0.0.253 (#3245 )	2023-02-26 23:10:04 -05:00
Jeong YunWon	84e96cdcd9	More enum work (#3212 )	2023-02-25 11:40:16 -05:00
Charlie Marsh	eef85067c8	Exclude globsets for --show-settings (#3201 )	2023-02-24 04:23:00 +00:00
Jeong YunWon	da98fab4ae	Adapt is-macro for a few enums (#3182 )	2023-02-24 04:06:56 +00:00
Charlie Marsh	c9fe0708cb	Run `cargo update` (#3179 )	2023-02-23 12:09:36 -05:00
Charlie Marsh	09f8c487ea	Update RustPython to support *tuple annotations (#3178 )	2023-02-23 16:58:38 +00:00
Charlie Marsh	f967f344fc	Add support for basic `Constant::Str` formatting (#3173 ) This PR enables us to apply the proper quotation marks, including support for escapes. There are some significant TODOs, especially around implicit concatenations like: ```py ( "abc" "def" ) ``` Which are represented as a single AST node, which requires us to tokenize _within_ the formatter to identify all the individual string parts.	2023-02-23 16:23:10 +00:00
Charlie Marsh	095f005bf4	Move RustPython vendored and helper code into its own crate (#3171 )	2023-02-23 14:14:16 +00:00
Jeong YunWon	77d43795f8	Replace Autofix::is_enabled to result_like::BoolLike (#3165 )	2023-02-23 07:29:13 -05:00
Charlie Marsh	1c41789c2a	Bump version to 0.0.252 (#3142 )	2023-02-22 14:50:14 -05:00
Charlie Marsh	2f9de335db	Upgrade RustPython to match new flattened exports (#3141 )	2023-02-22 19:36:13 +00:00
Charlie Marsh	f0e0efc46f	Upgrade RustPython to handle trailing commas in map patterns (#3130 )	2023-02-22 11:17:13 -05:00
Charlie Marsh	1efa2e07ad	Avoid match statement misidentification in token rules (#3129 )	2023-02-22 15:44:45 +00:00
Micha Reiser	ed33b75bad	test(ruff_python_formatter): Run all Black tests (#2993 ) This PR changes the testing infrastructure to run all black tests and: * Pass if Ruff and Black generate the same formatting * Fail and write a markdown snapshot that shows the input code, the differences between Black and Ruff, Ruffs output, and Blacks output This is achieved by introducing a new `fixture` macro (open to better name suggestions) that "duplicates" the attributed test for every file that matches the specified glob pattern. Creating a new test for each file over having a test that iterates over all files has the advantage that you can run a single test, and that test failures indicate which case is failing. The `fixture` macro also makes it straightforward to e.g. setup our own spec tests that test very specific formatting by creating a new folder and use insta to assert the formatted output.	2023-02-22 09:25:06 -05:00
Charlie Marsh	e37e9c2ca3	Skip EXE001 and EXE002 rules on Windows (#3111 )	2023-02-21 23:39:56 -05:00
Charlie Marsh	9645790a8b	Support shell expansion for --config argument (#3107 )	2023-02-21 23:33:41 +00:00
Charlie Marsh	fd638a2e54	Bump version to 0.0.251 (#3105 )	2023-02-21 18:13:59 -05:00
Charlie Marsh	06e426f509	Bump version to 0.0.250 (#3095 )	2023-02-21 15:20:46 -05:00
Charlie Marsh	d5c65b5f1b	Add support for structural pattern matching (#3047 )	2023-02-21 18:52:10 +00:00
Charlie Marsh	cdc4e86158	Add support for TryStar (#3089 )	2023-02-21 13:42:20 -05:00
Charlie Marsh	4cfa350112	Bump version to 0.0.249 (#3063 )	2023-02-20 13:11:29 -05:00
Charlie Marsh	2ff3dd5fbe	Bump version to 0.0.248 (#3034 )	2023-02-19 16:21:30 +00:00
Charlie Marsh	370c3a5daf	Remove mdcat dependency (#2959 )	2023-02-16 12:09:37 -05:00
Martin Fischer	294cd95c54	Update clap to fix `ruff check --help` description My two clap bug fixes[1][2] have been merged and released (see the change in README.md). [1]: https://github.com/clap-rs/clap/pull/4710 [2]: https://github.com/clap-rs/clap/pull/4712	2023-02-15 13:30:06 -05:00
Charlie Marsh	39fdc71b49	Bump version to 0.0.247 (#2932 )	2023-02-15 12:06:58 -05:00
Charlie Marsh	ca49b00e55	Add initial formatter implementation (#2883 ) # Summary This PR contains the code for the autoformatter proof-of-concept. ## Crate structure The primary formatting hook is the `fmt` function in `crates/ruff_python_formatter/src/lib.rs`. The current formatter approach is outlined in `crates/ruff_python_formatter/src/lib.rs`, and is structured as follows: - Tokenize the code using the RustPython lexer. - In `crates/ruff_python_formatter/src/trivia.rs`, extract a variety of trivia tokens from the token stream. These include comments, trailing commas, and empty lines. - Generate the AST via the RustPython parser. - In `crates/ruff_python_formatter/src/cst.rs`, convert the AST to a CST structure. As of now, the CST is nearly identical to the AST, except that every node gets a `trivia` vector. But we might want to modify it further. - In `crates/ruff_python_formatter/src/attachment.rs`, attach each trivia token to the corresponding CST node. The logic for this is mostly in `decorate_trivia` and is ported almost directly from Prettier (given each token, find its preceding, following, and enclosing nodes, then attach the token to the appropriate node in a second pass). - In `crates/ruff_python_formatter/src/newlines.rs`, normalize newlines to match Black’s preferences. This involves traversing the CST and inserting or removing `TriviaToken` values as we go. - Call `format!` on the CST, which delegates to type-specific formatter implementations (e.g., `crates/ruff_python_formatter/src/format/stmt.rs` for `Stmt` nodes, and similar for `Expr` nodes; the others are trivial). Those type-specific implementations delegate to kind-specific functions (e.g., `format_func_def`). ## Testing and iteration The formatter is being developed against the Black test suite, which was copied over in-full to `crates/ruff_python_formatter/resources/test/fixtures/black`. The Black fixtures had to be modified to create `[insta](https://github.com/mitsuhiko/insta)`-compatible snapshots, which now exist in the repo. My approach thus far has been to try and improve coverage by tackling fixtures one-by-one. ## What works, and what doesn’t - Most nodes are supported at a basic level (though there are a few stragglers at time of writing, like `StmtKind::Try`). - Newlines are properly preserved in most cases. - Magic trailing commas are properly preserved in some (but not all) cases. - Trivial leading and trailing standalone comments mostly work (although maybe not at the end of a file). - Inline comments, and comments within expressions, often don’t work -- they work in a few cases, but it’s one-off right now. (We’re probably associating them with the “right” nodes more often than we are actually rendering them in the right place.) - We don’t properly normalize string quotes. (At present, we just repeat any constants verbatim.) - We’re mishandling a bunch of wrapping cases (if we treat Black as the reference implementation). Here are a few examples (demonstrating Black's stable behavior): ```py # In some cases, if the end expression is "self-closing" (functions, # lists, dictionaries, sets, subscript accesses, and any length-two # boolean operations that end in these elments), Black # will wrap like this... if some_expression and f( b, c, d, ): pass # ...whereas we do this: if ( some_expression and f( b, c, d, ) ): pass # If function arguments can fit on a single line, then Black will # format them like this, rather than exploding them vertically. if f( a, b, c, d, e, f, g, ... ): pass ``` - We don’t properly preserve parentheses in all cases. Black preserves parentheses in some but not all cases.	2023-02-15 04:06:35 +00:00
Charlie Marsh	f661c90bd7	Remove dependency on `ruff_rowan` (#2875 ) This PR removes the dependency on `ruff_rowan` (i.e., Rome's fork of rust-analyzer's `rowan`), and in turn, trims out a lot of code in `ruff_formatter` that isn't necessary (or isn't _yet_ necessary) to power the autoformatter. We may end up pulling some of this back in -- TBD. For example, the autoformatter has its own comment representation right now, but we may eventually want to use the `comments.rs` data structures defined in `rome_formatter`.	2023-02-15 03:54:08 +00:00
Charlie Marsh	3ef1c2e303	Add `rome_formatter` fork as `ruff_formatter` (#2872 ) The Ruff autoformatter is going to be based on an intermediate representation (IR) formatted via [Wadler's algorithm](https://homepages.inf.ed.ac.uk/wadler/papers/prettier/prettier.pdf). This is architecturally similar to [Rome](https://github.com/rome/tools), Prettier, [Skip](https://github.com/skiplang/skip/blob/master/src/tools/printer/printer.sk), and others. This PR adds a fork of the `rome_formatter` crate from [Rome](https://github.com/rome/tools), renamed here to `ruff_formatter`, which provides generic definitions for a formatter IR as well as a generic IR printer. (We've also pulled in `rome_rowan`, `rome_text_size`, and `rome_text_edit`, though some of these will be removed in future PRs.) Why fork? `rome_formatter` contains code that's specific to Rome's AST representation (e.g., it relies on a fork of rust-analyzer's `rowan`), and we'll likely want to support different abstractions and formatting capabilities (there are already a few changes coming in future PRs). Once we've dropped `ruff_rowan` and trimmed down `ruff_formatter` to the code we currently need, it's also not a huge surface area to maintain and update.	2023-02-14 19:22:55 -05:00
Martin Fischer	a77b4566e4	Fix option links in mkdocs rule pages In `28c9263722` I introduced automatic linkification of option references in rule documentation, which automatically converted the following: ## Options * `namespace-packages` to: ## Options * [`namespace-packages`] [`namespace-packages`]: ../../settings#namespace-packages While the above is a correct CommonMark[1] link definition, what I was missing was that we used mkdocs for our documentation generation, which as it turns out uses a non-CommonMark-compliant Markdown parser, namely Python-Markdown, which contrary to CommonMark doesn't support link definitions containing code tags. This commit fixes the broken links via a regex hack. [1]: https://commonmark.org/	2023-02-14 17:56:21 -05:00
Charlie Marsh	4dd2032687	Unversion unpublished crates (#2882 )	2023-02-14 03:03:49 +00:00
Charlie Marsh	2bf7b35268	Re-enable custom allocators (#2876 )	2023-02-14 02:37:22 +00:00
Charlie Marsh	48a5cd1dd9	Revert "perf: Use custom allocator (#2768 )" (#2841 ) This is causing wheel creation to fail on some of our more exotic build targets: https://github.com/charliermarsh/ruff/actions/runs/4159524132. Let's figure out how to gate appropriately, but for now, reverting to get the release out.	2023-02-12 22:31:34 -05:00
Charlie Marsh	83f6e52c92	Bump version to 0.0.246 (#2834 )	2023-02-12 23:39:51 +00:00
Charlie Marsh	1666e8ba1e	Add a `--show-fixes` flag to include applied fixes in output (#2707 )	2023-02-12 20:48:01 +00:00
Charlie Marsh	9089ef74bc	Upgrade RustPython (#2821 )	2023-02-12 18:45:59 +00:00
Nick Pope	551b810aeb	Add rendering of rule markdown for terminal output (#2747 ) Add rendering of rule markdown for terminal output This is achieved by making use of the `mdcat` crate. See the following links for details: - https://crates.io/crates/mdcat - https://github.com/swsnr/mdcat - https://docs.rs/mdcat/latest/mdcat/	2023-02-12 02:32:45 +00:00
Micha Reiser	863e39fe5f	perf: Use custom allocator (#2768 ) This PR replaces the system allocator with a custom allocator to improve performance: * Windows: mimalloc * Unix: tikv-jemallocator ## Performance: * Linux * `cpython --no-cache`: 208.8ms -> 190.5ms * `cpython`: 32.8ms -> 31ms * Mac: * `cpython --no-cache`: 436.3ms -> 380ms * `cpython`: 40.9ms -> 39.6ms * Windows: * `cpython --no-cache`: 367ms -> 268ms * `cpython`: 92.5ms -> 92.3ms ## Size * Linux: +5MB from 13MB -> 18MB (I need to double check this) * Mac: +0.7MB from 8.3MB-> 9MB * Windows: -0.16MB from 8.29MB -> 8.13MB (that's unexpected)	2023-02-11 13:26:07 -05:00
Charlie Marsh	24faabf1f4	Bump version to 0.0.245	2023-02-10 22:15:27 -05:00
Martin Fischer	bd09a1819f	Drop unused once_cell dependency from ruff_macros	2023-02-10 09:25:29 -05:00
Charlie Marsh	0f622f0126	Upgrade RustPython to pull in newline-handling optimizations (#2688 )	2023-02-09 11:12:43 -05:00
Charlie Marsh	125615af12	Bump version to 0.0.244	2023-02-08 17:28:59 -05:00
Charlie Marsh	f1cdd108e6	Derive `explanation` method on Rule struct via rustdoc (#2642 ) ```console ❯ cargo run rule B017 Finished dev [unoptimized + debuginfo] target(s) in 0.13s Running `target/debug/ruff rule B017` no-assert-raises-exception Code: B017 (flake8-bugbear) ### What it does Checks for `self.assertRaises(Exception)`. ## Why is this bad? `assertRaises(Exception)` can lead to your test passing even if the code being tested is never executed due to a typo. Either assert for a more specific exception (builtin or custom), use `assertRaisesRegex` or the context manager form of `assertRaises`. ```	2023-02-07 17:23:29 -05:00
Charlie Marsh	8fd29b3b60	Remove dependency on `"unparse"` feature (#2641 )	2023-02-07 17:23:09 -05:00
Charlie Marsh	bf718fdf26	Bump Ruff version to 0.0.243	2023-02-06 21:22:54 -05:00
Charlie Marsh	e59b75d31b	Bump version to 0.0.242	2023-02-06 16:25:29 -05:00
Charlie Marsh	87d0aa5561	Move `python` into its own `ruff_python` crate (#2593 )	2023-02-05 17:53:58 -05:00
Micha Reiser	cd8be8c0be	refactor: Introduce crates folder (#2088 ) This PR introduces a new `crates` directory and moves all "product" crates into that folder. Part of #2059.	2023-02-05 16:47:48 -05:00
Charlie Marsh	e3dfa2e04e	Implement pycodestyle's logical line detection (#1130 ) Along with the logical line detection, this adds 14 of the missing `pycodestyle` rules. For now, this is all gated behind a `logical_lines` feature that's off-by-default, which will let us implement all rules prior to shipping, since we want to couple the release of these rules with new defaults and instructions.	2023-02-05 15:06:02 -05:00
Charlie Marsh	7d4f0a8320	Bump Ruff version to 0.0.241	2023-02-03 19:25:12 -05:00
Charlie Marsh	da0374f360	Update RustPython to pull in lexer optimizations (#2551 )	2023-02-03 14:31:53 -05:00
Aarni Koskela	38addbe50d	Soft-deprecate update_check (#2530 )	2023-02-03 11:33:38 -05:00
Charlie Marsh	651f6b6bce	Bump Ruff version to 0.0.240	2023-02-02 12:45:23 -05:00
Charlie Marsh	c15595325c	Bump version to 0.0.239	2023-01-31 19:06:22 -05:00
Charlie Marsh	1cbd929a0a	Bump version to 0.0.238	2023-01-30 16:44:19 -05:00
Charlie Marsh	add7fefeb5	Bump version to 0.0.237	2023-01-28 10:52:14 -05:00
Charlie Marsh	685d9ab848	Bump version to 0.0.236	2023-01-26 18:47:00 -05:00
Charlie Marsh	b346f74915	Run cargo update (#2185 )	2023-01-25 21:32:44 -05:00
Charlie Marsh	edd0e16a02	Bump version to 0.0.235	2023-01-25 18:28:27 -05:00
Simon Brugman	413acdf83c	feat: introduce macro for testing snap files across platforms	2023-01-25 18:00:39 -05:00
Charlie Marsh	9b07d0bd92	Bump version to 0.0.234	2023-01-25 16:55:57 -05:00
Charlie Marsh	605416922d	Bump version to 0.0.233	2023-01-24 10:46:49 -05:00

... 3 4 5 6 7 ...

795 Commits