## Summary
This PR implements a variety of optimizations to improve performance of
the Eradicate rule, which always shows up in all-rules benchmarks and
bothers me. (These improvements are not hugely important, but it was
kind of a fun Friday thing to spend a bit of time on.)
The improvements include:
- Doing cheaper work first (checking for some explicit substrings
upfront).
- Using `aho-corasick` to speed up an exact substring search.
- Merging multiple regular expressions using a `RegexSet`.
- Removing some unnecessary `\s*` and other pieces from the regular
expressions (since we already trim strings before matching on them).
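As a sketch of how these layers can compose (the substrings and patterns below are illustrative, not the rule's actual internals):
```rust
use aho_corasick::AhoCorasick;
use regex::RegexSet;

// Illustrative pre-filter: most comments contain none of these substrings,
// so the merged regex pass below rarely runs at all.
fn looks_like_commented_out_code(comment: &str) -> bool {
    let candidates = AhoCorasick::new(["import", "print(", "="]).unwrap();
    if candidates.find(comment).is_none() {
        return false;
    }
    // One merged pass over all code-like patterns instead of N separate regexes.
    let patterns = RegexSet::new([
        r"^(from|import)\s+\w+",
        r"^print\(",
        r"^\w+\s*=\s*\S",
    ])
    .unwrap();
    patterns.is_match(comment)
}
```
In practice both matchers would be built once (e.g. lazily in a static), not per call.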
## Test Plan
I benchmarked this function in a standalone crate using a variety of
cases. Criterion reports that this version is up to 80% faster, and
almost every case is at least 50% faster:
```
Eradicate/Detection/# Warn if we are installing over top of an existing installation. This can
time: [101.84 ns 102.32 ns 102.82 ns]
change: [-77.166% -77.062% -76.943%] (p = 0.00 < 0.05)
Performance has improved.
Found 3 outliers among 100 measurements (3.00%)
3 (3.00%) high mild
Eradicate/Detection/#from foo import eradicate
time: [74.872 ns 75.096 ns 75.314 ns]
change: [-84.180% -84.131% -84.079%] (p = 0.00 < 0.05)
Performance has improved.
Found 1 outliers among 100 measurements (1.00%)
1 (1.00%) high mild
Eradicate/Detection/# encoding: utf8
time: [46.522 ns 46.862 ns 47.237 ns]
change: [-29.408% -28.918% -28.471%] (p = 0.00 < 0.05)
Performance has improved.
Found 7 outliers among 100 measurements (7.00%)
6 (6.00%) high mild
1 (1.00%) high severe
Eradicate/Detection/# Issue #999
time: [16.942 ns 16.994 ns 17.058 ns]
change: [-57.243% -57.064% -56.815%] (p = 0.00 < 0.05)
Performance has improved.
Found 3 outliers among 100 measurements (3.00%)
2 (2.00%) high mild
1 (1.00%) high severe
Eradicate/Detection/# type: ignore
time: [43.074 ns 43.163 ns 43.262 ns]
change: [-17.614% -17.390% -17.152%] (p = 0.00 < 0.05)
Performance has improved.
Found 5 outliers among 100 measurements (5.00%)
3 (3.00%) high mild
2 (2.00%) high severe
Eradicate/Detection/# user_content_type, _ = TimelineEvent.objects.using(db_alias).get_or_create(
time: [209.40 ns 209.81 ns 210.23 ns]
change: [-32.806% -32.630% -32.470%] (p = 0.00 < 0.05)
Performance has improved.
Eradicate/Detection/# this is = to that :(
time: [72.659 ns 73.068 ns 73.473 ns]
change: [-68.884% -68.775% -68.655%] (p = 0.00 < 0.05)
Performance has improved.
Found 9 outliers among 100 measurements (9.00%)
7 (7.00%) high mild
2 (2.00%) high severe
Eradicate/Detection/#except Exception:
time: [92.063 ns 92.366 ns 92.691 ns]
change: [-64.204% -64.052% -63.909%] (p = 0.00 < 0.05)
Performance has improved.
Found 4 outliers among 100 measurements (4.00%)
2 (2.00%) high mild
2 (2.00%) high severe
Eradicate/Detection/#print(1)
time: [68.359 ns 68.537 ns 68.725 ns]
change: [-72.424% -72.356% -72.278%] (p = 0.00 < 0.05)
Performance has improved.
Found 2 outliers among 100 measurements (2.00%)
1 (1.00%) low mild
1 (1.00%) high mild
Eradicate/Detection/#'key': 1 + 1,
time: [79.604 ns 79.865 ns 80.135 ns]
change: [-69.787% -69.667% -69.549%] (p = 0.00 < 0.05)
Performance has improved.
```
## Summary
This is a follow-up to #7469 that attempts to achieve similar gains, but
without introducing malachite. Instead, this PR removes the `BigInt`
type altogether, opting for a simple enum that allows us to
store small integers directly and only allocate for values that don't
fit in an `i64`:
```rust
/// A Python integer literal. Represents both small (fits in an `i64`) and large integers.
#[derive(Clone, PartialEq, Eq, Hash)]
pub struct Int(Number);

#[derive(Debug, Clone, PartialEq, Eq, Hash)]
pub enum Number {
    /// A "small" number that can be represented as an `i64`.
    Small(i64),
    /// A "large" number that cannot be represented as an `i64`.
    Big(Box<str>),
}

impl std::fmt::Display for Number {
    fn fmt(&self, f: &mut std::fmt::Formatter<'_>) -> std::fmt::Result {
        match self {
            Number::Small(value) => write!(f, "{value}"),
            Number::Big(value) => write!(f, "{value}"),
        }
    }
}
```
We typically don't care about numbers greater than `isize` -- our only
uses are comparisons against small constants (like `1`, `2`, `3`, etc.),
so there's no real loss of information, except in one or two rules where
we're now a little more conservative (with the worst-case being that we
don't flag, e.g., an `itertools.pairwise` that uses an extremely large
value for the slice start constant). For simplicity, a few diagnostics
now show a dedicated message when they see integers that are out of the
supported range (e.g., `outdated-version-block`).
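As an illustration of that conservative behavior, a rule comparing against a small constant can simply bail out on `Big` values (this helper is a hypothetical sketch, not necessarily the PR's actual API):
```rust
impl Number {
    /// Hypothetical helper: the value as an `i64`, if it fits.
    /// Rules that compare against small constants get `None` for big
    /// literals and skip the check, which is the conservative worst case
    /// described above.
    fn as_i64(&self) -> Option<i64> {
        match self {
            Number::Small(value) => Some(*value),
            Number::Big(_) => None,
        }
    }
}
```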
An additional benefit here is that we get to remove a few dependencies,
especially `num-bigint`.
## Test Plan
`cargo test`
## Stack Summary
This stack splits `Settings` into `FormatterSettings` and `LinterSettings` and moves it into `ruff_workspace`. This change is necessary to add the `FormatterSettings` to `Settings` without adding `ruff_python_formatter` as a dependency to `ruff_linter` (and the linter should not contain the formatter settings).
A quick overview of our settings structs at play:
* `Options`: 1:1 representation of the options in the `pyproject.toml` or `ruff.toml`. Used for deserialization.
* `Configuration`: Resolved `Options`, potentially merged from multiple configurations (when using `extend`). The representation is very close if not identical to the `Options`.
* `Settings`: The resolved configuration that uses a data format optimized for reading. Optional fields are initialized with their default values. Initialized by `Configuration::into_settings` .
The goal of this stack is to split `Settings` into tool-specific resolved `Settings` that are independent of each other. This has the advantage that the individual crates don't need to know anything about the other tools. The downside is that information gets duplicated between `Settings`. Right now the duplication is minimal (`line-length`, `tab-width`), but we may need to come up with a solution if more expensive data needs sharing.
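A minimal sketch of the resulting shape (field sets are illustrative; today only the small overlap is duplicated):
```rust
// Lives in `ruff_workspace` and composes the tool-specific settings.
pub struct Settings {
    pub linter: LinterSettings,
    pub formatter: FormatterSettings,
}

// Each tool owns its resolved settings; the duplicated fields are the
// currently-minimal overlap (`line-length`, `tab-width`).
pub struct LinterSettings {
    pub line_length: usize,
    pub tab_width: usize,
    // ... linter-only fields ...
}

pub struct FormatterSettings {
    pub line_length: usize,
    pub tab_width: usize,
    // ... formatter-only fields ...
}
```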
This stack focuses on `Settings`. Splitting `Configuration` into some smaller structs is something I'll follow up on later.
## PR Summary
This PR moves the `ResolverSettings` and `Settings` struct to `ruff_workspace`. `LinterSettings` remains in `ruff_linter` because it gets passed to lint rules, the `Checker` etc.
## Test Plan
`cargo test`
## Summary
The tokenizer was split into a forward and a backwards tokenizer. The
backwards tokenizer uses the same names as the forward one (e.g.
`next_token`). The backwards tokenizer is given the comment ranges that
we already built, which it uses to skip comments.
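A rough sketch of how pre-built comment ranges let a backwards tokenizer jump over comments without re-lexing them (illustrative, not the actual API):
```rust
/// Given a byte offset and the comment ranges built during forward lexing
/// (sorted, non-overlapping `(start, end)` pairs), return the position from
/// which to resume backwards tokenization: the comment's start if the offset
/// falls inside a comment, the offset itself otherwise.
fn skip_comment_backwards(offset: usize, comment_ranges: &[(usize, usize)]) -> usize {
    match comment_ranges.binary_search_by(|(start, _)| start.cmp(&offset)) {
        // The offset is exactly at a comment start: nothing to skip over.
        Ok(_) => offset,
        Err(0) => offset,
        Err(i) => {
            let (start, end) = comment_ranges[i - 1];
            if offset < end {
                start
            } else {
                offset
            }
        }
    }
}
```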
---------
Co-authored-by: Micha Reiser <micha@reiser.io>
Bumps [shlex](https://github.com/comex/rust-shlex) from 1.1.0 to 1.2.0.
<details>
<summary>Changelog</summary>
<p><em>Sourced from <a
href="https://github.com/comex/rust-shlex/blob/master/CHANGELOG.md">shlex's
changelog</a>.</em></p>
<blockquote>
<h1>1.2.0</h1>
<ul>
<li>Adds <code>bytes</code> module to support operating directly on byte
strings.</li>
</ul>
</blockquote>
</details>
<details>
<summary>Commits</summary>
<ul>
<li>See full diff in <a
href="https://github.com/comex/rust-shlex/commits">compare view</a></li>
</ul>
</details>
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Micha Reiser <micha@reiser.io>
## Summary
This PR upgrades `strum` from 0.24.x to 0.25.x.
The breaking changes are:
* strum macros now use syn 2
* The `to_string` behavior changed when using `default`. I did a quick search; we aren't using `strum(default)`.
`strum` now has a `#[derive(EnumIs)]` macro that generates `is_` methods.
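For example, a minimal sketch of the new derive:
```rust
use strum_macros::EnumIs;

#[derive(EnumIs)]
enum Applicability {
    Safe,
    Unsafe,
}

fn main() {
    let applicability = Applicability::Safe;
    // `EnumIs` generates an `is_*` predicate per variant.
    assert!(applicability.is_safe());
    assert!(!applicability.is_unsafe());
}
```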
## Test Plan
`cargo test`
## Summary
The only commit is [api: add follow_root_links() option to WalkDir](dcc527d832), which adds a new option controlling whether `walkdir` should follow a root symlink or not.
The new option defaults to `true`, which is the same behavior as before.
## Test Plan
`cargo test`
## Summary
This PR bumps the pyproject-toml crate to 0.7.0. The only difference is that it now depends on indexmap 2. I reviewed the indexmap 2 changes and they don't seem relevant to us.
I used this opportunity to remove the default features from `serde_with`, which removes our indexmap 1 dependency (and some other unused dependencies).
## Test Plan
`cargo test`
## Motivation
The `ast::Arguments` for call arguments are split into positional
arguments (args) and keyword arguments (keywords). We currently assume
that a call consists of args first and then keywords, which is generally
the case, but not always:
```python
f(*args, a=2, *args2, **kwargs)

class A(*args, a=2, *args2, **kwargs):
    pass
```
The consequence is accidentally reordering arguments
(https://github.com/astral-sh/ruff/pull/7268).
## Summary
`Arguments::args_and_keywords` returns an iterator of an `ArgOrKeyword`
enum that yields args and keywords in the correct order. I've fixed the
obvious `args` and `keywords` usages, but there might be some cases with
wrong assumptions remaining.
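A rough sketch of the shape (stand-in types; the real iterator merges on the AST nodes' text ranges):
```rust
use std::ops::Range;

// Illustrative stand-ins for the AST types; only the ranges matter here.
struct Arg { range: Range<usize> }
struct Keyword { range: Range<usize> }

enum ArgOrKeyword<'a> {
    Arg(&'a Arg),
    Keyword(&'a Keyword),
}

/// Yield args and keywords merged by source position, so that
/// `f(*args, a=2, *args2, **kwargs)` comes out in written order.
fn args_and_keywords<'a>(args: &'a [Arg], keywords: &'a [Keyword]) -> Vec<ArgOrKeyword<'a>> {
    let mut merged: Vec<ArgOrKeyword<'a>> = args
        .iter()
        .map(ArgOrKeyword::Arg)
        .chain(keywords.iter().map(ArgOrKeyword::Keyword))
        .collect();
    merged.sort_by_key(|item| match item {
        ArgOrKeyword::Arg(arg) => arg.range.start,
        ArgOrKeyword::Keyword(keyword) => keyword.range.start,
    });
    merged
}
```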
## Test Plan
The generator got new test cases; otherwise, the stacked PR
(https://github.com/astral-sh/ruff/pull/7268), which uncovered this, serves as the test.
## Summary
This PR updates the `FileCache` to include an optional `NotebookIndex`
to support caching for Jupyter Notebooks.
We only require the index to compute the diagnostics and thus we don't
really need to store the entire `Notebook` on the `Diagnostics` struct.
This means we only need the index to be stored in the cache to
reconstruct the `Diagnostics`.
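A minimal sketch of the resulting cache entry (field names and the index's contents are illustrative):
```rust
/// Illustrative stub: maps rows of the concatenated source back to cells,
/// which is all that's needed to render notebook diagnostics.
struct NotebookIndex {
    row_to_cell: Vec<u32>,
}

struct FileCache {
    // ... existing fields (key, timestamps, cached messages) ...
    /// `Some` only for Jupyter notebooks; `None` for plain Python files.
    notebook_index: Option<NotebookIndex>,
}
```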
## Test Plan
Updated an existing test case to run over the fixtures in the
`ruff_notebook` crate, where there are multiple Jupyter notebooks.
Locally, the following commands were run in order:
1. Remove the cache: `rm -rf .ruff_cache`
2. Run without cache: `cargo run --bin ruff -- check --isolated
crates/ruff_notebook/resources/test/fixtures/jupyter/unused_variable.ipynb
--no-cache`
3. Run with cache: `cargo run --bin ruff -- check --isolated
crates/ruff_notebook/resources/test/fixtures/jupyter/unused_variable.ipynb`
4. Check whether the `.ruff_cache` directory was created or not
5. Run with cache again and verify: `cargo run --bin ruff -- check
--isolated
crates/ruff_notebook/resources/test/fixtures/jupyter/unused_variable.ipynb`
## Benchmarks
https://github.com/astral-sh/ruff/pull/6863#issuecomment-1715675186

fixes: #6671
## Summary
This PR updates the remaining lexer test cases to use the snapshots.
This is mainly a mechanical refactor.
## Motivation
The main motivation is so that when we add the token range values to the
test case output, it's easier to update the test cases.
The reason they were not using snapshots before was the usage of the
`test_case` macro. The macro is mainly used for different EOL test cases. If we
just generated the snapshots directly, the snapshot names would be suffixed
with `-1`, `-2`, etc., as the test function is still the same. So, we create
the snapshots ourselves with the platform name for the respective EOL
test cases.
## Test Plan
`cargo test`
## Summary
This PR adds the `--preview` and `--no-preview` options to the `format` command (hidden) and passes them through to the formatter.
## Test Plan
I added a `dbg!(f.options().preview())` statement in `FormatNodeRule::fmt` and verified that the option gets passed correctly to the formatter.
## Summary
This PR updates the revision of the `LibCST` dependency to 9c263aa897
in order to fix https://github.com/astral-sh/ruff/issues/4899
## Test Plan
A test case including the carriage return (`\r`) character was added for
`F504`, followed by `cargo test`.
fixes: #4899
## Summary
This PR moves `ruff/jupyter` into its own `ruff_notebook` crate. Beyond
the move itself, there were a few challenges:
1. `ruff_notebook` relies on the source map abstraction. I've moved the
source map into `ruff_diagnostics`, since it doesn't have any
dependencies on its own and is used alongside diagnostics.
2. `ruff_notebook` has a couple of tests for end-to-end linting and
autofixing. I had to leave these tests in `ruff` itself.
3. We had code in `ruff/jupyter` that relied on Python lexing, in order
to provide a more targeted error message in the event that a user saves
a `.py` file with a `.ipynb` extension. I removed this in order to avoid
a dependency on the parser; it felt like it wasn't worth taking on that
dependency just for this.
## Test Plan
`cargo test`
## Summary
Returns an exit code of 1 if any files would be reformatted:
```
ruff on charlie/format-check:main [$?⇡] is 📦 v0.0.286 via 🐍 v3.11.2 via 🦀 v1.72.0
❯ cargo run -p ruff_cli -- format foo.py --check
Compiling ruff_cli v0.0.286 (/Users/crmarsh/workspace/ruff/crates/ruff_cli)
Finished dev [unoptimized + debuginfo] target(s) in 1.69s
Running `target/debug/ruff format foo.py --check`
warning: `ruff format` is a work-in-progress, subject to change at any time, and intended only for experimentation.
1 file would be reformatted
ruff on charlie/format-check:main [$?⇡] is 📦 v0.0.286 via 🐍 v3.11.2 via 🦀 v1.72.0 took 2s
❯ echo $?
1
```
Closes #6966.
**Summary** Add recursive formatting based on `ruff check` file
discovery for `ruff format`, as a prototype for the formatter alpha.
This allows e.g. `format ../projects/django/`. It's still lacking
support for any settings except line length.
Note that, just like the existing `ruff format`, this will become part of the
production build, i.e. you'll be able to use it - hidden by default and
with a prominent warning - with `ruff format .` after the next release.
Error handling works in my manual tests (the colors also work):
```
$ target/debug/ruff format scripts/
warning: `ruff format` is a work-in-progress, subject to change at any time, and intended for internal use only.
```
(the above changes `add_rule.py` where we have the wrong bin op
breaking)
```
$ target/debug/ruff format ../projects/django/
warning: `ruff format` is a work-in-progress, subject to change at any time, and intended for internal use only.
Failed to format /home/konsti/projects/django/tests/test_runner_apps/tagged/tests_syntax_error.py: source contains syntax errors: ParseError { error: UnrecognizedToken(Name { name: "syntax_error" }, None), offset: 131, source_path: "<filename>" }
```
```
$ target/debug/ruff format a
warning: `ruff format` is a work-in-progress, subject to change at any time, and intended for internal use only.
Failed to read /home/konsti/ruff/a/d.py: Permission denied (os error 13)
```
**Test Plan** Missing! I'm not sure if it's worth building tests at this
stage, or what they should look like.
## Summary
This PR updates the lexer tests to use the snapshot testing framework.
It also
makes the following changes:
* Remove the use of macros in the lexer tests
* Use `test_case` for EOL tests
## Test Plan
```
cargo test --package ruff_python_parser --lib --all-features -- lexer::tests --nocapture
```
In https://github.com/astral-sh/ruff/pull/6616 we are adding support for
nested replacements in format specifiers which makes actually formatting
strings infeasible without a great deal of complexity. Since we're not
using these functions (they just exist for runtime use in RustPython),
we can just remove them.
## Summary
Fixes some TODOs introduced in
https://github.com/astral-sh/ruff/pull/6538. In short, given an
expression like `1 if x > 0 else "Hello, world!"`, we now return a union
type that says the expression can resolve to either an `int` or a `str`.
The system remains very limited: it only works for obvious primitive
types, and there's no attempt to do inference on any more complex
variables. (If any expression yields `Unknown` or `TypeError`, we
propagate that result throughout and abort on the client's end.)
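A rough sketch of the propagation (names and types are illustrative, not the actual implementation):
```rust
#[derive(Clone, PartialEq)]
enum ResolvedType {
    Int,
    Str,
    Union(Vec<ResolvedType>),
    Unknown,
}

/// Resolve `body if test else orelse` by unioning both branches; any
/// `Unknown` poisons the whole expression, matching the propagate-and-abort
/// behavior described above.
fn resolve_if_expr(body: ResolvedType, orelse: ResolvedType) -> ResolvedType {
    match (body, orelse) {
        (ResolvedType::Unknown, _) | (_, ResolvedType::Unknown) => ResolvedType::Unknown,
        (left, right) if left == right => left,
        (left, right) => ResolvedType::Union(vec![left, right]),
    }
}
```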
**Summary** Some files seem notoriously slow in the formatter (seconds in debug mode). This time was, however, almost exclusively spent in the diff algorithm used to compute the similarity index, so I replaced that. I kept `similar` for printing the actual diff to avoid rewriting that too, with the disadvantage that we now have two diff libraries in format_dev.
I used this PR to remove the spinner from tracing-indicatif and changed `flamegraph --perfdata perf.data` to `flamegraph --perfdata perf.data --no-inline`, as the former wouldn't finish for me on release builds with debug info.
## Summary
This PR adds the implementation for the new Jupyter AST nodes i.e.,
`ExprLineMagic` and `StmtLineMagic`.
## Test Plan
Add test cases for `unparse` containing magic commands
resolves: #6087
Reimplements https://github.com/astral-sh/ruff/pull/3104
Closes https://github.com/astral-sh/ruff/issues/5726
Note that we will generate the hash for a cache key twice in normal
operation. Once to check for the cached item and again to update the
cache. We could optimize this by generating the hash once in
`diagnostics::lint_file` and passing the `u64` into `get` and `update`.
We'd probably want to wrap it in a `CacheKeyHash` enum for type safety.
## Test Plan
Unit tests for Windows and Unix.
Manual test with case from issue
```
❯ touch fake.py
❯ chmod +x fake.py
❯ ./target/debug/ruff --select EXE fake.py
fake.py:1:1: EXE002 The file is executable but no shebang is present
Found 1 error.
❯ chmod -x fake.py
❯ ./target/debug/ruff --select EXE fake.py
```
I don't know whether we want to make this change but here's some data...
Binary size:
- `main`: 30,384
- `charlie/match-phf`: 30,416
llvm-lines:
- `main`: 1,784,148
- `charlie/match-phf`: 1,789,877
llvm-lines and binary size are both unchanged (or differ by < 5) when moving
from `u8` to `u32` return types, and even when moving to `char` keys and
values. I didn't expect this, but I'm not very knowledgeable on this
topic.
Performance:
```
Confusables/match/src time: [4.9102 µs 4.9352 µs 4.9777 µs]
change: [+1.7469% +2.2421% +2.8710%] (p = 0.00 < 0.05)
Performance has regressed.
Found 12 outliers among 100 measurements (12.00%)
2 (2.00%) low mild
4 (4.00%) high mild
6 (6.00%) high severe
Confusables/match-with-skip/src
time: [2.0676 µs 2.0945 µs 2.1317 µs]
change: [+0.9384% +1.6000% +2.3920%] (p = 0.00 < 0.05)
Change within noise threshold.
Found 8 outliers among 100 measurements (8.00%)
3 (3.00%) high mild
5 (5.00%) high severe
Confusables/phf/src time: [31.087 µs 31.188 µs 31.305 µs]
change: [+1.9262% +2.2188% +2.5496%] (p = 0.00 < 0.05)
Performance has regressed.
Found 15 outliers among 100 measurements (15.00%)
3 (3.00%) low mild
6 (6.00%) high mild
6 (6.00%) high severe
Confusables/phf-with-skip/src
time: [2.0470 µs 2.0486 µs 2.0502 µs]
change: [-0.3093% -0.1446% +0.0106%] (p = 0.08 > 0.05)
No change in performance detected.
Found 4 outliers among 100 measurements (4.00%)
2 (2.00%) high mild
2 (2.00%) high severe
```
The `-with-skip` variants add our optimization which first checks
whether the character is ASCII. So `match` is way, way faster than PHF,
but it tends not to matter since almost all source code is ASCII anyway.
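The skip optimization is roughly the following (a sketch; the real table covers many more codepoints):
```rust
/// Map a confusable character to its ASCII look-alike.
fn confusable_for(c: char) -> Option<char> {
    // Fast path: almost all source code is ASCII, and ASCII characters are
    // never confusables, so skip the lookup entirely.
    if c.is_ascii() {
        return None;
    }
    match c {
        '\u{FF01}' => Some('!'), // fullwidth exclamation mark
        '\u{FF1B}' => Some(';'), // fullwidth semicolon
        '\u{2212}' => Some('-'), // minus sign
        _ => None,
    }
}
```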
**Summary** I accidentally merged earlier while the RustPython parser
rev was still pointing to the feature branch instead of to the merged
main. This makes the rev point to the RustPython parser repo's main again.
**Summary** Fix implemented in
https://github.com/astral-sh/RustPython-Parser/pull/35: Previously,
empty lambda arguments (e.g. `lambda: 1`) would get the range of the
entire expression, which leads to incorrect comment placement. Now empty
lambda arguments get an empty range between the `lambda` and the `:`
tokens.
**Test Plan** Added a regression test.
149 instances of unstable formatting remaining.
```
$ cargo run --bin ruff_dev --release -- format-dev --stability-check --error-file formatter-ecosystem-errors.txt --multi-project target/checkouts > formatter-ecosystem-progress.txt
$ rg "Unstable formatting" target/formatter-ecosystem-errors.txt | wc -l
149
```
## Summary
It can happen that we can't read a file (a python file, a jupyter
notebook or pyproject.toml), which needs to be handled and handled
consistently for all file types. Instead of using `Err` or `error!`, we
emit E602 with the io error as message and continue. This PR makes sure
we handle all three cases consistently, emit E602.
I'm not convinced that it should be possible to disable io errors, but
we now handle the regular case consistently and at least print warning
consistently.
I went with `warn!` but i can change them all to `error!`, too.
It also checks the error case when a pyproject.toml is not readable. The
error message is not very helpful, but it's now a bit clearer that
ruff itself failed, versus this being a diagnostic.
## Examples
This is how an `Err` of `run` looks now (screenshot omitted).
With an unreadable file and `IOError` disabled (screenshot omitted), we lint
zero files, but we count files before linting, not during, so we exit 0.
I'm not sure if it should (or if we should take a different path with a
manual `ExitStatus`), but this currently also triggers when `files` is
empty (screenshot omitted).
## Test Plan
Unix only: Create a temporary directory with files with permissions
`000` (not readable by the owner) and run on that directory. Since this
breaks the assumptions of most of the test code (single file, `ruff`
instead of `ruff_cli`), the test code is rather cumbersome and looks a
bit misplaced; I'm happy about suggestions to fit it in closer with the
other tests or streamline it in other ways. I added another test for
when the entire directory is not readable.
## Summary
This crate now contains utilities for dealing with trivia more broadly:
whitespace, newlines, "simple" trivia lexing, etc. So renaming it to
reflect its increased responsibilities.
To avoid conflicts, I've also renamed `Token` and `TokenKind` to
`SimpleToken` and `SimpleTokenKind`.
## Summary
The motivation here is that it will make this rule easier to rewrite as
a deferred check. Right now, we can't run this rule in the deferred
phase, because it depends on the `except_handler` to power its autofix.
Instead of lexing the `except_handler`, we can use the `SimpleTokenizer`
from the formatter, and just lex forwards and backwards.
For context, this rule detects the unused `e` in:
```python
try:
    pass
except ValueError as e:
    pass
```
## Summary
Previously, `StmtIf` was defined recursively as
```rust
pub struct StmtIf {
    pub range: TextRange,
    pub test: Box<Expr>,
    pub body: Vec<Stmt>,
    pub orelse: Vec<Stmt>,
}
```
Every `elif` was represented as an `orelse` with a single `StmtIf`. This
means that this representation couldn't differentiate between
```python
if cond1:
    x = 1
else:
    if cond2:
        x = 2
```
and
```python
if cond1:
    x = 1
elif cond2:
    x = 2
```
It also makes many checks harder than they need to be because we have to
recurse just to iterate over an entire if-elif-else and because we're
lacking nodes and ranges on the `elif` and `else` branches.
We change the representation to a flat
```rust
pub struct StmtIf {
    pub range: TextRange,
    pub test: Box<Expr>,
    pub body: Vec<Stmt>,
    pub elif_else_clauses: Vec<ElifElseClause>,
}

pub struct ElifElseClause {
    pub range: TextRange,
    pub test: Option<Expr>,
    pub body: Vec<Stmt>,
}
```
where `test: Some(_)` represents an `elif` and `test: None` an `else`.
This representation is a different tradeoff: e.g., we need to allocate the
`Vec<ElifElseClause>`, the `elif`s are now different from the `if`s
(which matters in rules where we want to check both `if`s and `elif`s), and
the type system doesn't guarantee that the `test: None` else is actually
last. We're also now a bit more inconsistent, since all other `else`s,
those from `for`, `while` and `try`, still don't have nodes. With the
new representation some things became easier, e.g. finding the `elif`
token (we can use the start of the `ElifElseClause`) and formatting
comments for if-elif-else (no more dangling-comment splitting; we only
have to insert the dangling comment after the colon manually and set
`leading_alternate_branch_comments`; everything else is taken care of by
having nodes for each branch and the usual placement.rs fixups).
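For example, visiting every branch of a chain is now a flat loop over the clauses (a sketch against the types above):
```rust
/// Visit every branch of an if-elif-else chain without recursion.
fn visit_branches(stmt_if: &StmtIf) {
    // The `if` branch itself.
    visit_body(&stmt_if.body);
    for clause in &stmt_if.elif_else_clauses {
        match &clause.test {
            Some(_test) => { /* an `elif` branch */ }
            None => { /* the trailing `else` branch */ }
        }
        visit_body(&clause.body);
    }
}

fn visit_body(_body: &[Stmt]) {
    // ... rule or formatter logic ...
}
```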
## Merge Plan
This PR requires coordination between the parser repo and the main ruff
repo. I've split the ruff part into two stacked PRs that have to be
merged together (only the second one fixes all tests): the first, for the
formatter, to be reviewed by @michareiser, and the second, for the linter,
to be reviewed by @charliermarsh.
* MH: Review and merge
https://github.com/astral-sh/RustPython-Parser/pull/20
* MH: Review and merge or move later in stack
https://github.com/astral-sh/RustPython-Parser/pull/21
* MH: Review and approve
https://github.com/astral-sh/RustPython-Parser/pull/22
* MH: Review and approve formatter PR
https://github.com/astral-sh/ruff/pull/5459
* CM: Review and approve linter PR
https://github.com/astral-sh/ruff/pull/5460
* Merge linter PR in formatter PR, fix ecosystem checks (ecosystem
checks can't run on the formatter PR and won't run on the linter PR, so
we need to merge them first)
* Merge https://github.com/astral-sh/RustPython-Parser/pull/22
* Create tag in the parser, update linter+formatter PR
* Merge linter+formatter PR https://github.com/astral-sh/ruff/pull/5459
---------
Co-authored-by: Micha Reiser <micha@reiser.io>
## Summary
For formatter instabilities, the messages we get look something like
this:
```text
Unstable formatting /home/konsti/ruff/target/checkouts/deepmodeling:dpdispatcher/dpdispatcher/slurm.py
@@ -47,9 +47,9 @@
- script_header_dict["slurm_partition_line"] = (
- NOT_YET_IMPLEMENTED_ExprJoinedStr
- )
+ script_header_dict[
+ "slurm_partition_line"
+ ] = NOT_YET_IMPLEMENTED_ExprJoinedStr
Unstable formatting /home/konsti/ruff/target/checkouts/deepmodeling:dpdispatcher/dpdispatcher/pbs.py
@@ -26,9 +26,9 @@
- pbs_script_header_dict["select_node_line"] += (
- NOT_YET_IMPLEMENTED_ExprJoinedStr
- )
+ pbs_script_header_dict[
+ "select_node_line"
+ ] += NOT_YET_IMPLEMENTED_ExprJoinedStr
```
For ruff crashes, you don't even get that, just the file that crashed
it. To extract the actual bug, you'd need to manually remove parts of
the file, rerunning to see if the bug still occurs (and reverting if it
doesn't) until you have a minimal example.
With this script, you run
```shell
cargo run --bin ruff_shrinking -- target/checkouts/deepmodeling:dpdispatcher/dpdispatcher/slurm.py target/minirepo/code.py "Unstable formatting" "target/debug/ruff_dev format-dev --stability-check target/minirepo"
```
and get
```python
class Slurm():
    def gen_script_header(self, job):
        if resources.queue_name != "":
            script_header_dict["slurm_partition_line"] = f"#SBATCH --partition {resources.queue_name}"
```
which is a nice minimal example.
I've been using this script and it would be easier for me if this were
part of main. The main disadvantage to merging is that it adds
additional dependencies.
## Test Plan
I've been using this for a number of minimizations. This is an internal
helper script you only run manually. I could add a test that minimizes a
rule violation, if required.
---------
Co-authored-by: Micha Reiser <micha@reiser.io>
## Summary
Comparing repos with black requires that we use the same settings as black,
notably line length and magic trailing comma behaviour. Excludes and
preserving quotes (vs. a preference for either quote style) are not yet
implemented because they weren't needed for the test projects.
In the other two commits, I fixed the output when the progress bar is
hidden (this way is recommended in the indicatif docs), added a
`scratch.pyi` file to gitignore because black formats stub files
differently, and also updated the ecosystem README with the projects JSON
without forks.
## Test Plan
I added a `line-length` vs `line_length` test. Otherwise, only my
personal usage atm; a PR to integrate the script into the CI to check
some projects will follow.
## Summary
Do not raise `EXE001` and `EXE002` if WSL is detected. Uses the
[`wsl`](https://crates.io/crates/wsl) crate.
Closes #5445.
## Test Plan
`cargo test`
I don't use Windows, so I was unable to test on a WSL environment. It
would be good if someone who runs Windows could check the functionality.
## Summary
It turns out that just doing this match directly without `AhoCorasick`
is much faster, like 2x (and removes one dependency, though we likely
already rely on this transitively).
## Summary
Historically, we only used a fork to enable building without pyo3. But
pyo3 is an optional feature. I may've just not understood how to
accomplish this way back when.
## Summary
This extends the `ruff_dev` formatter script util. Instead of only doing
stability checks, you can now choose different compatible options on the
CLI and get statistics.
* It adds an option that formats all files that ruff would check, to allow
looking at an entire black-formatted repository with `git diff`
* It computes the [Jaccard
index](https://en.wikipedia.org/wiki/Jaccard_index) as a measure of
deviation between input and output, which is useful as single number
metric for assessing our current deviations from black.
* It adds progress bars to both the single projects as well as the
multi-project mode.
* It adds an option to write the multi-project output to a file
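As a sketch of the metric from the Jaccard bullet above (the real computation runs over diffs; line sets are used here for illustration):
```rust
use std::collections::HashSet;

/// Jaccard index |A ∩ B| / |A ∪ B| over the sets of lines in the
/// (black-formatted) input and the ruff-formatted output: 1.0 means identical.
fn jaccard(input: &str, output: &str) -> f64 {
    let a: HashSet<&str> = input.lines().collect();
    let b: HashSet<&str> = output.lines().collect();
    let union = a.union(&b).count();
    if union == 0 {
        return 1.0; // both empty: treat as identical
    }
    a.intersection(&b).count() as f64 / union as f64
}
```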
Sample usage:
```
$ cargo run --bin ruff_dev -- format-dev --stability-check crates/ruff/resources/test/cpython
$ cargo run --bin ruff_dev -- format-dev --stability-check /home/konsti/projects/django
Syntax error in /home/konsti/projects/django/tests/test_runner_apps/tagged/tests_syntax_error.py: source contains syntax errors (parser error): BaseError { error: UnrecognizedToken(Name { name: "syntax_error" }, None), offset: 131, source_path: "<filename>" }
Found 0 stability errors in 2755 files (jaccard index 0.911) in 9.75s
$ cargo run --bin ruff_dev -- format-dev --write /home/konsti/projects/django
```
Options:
```
Several utils related to the formatter which can be run on one or more repositories. The selected set of files in a repository is the same as for `ruff check`.
* Check formatter stability: Format a repository twice and ensure that the first and second formatting look the same. * Format: Format the files in a repository to be able to check them with `git diff` * Statistics: Compute the Jaccard index between the (assumed to be black formatted) input and the ruff formatted output
Usage: ruff_dev format-dev [OPTIONS] [FILES]...
Arguments:
[FILES]...
Like `ruff check`'s files. See `--multi-project` if you want to format an ecosystem checkout
Options:
--stability-check
Check stability
We want to ensure that once formatted content stays the same when formatted again, which is known as formatter stability or formatter idempotency, and that the formatter prints syntactically valid code. As our test cases cover only a limited amount of code, this allows checking entire repositories.
--write
Format the files. Without this flag, the python files are not modified
--format <FORMAT>
Control the verbosity of the output
[default: default]
Possible values:
- minimal: Filenames only
- default: Filenames and reduced diff
- full: Full diff and invalid code
-x, --exit-first-error
Print only the first error and exit; `-x` is the same as in pytest
--multi-project
Checks each project inside a directory, useful e.g. if you want to check all of the ecosystem checkouts
--error-file <ERROR_FILE>
Write all errors to this file in addition to stdout. Only used in multi-project mode
```
## Test Plan
I ran this on django (2755 files, jaccard index 0.911) and discovered a
magic trailing comma problem and that we really needed to implement
import formatting. I ran the script on cpython to identify
https://github.com/astral-sh/ruff/pull/5558.
## Summary
I'll write up a more detailed description tomorrow, but in short, this
PR removes our regex-based implementation in favor of "manual" parsing.
I tried a couple different implementations. In the benchmarks below:
- `Directive/Regex` is our implementation on `main`.
- `Directive/Find` just uses `text.find("noqa")`, which is insufficient,
since it doesn't cover case-insensitive variants like `NOQA`, and
doesn't handle multiple `noqa` matches in a single line, like `# Here's
a noqa comment # noqa: F401`. But it's kind of a baseline.
- `Directive/Memchr` uses three `memchr` iterative finders (one for
`noqa`, `NOQA`, and `NoQA`).
- `Directive/AhoCorasick` is roughly the variant checked in here.
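The aho-corasick variant looks roughly like this (a sketch; each candidate is then parsed by hand, and in practice the automaton is built once, not per call):
```rust
use aho_corasick::AhoCorasick;

/// Find the offset of the first case-insensitive `noqa` candidate, if any.
fn find_noqa_candidate(comment: &str) -> Option<usize> {
    let finder = AhoCorasick::builder()
        .ascii_case_insensitive(true)
        .build(["noqa"])
        .unwrap();
    finder.find(comment).map(|m| m.start())
}
```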
The raw results:
```
Directive/Regex/# noqa: F401
time: [273.69 ns 274.71 ns 276.03 ns]
change: [+1.4467% +1.8979% +2.4243%] (p = 0.00 < 0.05)
Performance has regressed.
Found 15 outliers among 100 measurements (15.00%)
3 (3.00%) low mild
8 (8.00%) high mild
4 (4.00%) high severe
Directive/Find/# noqa: F401
time: [66.972 ns 67.048 ns 67.132 ns]
change: [+2.8292% +2.9377% +3.0540%] (p = 0.00 < 0.05)
Performance has regressed.
Found 15 outliers among 100 measurements (15.00%)
1 (1.00%) low severe
3 (3.00%) low mild
8 (8.00%) high mild
3 (3.00%) high severe
Directive/AhoCorasick/# noqa: F401
time: [76.922 ns 77.189 ns 77.536 ns]
change: [+0.4265% +0.6862% +0.9871%] (p = 0.00 < 0.05)
Change within noise threshold.
Found 8 outliers among 100 measurements (8.00%)
1 (1.00%) low mild
3 (3.00%) high mild
4 (4.00%) high severe
Directive/Memchr/# noqa: F401
time: [62.627 ns 62.654 ns 62.679 ns]
change: [-0.1780% -0.0887% -0.0120%] (p = 0.03 < 0.05)
Change within noise threshold.
Found 11 outliers among 100 measurements (11.00%)
1 (1.00%) low severe
5 (5.00%) low mild
3 (3.00%) high mild
2 (2.00%) high severe
Directive/Regex/# noqa: F401, F841
time: [321.83 ns 322.39 ns 322.93 ns]
change: [+8602.4% +8623.5% +8644.5%] (p = 0.00 < 0.05)
Performance has regressed.
Found 5 outliers among 100 measurements (5.00%)
1 (1.00%) low severe
2 (2.00%) low mild
1 (1.00%) high mild
1 (1.00%) high severe
Directive/Find/# noqa: F401, F841
time: [78.618 ns 78.758 ns 78.896 ns]
change: [+1.6909% +1.8771% +2.0628%] (p = 0.00 < 0.05)
Performance has regressed.
Found 3 outliers among 100 measurements (3.00%)
3 (3.00%) high mild
Directive/AhoCorasick/# noqa: F401, F841
time: [87.739 ns 88.057 ns 88.468 ns]
change: [+0.1843% +0.4685% +0.7854%] (p = 0.00 < 0.05)
Change within noise threshold.
Found 11 outliers among 100 measurements (11.00%)
5 (5.00%) low mild
3 (3.00%) high mild
3 (3.00%) high severe
Directive/Memchr/# noqa: F401, F841
time: [80.674 ns 80.774 ns 80.860 ns]
change: [-0.7343% -0.5633% -0.4031%] (p = 0.00 < 0.05)
Change within noise threshold.
Found 14 outliers among 100 measurements (14.00%)
4 (4.00%) low severe
9 (9.00%) low mild
1 (1.00%) high mild
Directive/Regex/# noqa time: [194.86 ns 195.93 ns 196.97 ns]
change: [+11973% +12039% +12103%] (p = 0.00 < 0.05)
Performance has regressed.
Found 6 outliers among 100 measurements (6.00%)
5 (5.00%) low mild
1 (1.00%) high mild
Directive/Find/# noqa time: [25.327 ns 25.354 ns 25.383 ns]
change: [+3.8524% +4.0267% +4.1845%] (p = 0.00 < 0.05)
Performance has regressed.
Found 9 outliers among 100 measurements (9.00%)
6 (6.00%) high mild
3 (3.00%) high severe
Directive/AhoCorasick/# noqa
time: [34.267 ns 34.368 ns 34.481 ns]
change: [+0.5646% +0.8505% +1.1281%] (p = 0.00 < 0.05)
Change within noise threshold.
Found 5 outliers among 100 measurements (5.00%)
5 (5.00%) high mild
Directive/Memchr/# noqa time: [21.770 ns 21.818 ns 21.874 ns]
change: [-0.0990% +0.1464% +0.4046%] (p = 0.26 > 0.05)
No change in performance detected.
Found 10 outliers among 100 measurements (10.00%)
4 (4.00%) low mild
4 (4.00%) high mild
2 (2.00%) high severe
Directive/Regex/# type: ignore # noqa: E501
time: [278.76 ns 279.69 ns 280.72 ns]
change: [+7449.4% +7469.8% +7490.5%] (p = 0.00 < 0.05)
Performance has regressed.
Found 3 outliers among 100 measurements (3.00%)
1 (1.00%) low mild
1 (1.00%) high mild
1 (1.00%) high severe
Directive/Find/# type: ignore # noqa: E501
time: [67.791 ns 67.976 ns 68.184 ns]
change: [+2.8321% +3.1735% +3.5418%] (p = 0.00 < 0.05)
Performance has regressed.
Found 6 outliers among 100 measurements (6.00%)
5 (5.00%) high mild
1 (1.00%) high severe
Directive/AhoCorasick/# type: ignore # noqa: E501
time: [75.908 ns 76.055 ns 76.210 ns]
change: [+0.9269% +1.1427% +1.3955%] (p = 0.00 < 0.05)
Change within noise threshold.
Found 1 outliers among 100 measurements (1.00%)
1 (1.00%) high severe
Directive/Memchr/# type: ignore # noqa: E501
time: [72.549 ns 72.723 ns 72.957 ns]
change: [+1.5881% +1.9660% +2.3974%] (p = 0.00 < 0.05)
Performance has regressed.
Found 15 outliers among 100 measurements (15.00%)
10 (10.00%) high mild
5 (5.00%) high severe
Directive/Regex/# type: ignore # nosec
time: [66.967 ns 67.075 ns 67.207 ns]
change: [+1713.0% +1715.8% +1718.9%] (p = 0.00 < 0.05)
Performance has regressed.
Found 10 outliers among 100 measurements (10.00%)
1 (1.00%) low severe
3 (3.00%) low mild
2 (2.00%) high mild
4 (4.00%) high severe
Directive/Find/# type: ignore # nosec
time: [18.505 ns 18.548 ns 18.597 ns]
change: [+1.3520% +1.6976% +2.0333%] (p = 0.00 < 0.05)
Performance has regressed.
Found 4 outliers among 100 measurements (4.00%)
4 (4.00%) high mild
Directive/AhoCorasick/# type: ignore # nosec
time: [16.162 ns 16.206 ns 16.252 ns]
change: [+1.2919% +1.5587% +1.8430%] (p = 0.00 < 0.05)
Performance has regressed.
Found 4 outliers among 100 measurements (4.00%)
3 (3.00%) high mild
1 (1.00%) high severe
Directive/Memchr/# type: ignore # nosec
time: [39.192 ns 39.233 ns 39.276 ns]
change: [+0.5164% +0.7456% +0.9790%] (p = 0.00 < 0.05)
Change within noise threshold.
Found 13 outliers among 100 measurements (13.00%)
2 (2.00%) low severe
4 (4.00%) low mild
3 (3.00%) high mild
4 (4.00%) high severe
Directive/Regex/# some very long comment that # is interspersed with characters but # no directive
time: [81.460 ns 81.578 ns 81.703 ns]
change: [+2093.3% +2098.8% +2104.2%] (p = 0.00 < 0.05)
Performance has regressed.
Found 4 outliers among 100 measurements (4.00%)
2 (2.00%) low mild
2 (2.00%) high mild
Directive/Find/# some very long comment that # is interspersed with characters but # no directive
time: [26.284 ns 26.331 ns 26.387 ns]
change: [+0.7554% +1.1027% +1.3832%] (p = 0.00 < 0.05)
Change within noise threshold.
Found 6 outliers among 100 measurements (6.00%)
5 (5.00%) high mild
1 (1.00%) high severe
Directive/AhoCorasick/# some very long comment that # is interspersed with characters but # no direc...
time: [28.643 ns 28.714 ns 28.787 ns]
change: [+1.3774% +1.6780% +2.0028%] (p = 0.00 < 0.05)
Performance has regressed.
Found 2 outliers among 100 measurements (2.00%)
2 (2.00%) high mild
Directive/Memchr/# some very long comment that # is interspersed with characters but # no directive
time: [55.766 ns 55.831 ns 55.897 ns]
change: [+1.5802% +1.7476% +1.9021%] (p = 0.00 < 0.05)
Performance has regressed.
Found 2 outliers among 100 measurements (2.00%)
2 (2.00%) low mild
```
While memchr is faster than aho-corasick in some of the common cases
(like `# noqa: F401`), the latter is way, way faster when there _isn't_
a match (like 2x faster -- see the last two cases). Since most comments
_aren't_ `noqa` comments, this felt like the right tradeoff. Note that
all implementations are significantly faster than the regex version.
(I know I originally reported a 10x speedup, but I ended up improving
the regex version a bit in some prior PRs, so it got unintentionally
faster via some refactors.)
There's also one behavior change in here, which is that we now allow
variable spaces, e.g., `#noqa` or `# noqa`. Previously, we required
exactly one space. This thus closes #5177.
## Summary
This PR uses rayon to parallelize the stability check by scheduling each project as its own task.
## Test Plan
I ran the ecosystem check. It now makes use of all cores (except at the end, there are some large projects).
## Performance
The check now completes in minutes where it took about 30 minutes before.
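The scheduling itself is a one-liner with rayon (a sketch, assuming a hypothetical per-project `check_project` task):
```rust
use rayon::prelude::*;
use std::path::{Path, PathBuf};

fn check_all(projects: Vec<PathBuf>) {
    // Each project becomes its own rayon task; the thread pool keeps all
    // cores busy until only the largest projects remain.
    projects.par_iter().for_each(|project| check_project(project));
}

fn check_project(_project: &Path) {
    // ... format twice, compare the results ...
}
```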
## Summary
This PR adds some snapshot tests for the resolver based on executing
resolutions within a "mock" of the Airflow repo (that is: a folder that
contains a subset of the repo's files, but all empty, and with an
only-partially-complete virtual environment). It's intended to act as a
lightweight integration test, to enable us to test resolutions on a
"real" project without adding a dependency on Airflow itself.
## Summary
This PR contains the first step towards enabling robust first-party,
third-party, and standard library import resolution in Ruff (including
support for `typeshed`, stub files, native modules, etc.) by porting
Pyright's import resolver to Rust.
The strategy taken here was to start with a more-or-less direct port of
the Pyright's TypeScript resolver. The code is intentionally similar,
and the test suite is effectively a superset of Pyright's test suite for
its own resolver. Due to the nature of the port, the code is very, very
non-idiomatic for Rust. The code is also entirely unused outside of the
test suite, and no effort has been made to integrate it with the rest of
the codebase.
Future work will include:
- Refactoring the code (now that it works) to match Rust and Ruff
idioms.
- Further testing, in practice, to ensure that the resolver can resolve
imports in a complex project, when provided with a virtual environment
path.
- Caching, to minimize filesystem lookups and redundant resolutions.
- Integration into Ruff itself (use Ruff's existing settings, find rules
that can make use of robust resolution, etc.)
## Summary
This formats call expressions with magic trailing comma and parentheses
behaviour, but without call chaining.
## Test Plan
Lots of new test fixtures, including some that don't work yet
## Summary
Experimental release for Jupyter Notebook integration.
Currently, this requires a user to explicitly opt-in using the
[include](https://beta.ruff.rs/docs/settings/#include) configuration:
```toml
[tool.ruff]
include = ["*.py", "*.pyi", "**/pyproject.toml", "*.ipynb"]
```
Or, a user can pass in the file directly:
```sh
ruff check path/to/notebook.ipynb
```
For known limitations, please refer to #5188
## Test Plan
The following command should work without the `--all-features` flag:
```sh
cargo dev round-trip /path/to/notebook.ipynb
```
The following command should work with the above config file along with
`select = ["ALL"]`:
```sh
cargo run --bin ruff -- check --no-cache --config=../test-repos/openai-cookbook/pyproject.toml --fix ../test-repos/openai-cookbook/
```
Passing the Jupyter notebook directly:
```sh
cargo run --bin ruff -- check --no-cache --isolated --select=ALL --fix ../test-repos/openai-cookbook/examples/Classification_using_embeddings.ipynb
```
## Summary
This PR adds tests that verify that the magic trailing comma is not respected if disabled in the formatter options.
Our test setup now allows creating a `<fixture-name>.options.json` file that contains an array of configurations that should be tested.
## Test Plan
It's all about tests :)
## Summary
This PR implements formatting for non-f-string strings that do not use implicit concatenation.
Docstring formatting is out of the scope of this PR.
<!-- What's the purpose of the change? What does it do, and why? -->
## Test Plan
I added a few tests for simple string literals.
## Performance
Ouch. This is hitting performance somewhat hard. This is probably because we now iterate each string a couple of times:
1. To detect if it is an implicit string continuation
2. To detect if the string contains any new lines
3. To detect the preferred quote
4. To normalize the string
Edit: I integrated the detection of newlines into the preferred quote detection so that we only iterate the string three times.
We can probably do better by merging the implicit string continuation with the quote detection and newline detection by iterating till the end of the string part and returning the offset. We then use our simple tokenizer to skip over any comments or whitespace until we find the first non-trivia token. From there we keep doing this in a loop until we reach the end of the string. I'll leave this improvement for later.
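A sketch of the merged pass (illustrative, not the actual normalization code):
```rust
/// Scan the string body once, returning the preferred quote character and
/// whether the literal contains a newline.
fn analyze_string(body: &str) -> (char, bool) {
    let (mut singles, mut doubles, mut has_newline) = (0u32, 0u32, false);
    for c in body.chars() {
        match c {
            '\'' => singles += 1,
            '"' => doubles += 1,
            '\n' => has_newline = true,
            _ => {}
        }
    }
    // Prefer double quotes; switch to single quotes only if that means
    // fewer escapes.
    let preferred = if doubles > singles { '\'' } else { '"' };
    (preferred, has_newline)
}
```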
## Summary
In https://github.com/astral-sh/RustPython-Parser/pull/8, we modified
RustPython to include ranges for any identifiers that aren't
`Expr::Name` (which already has an identifier).
For example, the `e` in `except ValueError as e` was previously
un-ranged. To extract its range, we had to do some lexing of our own.
This change should improve performance and let us remove a bunch of
code.
## Test Plan
`cargo test`
## Summary
This PR upgrades RustPython to pull in the changes to `Arguments` (zip
defaults with their identifiers) and all the renames to `CmpOp` and
friends.
## Summary
Maintain consistency while deserializing a Jupyter notebook to JSON. The
following changes were made:
1. Use a string array to store the source value as that's the default
(5781720423/nbformat/v4/nbjson.py (L56-L57))
2. Remove unused structs and enums
3. Reorder the keys in alphabetical order as that's the default.
(5781720423/nbformat/v4/nbjson.py (L51))
### Side effect
Removing the `preserve_order` feature means that the order of keys in
JSON output (`--format json`) will be in alphabetical order. This is
because the value is represented using `serde_json::Value` which
internally is a `BTreeMap`, thus sorting it by the string key. For
posterity, if this turns out to be not ideal, we could define a
struct representing the JSON object, and the order of struct fields would
determine the order in the JSON string.
## Test Plan
Add a test case to assert the raw JSON string.
## Summary
This changes the caching design from one cache file per source file, to
one cache file per package. This greatly reduces the number of cache
files that are opened and written, while maintaining roughly the same
(combined) size as bincode is very compact.
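A rough sketch of the per-package layout (field names and entry contents are illustrative):
```rust
use std::collections::HashMap;
use std::path::PathBuf;

/// One of these is serialized (compactly, with bincode) per package,
/// instead of one cache file per source file.
struct PackageCache {
    /// The package root this cache belongs to.
    package_root: PathBuf,
    /// Per-file entries, keyed by path relative to the package root.
    files: HashMap<PathBuf, FileCacheEntry>,
}

struct FileCacheEntry {
    /// Used to invalidate the entry when the source file changes.
    last_modified_ms: u64,
    /// The cached lint results for the file (illustrative placeholder type).
    messages: Vec<String>,
}
```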
Below are some very much not scientific performance tests. It uses
projects/sources to check:
* small.py: a single, 31-byte Python file with 2 errors.
* test.py: a single, 43k Python file with 8 errors.
* fastapi: FastAPI repo, 1134 files checked, 0 errors.
Source | Before # files | After # files | Before size | After size
-------|-------|-------|-------|-------
small.py | 1 | 1 | 20 K | 20 K
test.py | 1 | 1 | 60 K | 60 K
fastapi | 1134 | 518 | 4.5 M | 2.3 M
One question that might come up is why fastapi still has 518 cache files
and not 1? That is because this uses the existing package
resolution, which sees examples, docs, etc. as separate from the "main"
source code (in the fastapi directory in the repo). In the future it
might be worth considering switching to a one-cache-file-per-repo strategy.
This new design is not perfect and does have a number of known issues.
First, like the old design it doesn't remove the cache for a source file
that has been (re)moved until `ruff clean` is called.
Second, this currently uses a large mutex around the mutation of the
package cache (e.g. inserting results). This could be (or become) a
bottleneck. It's future work to test and improve this (if needed).
Third, the packages are currently opened and stored in a sequential
loop; this could be done in parallel. This is also future work.
## Test Plan
Run `ruff check` (with caching enabled) twice on any Python source code
and it should produce the same results.
## Summary
We want to ensure that once formatted content stays the same when
formatted again, which is known as formatter stability or formatter
idempotency, and that the formatter prints syntactically valid code. As
our test cases cover only a limited amount of code, this allows checking
entire repositories.
This adds a new subcommand to `ruff_dev` which can be invoked as `cargo
run --bin ruff_dev -- check-formatter-stability <repo>`. While initially
only intended to check stability, it has also found cases where the
formatter printed invalid syntax or panicked.
## Test Plan
Running this on cpython is already identifying bugs
(https://github.com/astral-sh/ruff/pull/5089)
## Summary
Add support for applying auto-fixes in Jupyter Notebook.
### Solution
Cell offsets are the boundaries for each cell in the concatenated source
code. They are represented using `TextSize` and include the start and
end offsets, thus creating a range for each cell. These offsets are
updated using the `SourceMap` markers.
### SourceMap
`SourceMap` contains markers, constructed from each edit, which track
each original source position against its transformed position. (The
original PR included a drawing of the markers; omitted here.) The
`Notebook` looks at these markers and updates the cell offsets after
each linter loop. If you look closely, each destination takes into
account all of the markers before it.
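A rough sketch of the offset update (types illustrative):
```rust
/// A marker maps a position in the original source to its position in the
/// transformed source.
#[derive(Copy, Clone)]
struct Marker {
    source: u32,
    dest: u32,
}

/// Shift a cell offset by the last marker at or before it. Markers are
/// sorted by `source`, and each `dest` already accumulates all prior edits,
/// which is why only one marker needs to be consulted.
fn update_offset(offset: u32, markers: &[Marker]) -> u32 {
    match markers.iter().rev().find(|marker| marker.source <= offset) {
        Some(marker) => offset - marker.source + marker.dest,
        None => offset,
    }
}
```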
The index is constructed only when required as it's only used to render
the diagnostics. So, a `OnceCell` is used for this purpose. The cell
offsets, cell content and the index will be updated after each iteration
of linting in the mentioned order. The order is important here as the
content is updated as per the new offsets and index is updated as per
the new content.
## Limitations
### 1
Styling rules such as the ones in `pycodestyle` will not be applicable
everywhere in a Jupyter notebook, especially at the cell boundaries. Let's
take an example where a rule suggests having 2 blank lines before a
function and the cells contain the following code:
```python
import something
# ---
def first():
    pass
def second():
    pass
```
(Again, the comment is only to visualize cell boundaries.)
In the concatenated source code, the 2 blank lines will be added, but they
shouldn't actually be added when viewed in terms of the Jupyter notebook:
it's as if the function `first` were at the start of a file.
`nbqa` solves this by recording newlines before and after running
`autopep8`, then running the tool and restoring the newlines at the end
(refer https://github.com/nbQA-dev/nbQA/pull/807).
## Test Plan
Three commands were run in order with common flags (`--select=ALL
--no-cache --isolated`) to isolate the stage in which a problem occurs:
1. Only diagnostics
2. Fix with diff (`--fix --diff`)
3. Fix (`--fix`)
### https://github.com/facebookresearch/segment-anything
```
-------------------------------------------------------------------------------
Language Files Lines Code Comments Blanks
-------------------------------------------------------------------------------
Jupyter Notebooks 3 0 0 0 0
|- Markdown 3 98 0 94 4
|- Python 3 513 468 4 41
(Total) 611 468 98 45
-------------------------------------------------------------------------------
```
```console
$ cargo run --all-features --bin ruff -- check --no-cache --isolated --select=ALL /path/to/segment-anything/**/*.ipynb --fix
...
Found 180 errors (89 fixed, 91 remaining).
```
### https://github.com/openai/openai-cookbook
```
-------------------------------------------------------------------------------
Language Files Lines Code Comments Blanks
-------------------------------------------------------------------------------
Jupyter Notebooks 65 0 0 0 0
|- Markdown 64 3475 12 2507 956
|- Python 65 9700 7362 1101 1237
(Total) 13175 7374 3608 2193
===============================================================================
```
```console
$ cargo run --all-features --bin ruff -- check --no-cache --isolated --select=ALL /path/to/openai-cookbook/**/*.ipynb --fix
error: Failed to parse /path/to/openai-cookbook/examples/vector_databases/Using_vector_databases_for_embeddings_search.ipynb:cell 4:29:18: unexpected token '-'
...
Found 4227 errors (2165 fixed, 2062 remaining).
```
### https://github.com/tensorflow/docs
```
-------------------------------------------------------------------------------
Language Files Lines Code Comments Blanks
-------------------------------------------------------------------------------
Jupyter Notebooks 150 0 0 0 0
|- Markdown 1 55 0 46 9
|- Python 1 402 289 60 53
(Total) 457 289 106 62
-------------------------------------------------------------------------------
```
```console
$ cargo run --all-features --bin ruff -- check --no-cache --isolated --select=ALL /path/to/tensorflow-docs/**/*.ipynb --fix
error: Failed to parse /path/to/tensorflow-docs/site/en/guide/extension_type.ipynb:cell 80:1:1: unexpected token Indent
error: Failed to parse /path/to/tensorflow-docs/site/en/r1/tutorials/eager/custom_layers.ipynb:cell 20:1:1: unexpected token Indent
error: Failed to parse /path/to/tensorflow-docs/site/en/guide/data.ipynb:cell 175:5:14: unindent does not match any outer indentation level
error: Failed to parse /path/to/tensorflow-docs/site/en/r1/tutorials/representation/unicode.ipynb:cell 30:1:1: unexpected token Indent
...
Found 12726 errors (5140 fixed, 7586 remaining).
```
### https://github.com/tensorflow/models
```
-------------------------------------------------------------------------------
Language Files Lines Code Comments Blanks
-------------------------------------------------------------------------------
Jupyter Notebooks 46 0 0 0 0
|- Markdown 1 11 0 6 5
|- Python 1 328 249 19 60
(Total) 339 249 25 65
-------------------------------------------------------------------------------
```
```console
$ cargo run --all-features --bin ruff -- check --no-cache --isolated --select=ALL /path/to/tensorflow-models/**/*.ipynb --fix
...
Found 4856 errors (2690 fixed, 2166 remaining).
```
resolves: #1218
fixes: #4556
## Summary
`ruff_newlines` becomes `ruff_python_whitespace`, and includes the
existing "universal newline" handlers alongside the Python
whitespace-specific utilities.
* Use phf for confusables to reduce llvm lines
## Summary
This replaces FxHashMap for the confusables with a perfect hash map from the [phf crate](https://github.com/rust-phf/rust-phf) to reduce the generated llvm instructions.
A perfect hash function is one that doesn't have any collisions. We can build one because we know all keys at compile time. This improves hashmap efficiency, even though this is likely not noticeable in our case (unless someone has a large non-English crate to test on).
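For illustration, a tiny perfect-hash map with `char` keys (a hand-picked subset; the real table is generated and much larger):
```rust
use phf::phf_map;

/// Built at compile time: no collisions, no runtime hashing state, and no
/// huge initializer closure for llvm to churn through.
static CONFUSABLES: phf::Map<char, char> = phf_map! {
    '\u{FF01}' => '!', // fullwidth exclamation mark
    '\u{FF1B}' => ';', // fullwidth semicolon
    '\u{2212}' => '-', // minus sign
};

fn confusable_for(c: char) -> Option<char> {
    CONFUSABLES.get(&c).copied()
}
```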
The original hashmap contained a lot of duplicates, which I had to remove when `phf_map` complained; I did so by sorting the keys.
The important part is that it reduces the llvm instructions generated (#3808, `RUSTFLAGS="-Csymbol-mangling-version=v0" cargo llvm-lines -p ruff --lib | head -20`):
```
Lines Copies Function name
----- ------ -------------
1740502 38973 (TOTAL)
27423 (1.6%, 1.6%) 1 (0.0%, 0.0%) ruff[cef4c65d96248843]::rules::ruff::rules::confusables::CONFUSABLES::{closure#0}
10193 (0.6%, 2.2%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::codes::RuleCodePrefix>::iter
8107 (0.5%, 2.6%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::codes::Rule>::noqa_code
7345 (0.4%, 3.0%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::checkers::ast::Checker as ruff_python_ast[3778b140caf21545]::visitor::Visitor>::visit_stmt
6412 (0.4%, 3.4%) 1 (0.0%, 0.0%) <<ruff[cef4c65d96248843]::settings::options::Options as serde[d89b1b632568f5a3]::de::Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]::de::Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]::de::spanned::SpannedDeserializer<toml_edit[7e3a6c5e67260672]::de::value::ValueDeserializer>>
6412 (0.4%, 3.8%) 1 (0.0%, 0.0%) <<ruff[cef4c65d96248843]::settings::options::Options as serde[d89b1b632568f5a3]::de::Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]::de::Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]::de::table::TableMapAccess>
6409 (0.4%, 4.2%) 1 (0.0%, 0.0%) <<ruff[cef4c65d96248843]::settings::options::Options as serde[d89b1b632568f5a3]::de::Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]::de::Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]::de::datetime::DatetimeDeserializer>
5696 (0.3%, 4.5%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::checkers::ast::Checker as ruff_python_ast[3778b140caf21545]::visitor::Visitor>::visit_expr
4448 (0.3%, 4.7%) 1 (0.0%, 0.0%) ruff[cef4c65d96248843]::flake8_to_ruff::converter::convert
3702 (0.2%, 4.9%) 1 (0.0%, 0.0%) <&ruff[cef4c65d96248843]::registry::Linter as core[da82827a87f140f9]::iter::traits::collect::IntoIterator>::into_iter
3349 (0.2%, 5.1%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::registry::Linter>::code_for_rule
3132 (0.2%, 5.3%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::codes::Rule as core[da82827a87f140f9]::fmt::Debug>::fmt
3130 (0.2%, 5.5%) 1 (0.0%, 0.0%) <&str as core[da82827a87f140f9]::convert::From<&ruff[cef4c65d96248843]::codes::Rule>>::from
3130 (0.2%, 5.7%) 1 (0.0%, 0.0%) <&str as core[da82827a87f140f9]::convert::From<ruff[cef4c65d96248843]::codes::Rule>>::from
3130 (0.2%, 5.9%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::codes::Rule as core[da82827a87f140f9]::convert::AsRef<str>>::as_ref
3128 (0.2%, 6.0%) 1 (0.0%, 0.0%) <ruff[cef4c65d96248843]::codes::RuleIter>::get
2669 (0.2%, 6.2%) 1 (0.0%, 0.0%) <<ruff[cef4c65d96248843]::settings::options::Options as serde[d89b1b632568f5a3]::de::Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]::de::Visitor>::visit_seq::<toml_edit[7e3a6c5e67260672]::de::array::ArraySeqAccess>
```
After:
```
Lines Copies Function name
----- ------ -------------
1710487 38900 (TOTAL)
10193 (0.6%, 0.6%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::codes::RuleCodePrefix>::iter
8107 (0.5%, 1.1%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::codes::Rule>::noqa_code
7345 (0.4%, 1.5%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::checkers::ast::Checker as ruff_python_ast[5588cd60041c8605]::visitor::Visitor>::visit_stmt
6412 (0.4%, 1.9%) 1 (0.0%, 0.0%) <<ruff[52408f46d2058296]::settings::options::Options as serde[d89b1b632568f5a3]::de::Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]::de::Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]::de::spanned::SpannedDeserializer<toml_edit[7e3a6c5e67260672]::de::value::ValueDeserializer>>
6412 (0.4%, 2.2%) 1 (0.0%, 0.0%) <<ruff[52408f46d2058296]::settings::options::Options as serde[d89b1b632568f5a3]::de::Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]::de::Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]::de::table::TableMapAccess>
6409 (0.4%, 2.6%) 1 (0.0%, 0.0%) <<ruff[52408f46d2058296]::settings::options::Options as serde[d89b1b632568f5a3]::de::Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]::de::Visitor>::visit_map::<toml_edit[7e3a6c5e67260672]::de::datetime::DatetimeDeserializer>
5696 (0.3%, 3.0%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::checkers::ast::Checker as ruff_python_ast[5588cd60041c8605]::visitor::Visitor>::visit_expr
4448 (0.3%, 3.2%) 1 (0.0%, 0.0%) ruff[52408f46d2058296]::flake8_to_ruff::converter::convert
3702 (0.2%, 3.4%) 1 (0.0%, 0.0%) <&ruff[52408f46d2058296]::registry::Linter as core[da82827a87f140f9]::iter::traits::collect::IntoIterator>::into_iter
3349 (0.2%, 3.6%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::registry::Linter>::code_for_rule
3132 (0.2%, 3.8%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::codes::Rule as core[da82827a87f140f9]::fmt::Debug>::fmt
3130 (0.2%, 4.0%) 1 (0.0%, 0.0%) <&str as core[da82827a87f140f9]::convert::From<&ruff[52408f46d2058296]::codes::Rule>>::from
3130 (0.2%, 4.2%) 1 (0.0%, 0.0%) <&str as core[da82827a87f140f9]::convert::From<ruff[52408f46d2058296]::codes::Rule>>::from
3130 (0.2%, 4.4%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::codes::Rule as core[da82827a87f140f9]::convert::AsRef<str>>::as_ref
3128 (0.2%, 4.5%) 1 (0.0%, 0.0%) <ruff[52408f46d2058296]::codes::RuleIter>::get
2669 (0.2%, 4.7%) 1 (0.0%, 0.0%) <<ruff[52408f46d2058296]::settings::options::Options as serde[d89b1b632568f5a3]::de::Deserialize>::deserialize::__Visitor as serde[d89b1b632568f5a3]::de::Visitor>::visit_seq::<toml_edit[7e3a6c5e67260672]::de::array::ArraySeqAccess>
2659 (0.2%, 4.9%) 1 (0.0%, 0.0%) <&ruff[52408f46d2058296]::codes::Pylint as core[da82827a87f140f9]::iter::traits::collect::IntoIterator>::into_iter
```
I'd assume this has a positive effect on both compile time and runtime, but I don't have a good way to measure the actual effect on compile times.
## Test Plan
Check CI for any performance regressions.
This should fix #3808 if we merge it.
* clippy
* Update update_ambiguous_characters.py
* Use dummy verbatim formatter for all nodes
* Use new formatter infrastructure in CLI and test
* Expose the new formatter in the CLI
* Merge import blocks
This adds a new rule `InvalidPyprojectToml` that lints pyproject.toml by checking whether https://github.com/PyO3/pyproject-toml-rs can parse it. The linting is currently quite basic: we don't check, for example, whether the name is actually a valid Python project name or appropriately normalized. It does catch errors such as invalid dependency requirements or problems with the license specification. It is open to being extended in the future (validating names, SPDX expressions, classifiers, ...), either in ruff or in pyproject-toml-rs. A minimal sketch of the idea follows.
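Conceptually, the rule amounts to deserializing the file with pyproject-toml-rs and surfacing any parse error as the diagnostic message. A rough sketch under that assumption (the function and wiring are illustrative, not ruff's actual code; it assumes the `pyproject-toml` and `toml` crates as dependencies):
```rust
use pyproject_toml::PyProjectToml;

/// Returns a RUF200-style message if `contents` is not a valid
/// pyproject.toml, or `None` if it parses cleanly.
fn check_pyproject(contents: &str) -> Option<String> {
    match toml::from_str::<PyProjectToml>(contents) {
        Ok(_) => None,
        Err(err) => Some(format!("Failed to parse pyproject.toml: {err}")),
    }
}

fn main() {
    // A `[project]` table without `name` violates the metadata spec,
    // so deserialization fails and the rule would fire.
    let bad = "[project]\nversion = \"0.1.0\"\n";
    assert!(check_pyproject(bad).is_some());
}
```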
Test plan:
```
scripts/ecosystem_all_check.sh check --select RUF200
```
This led to a bunch of
```
RUF200 Failed to parse pyproject.toml: missing field `name`
```
(e.g. https://github.com/amitsk/fastapi-todos/blob/main/pyproject.toml), which is indeed invalid per the spec (https://packaging.python.org/en/latest/specifications/declaring-project-metadata/#specification).
Filtering those out, the following other problems were found by `cd target/ecosystem_all_results/ && rg RUF200`:
```
UCL-ARC:rred-reports.stdout.txt
1:pyproject.toml:27:16: RUF200 Failed to parse pyproject.toml: Version specifier `>='3.9'` doesn't match PEP 440 rules
EndlessTrax:python-start-project.stdout.txt
1:pyproject.toml:14:16: RUF200 Failed to parse pyproject.toml: Expected package name starting with an alphanumeric character, found '#'
redjax:gardening-api.stdout.txt
1:pyproject.toml:7:11: RUF200 Failed to parse pyproject.toml: Version `` doesn't match PEP 440 rules
ajslater:codex.stdout.txt
2: 3:17 RUF200 Failed to parse pyproject.toml: invalid type: sequence, expected a string
LDmitriy7:404_AvatarsBot.stdout.txt
1:pyproject.toml:3:11: RUF200 Failed to parse pyproject.toml: Version `` doesn't match PEP 440 rules
ajslater:comicbox.stdout.txt
1:pyproject.toml:3:17: RUF200 Failed to parse pyproject.toml: invalid type: sequence, expected a string
manueldevillena:forecast-earnings.stdout.txt
1:pyproject.toml:24:12: RUF200 Failed to parse pyproject.toml: Expected one of `@`, `(`, `<`, `=`, `>`, `~`, `!`, `;`, found `^`
redjax:ohio_utility_scraper.stdout.txt
1:pyproject.toml:11:11: RUF200 Failed to parse pyproject.toml: Version `` doesn't match PEP 440 rules
agronholm:typeguard.stdout.txt
1:pyproject.toml:40:8: RUF200 Failed to parse pyproject.toml: Expected a valid marker name, found 'python_implementation'
cyuss:decathlon-turnover.stdout.txt
1:pyproject.toml:7:12: RUF200 Failed to parse pyproject.toml: invalid type: string "Youcef", expected a table with 'name' and 'email' keys
ajslater:boilerplate.stdout.txt
1:pyproject.toml:3:17: RUF200 Failed to parse pyproject.toml: invalid type: sequence, expected a string
kaparoo:lightning-project-template.stdout.txt
1:pyproject.toml:56:16: RUF200 Failed to parse pyproject.toml: You can't mix a >= operator with a local version (`+cu117`)
dijital20:pytexas2023-decorators.stdout.txt
1:pyproject.toml:5:11: RUF200 Failed to parse pyproject.toml: Version `` doesn't match PEP 440 rules
pfouque:django-anymail-history.stdout.txt
1:pyproject.toml:137:12: RUF200 Failed to parse pyproject.toml: Version specifier `> = 1.2.0` doesn't match PEP 440 rules
pfouque:django-fakemessages.stdout.txt
1:pyproject.toml:130:12: RUF200 Failed to parse pyproject.toml: Version specifier `> = 1.2.0` doesn't match PEP 440 rules
pypa:build.stdout.txt
1:tests/packages/test-invalid-requirements/pyproject.toml:2:12: RUF200 Failed to parse pyproject.toml: Expected one of `@`, `(`, `<`, `=`, `>`, `~`, `!`, `;`, found `i`
4:tests/packages/test-no-requires/pyproject.toml:1:1: RUF200 Failed to parse pyproject.toml: missing field `requires`
UnoYakshi:FRAAND.stdout.txt
2: 3:11 RUF200 Failed to parse pyproject.toml: Version `` doesn't match PEP 440 rules
DHolmanCoding:python-template.stdout.txt
1:pyproject.toml:22:1: RUF200 Failed to parse pyproject.toml: missing field `requires`
```
Overall, this emitted errors in 43 out of 3408 projects (counted with `rg -c RUF200 target/ecosystem_all_results/ | wc -l`).
Co-authored-by: Micha Reiser <micha@reiser.io>