## Summary
I don't know if this is actually a good change, but it tries to make the
editable install experience more consistent. Specifically, we now
support...
```
# Use a relative path with a `file://` prefix.
# Prior to this PR, we supported `file:../foo`, but not `file://../foo`, which felt inconsistent.
-e file://../foo
# Use environment variables with paths, not just URLs.
# Prior to this PR, we supported `file://${PROJECT_ROOT}/../foo`, but not the below.
-e ${PROJECT_ROOT}/../foo
```
Importantly, `-e file://../foo` is actually not supported by pip... `-e
file:../foo` _is_ supported though. We support both, as of this PR. Open
to feedback.
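As a rough illustration of the two behaviors (hypothetical helpers, not the PR's actual parsing code):
```rust
// Expand `${VAR}` references in a requirement string, then accept both
// `file:` and `file://` prefixes in front of a relative path.
fn expand_env_vars(given: &str) -> String {
    let mut out = String::new();
    let mut rest = given;
    while let Some(start) = rest.find("${") {
        out.push_str(&rest[..start]);
        match rest[start..].find('}') {
            Some(end) => {
                let name = &rest[start + 2..start + end];
                out.push_str(&std::env::var(name).unwrap_or_default());
                rest = &rest[start + end + 1..];
            }
            None => {
                // Unterminated `${`: keep it verbatim.
                out.push_str(&rest[start..]);
                rest = "";
            }
        }
    }
    out.push_str(rest);
    out
}

fn strip_file_prefix(given: &str) -> &str {
    given
        .strip_prefix("file://")
        .or_else(|| given.strip_prefix("file:"))
        .unwrap_or(given)
}
```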
On top of https://github.com/astral-sh/puffin/pull/947, we can also box
`PrioritizedDistribution`.
In a simple benchmark, this seems to slightly improve performance when
comparing only this commit to main, even though the benchmark is too
noisy to establish significance:
```
$ hyperfine --warmup 30 --runs 300 "target/profiling/main-dev resolve meine_stadt_transparent" "target/profiling/puffin-dev resolve meine_stadt_transparent"
Benchmark 1: target/profiling/main-dev resolve meine_stadt_transparent
Time (mean ± σ): 83.6 ms ± 2.0 ms [User: 77.7 ms, System: 20.0 ms]
Range (min … max): 81.4 ms … 98.2 ms 300 runs
Warning: Statistical outliers were detected. Consider re-running this benchmark on a quiet system without any interferences from other programs. It might help to use the '--warmup' or '--prepare' options.
Benchmark 2: target/profiling/puffin-dev resolve meine_stadt_transparent
Time (mean ± σ): 80.8 ms ± 2.2 ms [User: 75.4 ms, System: 19.5 ms]
Range (min … max): 78.6 ms … 98.6 ms 300 runs
Warning: Statistical outliers were detected. Consider re-running this benchmark on a quiet system without any interferences from other programs. It might help to use the '--warmup' or '--prepare' options.
Summary
target/profiling/puffin-dev resolve meine_stadt_transparent ran
1.03 ± 0.04 times faster than target/profiling/main-dev resolve meine_stadt_transparent
```
The effect on type sizes, however, is considerable ([downstack
PR](https://gist.github.com/konstin/38e6c774db541db46d61f1d4ea6b498f)
vs. [this
PR](https://gist.github.com/konstin/003a77fe7d7d246b0d535e3fc843cb36)):
```patch
--- branch.txt 2024-01-17 14:26:01.826085176 +0100
+++ boxed-prioritized-dist.txt 2024-01-17 14:25:57.101900963 +0100
@@ -1,19 +1,3 @@
-9264 alloc::collections::btree::node::InternalNode<pep440_rs::version::Version, distribution_types::PrioritizedDistribution> align=8
- 9168 data
- 96 edges
-
-9264 alloc::collections::btree::node::InternalNode<pep440_rs::Version, distribution_types::PrioritizedDistribution> align=8
- 9168 data
- 96 edges
-
-9168 alloc::collections::btree::node::LeafNode<pep440_rs::version::Version, distribution_types::PrioritizedDistribution> align=8
- 9064 vals
- 88 keys
-
-9168 alloc::collections::btree::node::LeafNode<pep440_rs::Version, distribution_types::PrioritizedDistribution> align=8
- 9064 vals
- 88 keys
-
8992 tokio::sync::mpsc::block::Block<hyper::client::dispatch::Envelope<http::request::Request<reqwest::async_impl::body::ImplStream>, http::response::Response<hyper::body::body::Body>>> align=8
8960 values
32 header
@@ -74,10 +58,23 @@
40 __tracing_attr_span
64 variant Unresumed, Returned, Panicked
+5648 {async fn body@crates/puffin-client/src/registry_client.rs:224:5: 224:30} align=8
+ 5647 variant Suspend0
+ 5576 __awaitee align=8
+ 40 __tracing_attr_span
```
This is https://github.com/astral-sh/puffin/pull/947 again, but this
time merging into main instead of downstack; sorry for the noise.
---
Windows has a default stack size of 1MB, which makes puffin often fail
with stack overflows. This PR reduces stack usage via three changes (a
sketch of the boxing pattern follows the list):
* Boxing `File` in `Dist`, reducing its size from 496 to 240 bytes.
* Boxing the largest futures.
* Boxing `CachePolicy`.
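The sketch, with a toy `File` type rather than the actual `Dist` definition:
```rust
// A large inline field inflates every value of the enum; a Box keeps
// the variant at pointer size, trading an extra allocation for it.
struct File {
    payload: [u8; 400], // stand-in for many real fields
}

enum Dist {
    // Inline, this variant would make every `Dist` at least 400 bytes:
    // Registry(File),
    // Boxed, it contributes only a pointer (8 bytes on 64-bit targets):
    Registry(Box<File>),
}

fn main() {
    let _dist = Dist::Registry(Box::new(File { payload: [0; 400] }));
    println!("{}", std::mem::size_of::<Dist>()); // prints 8
}
```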
## Method
Debugging happened on Linux, using
https://github.com/astral-sh/puffin/pull/941 to limit the stack size to
1MB. I ran the command below.
```
RUSTFLAGS=-Zprint-type-sizes cargo +nightly build -p puffin-cli -j 1 > type-sizes.txt && top-type-sizes -w -s -h 10 < type-sizes.txt > sizes.txt
```
The main drawback is that top-type-sizes doesn't say what the
`__awaitee` is, so it requires manually looking up a future with a
matching size. When the `brotli` feature on `reqwest` is active, a lot
of brotli types show up. Toggling the feature, however, seems to have no
effect; I assume they are false positives, since the `brotli` crate has
elaborate control over allocation. The sizes are therefore shown with
the feature off.
## Results
The largest future goes from 12208B to 6416B; the largest type
(`PrioritizedDistribution`, see also #948) goes from 17448B to 9264B.
Full diff:
https://gist.github.com/konstin/62635c0d12110a616a1b2bfcde21304f
For the second commit, I iteratively boxed the largest futures until the
tests passed, then, with an 800KB stack limit, looked through the
backtrace of a failing test and added some more boxing.
Quick benchmarking showed no difference:
```console
$ hyperfine --warmup 2 "target/profiling/main-dev resolve meine_stadt_transparent" "target/profiling/puffin-dev resolve meine_stadt_transparent"
Benchmark 1: target/profiling/main-dev resolve meine_stadt_transparent
Time (mean ± σ): 49.2 ms ± 3.0 ms [User: 39.8 ms, System: 24.0 ms]
Range (min … max): 46.6 ms … 63.0 ms 55 runs
Warning: Statistical outliers were detected. Consider re-running this benchmark on a quiet system without any interferences from other programs. It might help to use the '--warmup' or '--prepare' options.
Benchmark 2: target/profiling/puffin-dev resolve meine_stadt_transparent
Time (mean ± σ): 47.4 ms ± 3.2 ms [User: 41.3 ms, System: 20.6 ms]
Range (min … max): 44.6 ms … 60.5 ms 62 runs
Warning: Statistical outliers were detected. Consider re-running this benchmark on a quiet system without any interferences from other programs. It might help to use the '--warmup' or '--prepare' options.
Summary
target/profiling/puffin-dev resolve meine_stadt_transparent ran
1.04 ± 0.09 times faster than target/profiling/main-dev resolve meine_stadt_transparent
```
By default, Windows has a stack size limit of 1MB, which we run up
against in debug builds without any single explicit culprit. A new
environment variable `PUFFIN_STACK_SIZE` allows setting an artificially
smaller stack size.
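One common way to honor such a variable is to run the real entry point on a spawned thread with the requested stack size (a minimal sketch; the variable name is from this PR, the default and structure here are illustrative):
```rust
use std::env;
use std::thread;

fn main() {
    // Read the override, falling back to a fixed default if unset or invalid.
    let stack_size = env::var("PUFFIN_STACK_SIZE")
        .ok()
        .and_then(|v| v.parse::<usize>().ok())
        .unwrap_or(2 * 1024 * 1024);

    // Run the real entry point on a thread with the requested stack size,
    // since the main thread's stack cannot be resized after startup.
    thread::Builder::new()
        .stack_size(stack_size)
        .spawn(run)
        .expect("failed to spawn main thread")
        .join()
        .expect("main thread panicked");
}

fn run() {
    // ... the actual CLI entry point ...
}
```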
## Summary
I was confused by why `VerbatimUrl` was stored on `Path`. Since it's
directly computed from the path, I think we should just compute it
as-needed. It's also possibly buggy, because the URL is the URL of the
_directory_, not the artifact itself, which differs from other
distributions.
Missing piece for the release.
## Test Plan
Built the image locally:
```shell
❯ docker run 99956098e1f8f04e209dcfc4a0afcee67df1fe8a726c164884e67f035b1a0f42
Usage: puffin [OPTIONS] <COMMAND>
Commands:
pip Resolve and install Python packages
venv Create a virtual environment
clean Clear the cache
help Print this message or the help of the given subcommand(s)
Options:
-q, --quiet Do not print any output
-v, --verbose Use verbose output
-n, --no-cache Avoid reading from or writing to the cache
--cache-dir <CACHE_DIR> Path to the cache directory [env: PUFFIN_CACHE_DIR=]
-h, --help Print help
-V, --version Print version
```
## Summary
This PR adds a release workflow powered by `cargo-dist`. It's similar to
the version that's PR'd in Ruff
(https://github.com/astral-sh/ruff/pull/9559), with the exception that
it doesn't include the Docker build or the "update dependents" step for
pre-commit.
## Summary
This PR is like #957, but for validating the virtual environment, rather
than the cache. So, if you have a local wheel, and you rebuild it, we'll
now correctly uninstall and reinstall it in the virtual environment.
## Summary
- This was inherited from
d719988323/src/metadata.rs (LL78C2-L91C26)
- ...which introduced this code here:
9cd1d43f7c
- ...with the originating issue here:
https://github.com/PyO3/maturin/issues/612
- ...and the upstream issue here:
https://github.com/staktrace/mailparse/issues/50
It seems like the goal was to support Unicode in certain header fields,
but I don't think this is necessary for us. We only use
`get_first_value` for `Requires-Python`, which has to be ASCII, doesn't
it?
In my testing, it seems like the `charset` hack can also be removed. The
tests I copied over actually work without it, which makes me a bit
skeptical.
The main benefit here is that we get to remove a _big_ dependency
stack, including Chumsky, Stacker, and psm, which have limited
cross-platform support.
## Summary
This is a small correctness improvement that ensures that we avoid using
stale cache entries for local dependencies in the install plan. We
already have some logic like this in the source distribution builder,
but it didn't apply in the install plan, and so we'd end up using stale
wheels.
Specifically, now, if you create a new local wheel, and run `pip sync`,
we'll mark the cache entries as stale and make sure we unzip it and
install it. (If the wheel is _already_ installed, we won't reinstall it
though, which will be a separate change. This is just about reading from
the cache, not the environment.)
Fixes #965
We have to canonicalize the interpreter path; otherwise, the home is set
to the venv directory instead of the real root. This would make
python-build-standalone fail with the `encodings` module not being
found, because its home is wrong.
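A minimal sketch of the idea (hypothetical helper, simplified error handling): resolve symlinks before deriving the home from the interpreter path.
```rust
use std::io;
use std::path::{Path, PathBuf};

// A venv's `bin/python` is typically a symlink to the real interpreter;
// `canonicalize` follows it, so the parent directory is the real
// installation's `bin/`, not the venv's.
fn interpreter_home(executable: &Path) -> io::Result<PathBuf> {
    let real = executable.canonicalize()?;
    match real.parent() {
        Some(home) => Ok(home.to_path_buf()),
        None => Ok(real),
    }
}
```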
The `InstallPlan` does a lot of work in the constructor, which I tend to
feel is an anti-pattern. With cache refresh, it's also going to need to
be made `async`, so it really feels like it should be a clearer method
rather than an async, fallible constructor that does a bunch of IO. This
PR splits it into a `Planner` (with a `build` method) and a `Plan`.
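A sketch of that shape with stub types (illustrative only, not the PR's actual signatures):
```rust
struct Requirement(String);
struct Cache;

/// The output: a plain data structure, cheap to construct and inspect.
struct Plan {
    to_install: Vec<Requirement>,
}

/// The builder: holds inputs, does no work until `build` is called.
struct Planner<'a> {
    requirements: &'a [Requirement],
}

impl<'a> Planner<'a> {
    /// Trivial, infallible constructor.
    fn new(requirements: &'a [Requirement]) -> Self {
        Self { requirements }
    }

    /// All fallible, IO-heavy (and eventually async) work lives here,
    /// in a clearly named method rather than a constructor.
    fn build(self, _cache: &Cache) -> std::io::Result<Plan> {
        Ok(Plan {
            to_install: self
                .requirements
                .iter()
                .map(|r| Requirement(r.0.clone()))
                .collect(),
        })
    }
}
```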
## Summary
It turns out that storing an absolute URL for every file caused a
significant performance regression. This PR attempts to address the
regression with two changes.
The first is that we now store the raw string if the URL is an absolute
URL. If the URL is relative, we store the base URL alongside the raw
relative string. As such, we avoid serializing and deserializing URLs
until we need them (later on), except for the base URL.
The second is that we now use the internal `Url` crate methods for
serializing and deserializing. If you look inside `Url`, its standard
serializer and deserializer actually convert the URL to a string, then
parse the string. But the crate exposes some other methods for faster
serialization and deserialization (with fewer guarantees). I think this
is totally fine since the cache is entirely internal.
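A sketch of the first change under illustrative names (the PR's actual types may differ): absolute URLs stay as raw strings, relative ones carry their base, and parsing is deferred until the URL is needed.
```rust
use url::Url;

enum FileLocation {
    /// An absolute URL, kept as a raw string until it's actually needed.
    Absolute(String),
    /// A relative URL plus the base it resolves against; only the base
    /// is stored as a parsed `Url`.
    Relative { base: Url, path: String },
}

impl FileLocation {
    /// Parse/resolve on demand instead of at deserialization time.
    fn to_url(&self) -> Result<Url, url::ParseError> {
        match self {
            FileLocation::Absolute(raw) => Url::parse(raw),
            FileLocation::Relative { base, path } => base.join(path),
        }
    }
}
```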
If we _just_ change the `Url` serialization (and no other code -- so
continue to store URLs for every file), then the regression goes down to
about 5%:
```shell
❯ python -m scripts.bench \
--puffin-path ./target/release/main \
--puffin-path ./target/release/relative --puffin-path ./target/release/puffin \
scripts/requirements/home-assistant.in --benchmark resolve-warm
Benchmark 1: ./target/release/main (resolve-warm)
Time (mean ± σ): 496.3 ms ± 4.3 ms [User: 452.4 ms, System: 175.5 ms]
Range (min … max): 487.3 ms … 502.4 ms 10 runs
Benchmark 2: ./target/release/relative (resolve-warm)
Time (mean ± σ): 284.8 ms ± 2.1 ms [User: 245.8 ms, System: 165.6 ms]
Range (min … max): 280.3 ms … 288.0 ms 10 runs
Benchmark 3: ./target/release/puffin (resolve-warm)
Time (mean ± σ): 300.4 ms ± 3.2 ms [User: 255.5 ms, System: 178.1 ms]
Range (min … max): 295.4 ms … 305.1 ms 10 runs
Summary
'./target/release/relative (resolve-warm)' ran
1.05 ± 0.01 times faster than './target/release/puffin (resolve-warm)'
1.74 ± 0.02 times faster than './target/release/main (resolve-warm)'
```
So I considered _just_ making that change. But 5% is kind of
borderline...
With both of these changes, the regression is down to 1-2%:
```
Benchmark 1: ./target/release/relative (resolve-warm)
Time (mean ± σ): 282.6 ms ± 7.4 ms [User: 244.6 ms, System: 181.3 ms]
Range (min … max): 275.1 ms … 318.5 ms 30 runs
Benchmark 2: ./target/release/puffin (resolve-warm)
Time (mean ± σ): 286.8 ms ± 2.2 ms [User: 247.0 ms, System: 169.1 ms]
Range (min … max): 282.3 ms … 290.7 ms 30 runs
Summary
'./target/release/relative (resolve-warm)' ran
1.01 ± 0.03 times faster than './target/release/puffin (resolve-warm)'
```
It's consistently ~2%-ish, but at this point it's unclear if that's due
to the URL change or some other change between now and then.
Closes #943.
On Ubuntu with Python 3.10,
```
cargo run -q -- pip-install --find-links https://storage.googleapis.com/jax-releases/jax_cuda_releases.html "jax[cuda12_pip]==0.4.23"
```
non-deterministically, but for me consistently, fails to install with
messages such as
```
error: Failed to install: nvidia_nccl_cu12-2.19.3-py3-none-manylinux1_x86_64.whl (nvidia-nccl-cu12==2.19.3)
Caused by: failed to remove file `/home/konsti/projects/puffin/.venv/lib/python3.10/site-packages/nvidia/__init__.py`
Caused by: No such file or directory (os error 2)
```
```
error: Failed to install: nvidia_cublas_cu12-12.3.4.1-py3-none-manylinux1_x86_64.whl (nvidia-cublas-cu12==12.3.4.1)
Caused by: Replacing an existing file or directory failed
```
```
error: Failed to install: nvidia_cuda_nvcc_cu12-12.3.107-py3-none-manylinux1_x86_64.whl (nvidia-cuda-nvcc-cu12==12.3.107)
Caused by: failed to hardlink file from /home/konsti/.cache/puffin/wheels-v0/pypi/nvidia-cuda-nvcc-cu12/nvidia_cuda_nvcc_cu12-12.3.107-py3-none-manylinux1_x86_64/nvidia/__init__.py to /home/konsti/projects/puffin/.venv/lib/python3.10/site-packages/nvidia/__init__.py
Caused by: File exists (os error 17)
```
We install a lot of NVIDIA packages that all contain
`nvidia/__init__.py`, since they all install themselves into the
`nvidia` module:
```
nvidia-cublas-cu12==12.3.4.1
nvidia-cuda-cupti-cu12==12.3.101
nvidia-cuda-nvcc-cu12==12.3.107
nvidia-cuda-nvrtc-cu12==12.3.107
nvidia-cuda-runtime-cu12==12.3.101
nvidia-cudnn-cu12==8.9.7.29
nvidia-cufft-cu12==11.0.12.1
nvidia-cusolver-cu12==11.5.4.101
nvidia-cusparse-cu12==12.2.0.103
nvidia-nccl-cu12==2.19.3
nvidia-nvjitlink-cu12==12.3.101
```
```
$ tree -L 1 .venv/lib/python3.10/site-packages/nvidia
.venv/lib/python3.10/site-packages/nvidia
├── cublas
├── cuda_cupti
├── cuda_nvcc
├── cuda_nvrtc
├── cuda_runtime
├── cudnn
├── cufft
├── cusolver
├── cusparse
├── __init__.py
├── nccl
└── nvjitlink
```
When installing, we get a race condition, since each package
installation runs in its own thread:
* Installer Thread 1 creates `nvidia/__init__.py`
* Installer Thread 2 sees an existing `nvidia/__init__.py`
* Installer Thread 3 sees an existing `nvidia/__init__.py`
* Installer Thread 2 removes `nvidia/__init__.py`
* Installer Thread 3 tries to remove `nvidia/__init__.py`, but it no
longer exists -> failure.
We switch to a new strategy: when the target file exists, we don't
remove it; instead, we hardlink the source file to a tempfile first,
then rename the tempfile to the target file. Renaming is considered an
atomic operation.
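A minimal sketch of that strategy (simplified; a real implementation would pick a unique tempfile name):
```rust
use std::fs;
use std::io;
use std::path::Path;

fn hardlink_or_replace(src: &Path, dst: &Path) -> io::Result<()> {
    match fs::hard_link(src, dst) {
        Ok(()) => Ok(()),
        // Another package already shipped this file: link to a tempfile in
        // the same directory, then rename over the target. Rename replaces
        // the destination atomically, so no thread ever sees a missing file.
        Err(err) if err.kind() == io::ErrorKind::AlreadyExists => {
            let tmp = dst.with_extension("tmp");
            fs::hard_link(src, &tmp)?;
            fs::rename(&tmp, dst)
        }
        Err(err) => Err(err),
    }
}
```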
I've put the logging at debug level because these cases, while rare,
indicate a conflict between two packages.
Closes #925
---------
Co-authored-by: Charlie Marsh <charlie.r.marsh@gmail.com>
## Summary
This PR uses a single `Index` that's shared between the top-level
resolver and any sub-resolutions that happen in the course of that
top-level resolution (namely, to resolve build dependencies for any
source distributions).
In theory it's an optimization, since (e.g.) if we have two packages
that both need the `flit-core` build system, and we attempt to build
them both at once, we'll only fetch its metadata _once_, and share it
across the two resolutions. In practice, I haven't been able to get this
to show up in benchmarks. I suspect you'd need a _lot_ of source
distributions for it to matter... Though it may still be worth doing, as
it strikes me as a cleaner design.
Closes #200.
Closes #541.
## Summary
This fixes an extremely subtle bug in `pip install --reinstall`, whereby
if you depend on `setuptools` at the top level, we end up uninstalling
it after resolving, which breaks some cached state. If we have
`--reinstall`, we need to reset that cached state between resolving and
installing.
## Test Plan
Running `pip install --reinstall` with:
```txt
setuptools
devpi @ e334eb4dc9bb023329e4b610e4515b/devpi-2.2.0.tar.gz
```
Fails on `main`, but passes with this change.
## Summary
This PR fixes a subtle bug in `pip install` when using `--reinstall`. If
a package depends on a build system directly (e.g., `waitress` depends
on `setuptools`), and then you have other packages that also need the
build system to build a source distribution, right now, we don't share
the `OnceMap` between those cases.
This lifts the `InFlight` tracking up a level, so that it's initialized
once per command, then shared everywhere.
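For illustration, the kind of deduplication a shared once-map provides looks roughly like this (a toy sketch assuming tokio, not puffin's actual `OnceMap` API):
```rust
use std::collections::HashMap;
use std::future::Future;
use std::hash::Hash;
use std::sync::{Arc, Mutex};
use tokio::sync::OnceCell;

struct OnceMap<K, V>(Mutex<HashMap<K, Arc<OnceCell<V>>>>);

impl<K: Hash + Eq, V: Clone> OnceMap<K, V> {
    fn new() -> Self {
        OnceMap(Mutex::new(HashMap::new()))
    }

    /// The first caller for `key` runs `init`; concurrent callers for the
    /// same key await that result instead of redoing the work. Sharing one
    /// map per command extends this deduplication across all resolutions.
    async fn get_or_init<F, Fut>(&self, key: K, init: F) -> V
    where
        F: FnOnce() -> Fut,
        Fut: Future<Output = V>,
    {
        // Clone the cell out so the lock isn't held across the await.
        let cell = self.0.lock().unwrap().entry(key).or_default().clone();
        cell.get_or_init(init).await.clone()
    }
}
```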
## Test Plan
I'm having trouble coming up with an identical test case, and I'm
hesitant to add this slow test to the suite... But if you run `pip
install --reinstall` with:
```
waitress @ git+https://github.com/zanieb/waitress
devpi-server @ git+https://github.com/zanieb/devpi#subdirectory=server
```
It fails consistently on `main` and passes here.
## Summary
This makes the separation clearer between the legacy `pip` API and the
API we'll add in the future for the package manager itself. It also
enables seamless `puffin pip` aliasing for those that want it.
Closes #918.
## Summary
This PR restructures the flat index fetching in a few ways:
1. It now lives in its own `FlatIndexClient`, since it felt a bit
awkward (in my opinion) for it to live in `RegistryClient`.
2. We now fetch the `FlatIndex` outside of the resolver. This has a few
benefits: (1) the resolver constructor is no longer `async` and no
longer returns `Result`, which feels better for a resolver; and (2) we
can share the `FlatIndex` across resolutions rather than re-fetching it
for every source distribution build.
## Summary
`FlatIndex` is now the thing that's keyed on `PackageName`, while
`FlatDistributions` is what used to be called `FlatIndex` (a map from
version to `PrioritizedDistribution`, for a single package). I find this
a bit clearer, since we can also remove the `from_files` that doesn't
return `Self`, which I had trouble following.
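Roughly, the resulting shape (stub types, illustrative only):
```rust
use std::collections::{BTreeMap, HashMap};

// Stand-ins for the real types.
struct PackageName(String);
struct Version(String);
struct PrioritizedDistribution;

/// Per-package: a map from version to the best distribution for it
/// (what used to be called `FlatIndex`).
struct FlatDistributions(BTreeMap<Version, PrioritizedDistribution>);

/// The whole flat index, keyed on package name.
struct FlatIndex(HashMap<PackageName, FlatDistributions>);
```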
## Summary
I'm running into some annoyances converting `&Version` to
`&PubGrubVersion` (which is just a wrapper type around `Version`), and I
realized... We don't even need `PubGrubVersion`?
The reason we "need" it today is due to the orphan trait rule: `Version`
is defined in `pep440_rs`, but we want to `impl
pubgrub::version::Version for Version` in the resolver crate.
Instead of introducing a new type here, which leads to a lot of
awkwardness around conversion and API isolation, what if we instead just
implement `pubgrub::version::Version` in `pep440_rs` via a feature? That
way, we can just use `Version` everywhere without any confusion and
conversion for the wrapper type.
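A sketch of the feature-gated approach with a toy version type (assuming pubgrub 0.2's `Version` trait, which requires `lowest` and `bump` on a `Clone + Ord + Debug + Display` type; the real `pep440_rs::Version` is more involved):
```rust
#[derive(Clone, Debug, PartialEq, Eq, PartialOrd, Ord, Hash)]
pub struct Version(Vec<u64>);

impl std::fmt::Display for Version {
    fn fmt(&self, f: &mut std::fmt::Formatter<'_>) -> std::fmt::Result {
        let parts: Vec<String> = self.0.iter().map(|p| p.to_string()).collect();
        write!(f, "{}", parts.join("."))
    }
}

// Only compiled when the (hypothetical) `pubgrub` feature is enabled, so
// `pep440_rs` doesn't unconditionally depend on pubgrub, and the orphan
// rule is satisfied because `Version` is local to this crate.
#[cfg(feature = "pubgrub")]
impl pubgrub::version::Version for Version {
    fn lowest() -> Self {
        Version(vec![0])
    }

    fn bump(&self) -> Self {
        // Toy bump: increment the last segment (never empty, see `lowest`).
        let mut parts = self.0.clone();
        *parts.last_mut().unwrap() += 1;
        Version(parts)
    }
}
```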
Add directory `--find-links` support for local paths to pip-compile.
It seems that pip joins all sources and then picks the best package. We
explicitly give find-links packages precedence when the same package
exists both on an index and locally, by prefilling the `VersionMap`;
otherwise, they are added as another index and the existing precedence
rules apply.
Internally, the feature is called _flat index_, which is more meaningful
than _find links_: we're not looking for links; we're picking up local
directories, and (TBD) will support another index format that's just a
flat list of files instead of a nested index.
`RegistryBuiltDist` and `RegistrySourceDist` now use `WheelFilename` and
`SourceDistFilename` respectively. The `File` inside `RegistryBuiltDist`
and `RegistrySourceDist` gained the ability to represent both a URL and
a path, so that `--find-links` with a URL and with a path works the
same, both being locked as `<package_name>@<version>` instead of
`<package_name> @ <url>`. (This is more of a detail; this PR would in
general still work if we stripped that and represented directory
find-links as `<package_name> @ file:///path/to/file.ext`.)
`PrioritizedDistribution` and `FlatIndex` have been moved to locations
where we can use them in the upstack PR.
I added a `scripts/wheels` directory with stripped down wheels to use
for testing.
We're lacking tests for correct tag priority precedence with flat
indexes; I only confirmed this manually, since it is not covered in the
pip-compile or pip-sync output.
Closes #876
There is no guarantee that indexes provide hashes at all, or
specifically the SHA256 hashes we support. [PEP
503](https://peps.python.org/pep-0503/#specification):
> The URL SHOULD include a hash in the form of a URL fragment with the
following syntax: #<hashname>=<hashvalue>, where <hashname> is the
lowercase name of the hash function (such as sha256) and <hashvalue> is
the hex encoded digest.
We instead use the URL as input to generate a hash when caching.
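A minimal sketch of deriving a cache key from the URL (assuming the `sha2` and `hex` crates; the hash and key format puffin actually uses may differ):
```rust
use sha2::{Digest, Sha256};

/// Hash the file's URL rather than trusting index-provided hashes, which
/// may be absent or use an unsupported algorithm.
fn cache_key(url: &str) -> String {
    hex::encode(Sha256::digest(url.as_bytes()))
}
```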
## Summary
We always normalize extra names in our requirements (e.g., `cuda12_pip`
to `cuda12-pip`), but we weren't normalizing within PEP 508 markers,
which meant we ended up comparing `cuda12-pip` (normalized) against
`cuda12_pip` (unnormalized).
Closes https://github.com/astral-sh/puffin/issues/911.
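For reference, PEP 503/685 normalization lowercases the name and collapses runs of `-`, `_`, and `.` into a single `-`; a minimal sketch (hypothetical helper, not puffin's actual implementation):
```rust
/// Normalize an extra name per PEP 503/685, so that `cuda12_pip`
/// and `cuda12-pip` compare equal after normalization.
fn normalize(name: &str) -> String {
    let mut out = String::with_capacity(name.len());
    let mut prev_sep = false;
    for c in name.chars() {
        if matches!(c, '-' | '_' | '.') {
            // Collapse any run of separators into a single `-`.
            if !prev_sep {
                out.push('-');
            }
            prev_sep = true;
        } else {
            out.extend(c.to_lowercase());
            prev_sep = false;
        }
    }
    out
}

fn main() {
    assert_eq!(normalize("cuda12_pip"), normalize("cuda12-pip")); // "cuda12-pip"
}
```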
Previously, the URL on `File` could be either a relative or an absolute
URL, depending on the index, and we would finalize it lazily. Now, we
finalize the URL when converting `pypi_types::File` to
`distribution_types::File`. This change is required to make the hashes
on `File` optional (https://github.com/astral-sh/puffin/pull/910), which
are currently the only unique field usable for caching.
## Summary
This PR ensures that when the user passes in `--python-version`, we
adjust the _markers_ to match the target version, thus forcing us to
select compatible wheels for the `--python-version`, rather than the
installed version.
## Context
Let's call Python 3.10 the "installed" environment and Python 3.12 the
"target" environment. For each version, we have _both_ a Python version
(to match against `Requires-Python`) and a set of tags (to match against
wheels).
The rules for resolution are as follows...
- For each package, for each version, we try to find the "best
candidate" for resolution and installation.
- We first look for a wheel that's compatible with the _target_
environment. This requires testing against both the `Requires-Python`
and the markers. (We won't have to build or run this code, so the
_installed_ version is irrelevant.) **(This PR corrects _this_ bullet --
previously, we validated against the _installed_ markers, rather than
the target markers.)**
- If we can't find a compatible wheel, we accept any _incompatible_
wheel as long as there's a source distribution. The source distribution
_must_ be compatible with the target environment. (We won't have to
build or run this code, so the _installed_ version is irrelevant.)
- If there are no wheels, then the source distribution must be
compatible with _both_ the installed and target environments, since we
need to build it.
This is all true for the top-level resolution. When we perform a
sub-resolution (when resolving the build dependencies of a source
distribution), we should _only_ use the installed environment, and
ignore the target environment, since we assume that the dependencies
will be the same in both environments once built -- so our goal is
"just" to build the distribution, without concern for which build
dependencies it uses.
Closes https://github.com/astral-sh/puffin/issues/883.
Remove a test case from `install_editable` that slows it down from
3.6s to 6.5s while providing little test coverage. It also seems to
block other tests sometimes; `cargo nextest run -E "test(editable)"
--all-features` has more consistent and lower runtimes. Surprisingly,
this seems to have a bigger effect than switching from pyo3 to cffi.
Used test commands:
```
rm -rf scripts/editable-installs/maturin_editable/target/ && time cargo nextest run -E "test(=install_editable)" --all-features
rm -rf scripts/editable-installs/maturin_editable/target/ && time cargo nextest run -E "test(editable)" --all-features
```
Part of #878
Replace the DTLSsocket test with a dummy package that does nothing but
contain the build system specs that we need. This should speed up one of
the slowest tests.
Part of #878
## Summary
Now that `get_or_build_wheel` will often _also_ handle the unzip step,
we need to move our per-target locking (`OnceMap`) up a level.
Previously, it was only applied to the unzip step, to prevent us from
attempting to unzip into the same target concurrently; now, it's applied
at the `get_wheel` level, which includes both downloading and unzipping.
## Test Plan
It seems like none of our existing tests catch this -- perhaps because
they're too "simple"? You need to run into a situation in which you're
doing multiple source distribution builds concurrently (since they'll
all try to download `setuptools`):
```
rm -rf foo && virtualenv --clear .venv && cargo run -p puffin-cli -- pip-compile ./scripts/requirements/pydantic.in --verbose --cache-dir foo
```
Reduces the number of implementation branches handling `Range::full`,
deferring it to `PackageRange`.
Improves some user-facing messages, e.g. saying `all versions of
<package>` instead of `<package>*`.
Changes the member names of the `PackageRangeKind` enum, which were not
very clear.
## Summary
Installs the seed packages you get with `virtualenv`, but opt-in rather
than opt-out.
Closes https://github.com/astral-sh/puffin/issues/852.
## Test Plan
```
❯ ./scripts/benchmarks/venv.sh
+ hyperfine --runs 20 --warmup 3 --prepare 'rm -rf .venv' './target/release/puffin venv' --prepare 'rm -rf .venv' 'virtualenv --without-pip .venv' --prepare 'rm -rf .venv' 'python -m venv --without-pip .venv'
Benchmark 1: ./target/release/puffin venv
Time (mean ± σ): 4.6 ms ± 0.2 ms [User: 2.4 ms, System: 3.6 ms]
Range (min … max): 4.3 ms … 4.9 ms 20 runs
Warning: Command took less than 5 ms to complete. Note that the results might be inaccurate because hyperfine can not calibrate the shell startup time much more precise than this limit. You can try to use the `-N`/`--shell=none` option to disable the shell completely.
Benchmark 2: virtualenv --without-pip .venv
Time (mean ± σ): 73.3 ms ± 0.3 ms [User: 57.4 ms, System: 14.2 ms]
Range (min … max): 72.8 ms … 74.0 ms 20 runs
Benchmark 3: python -m venv --without-pip .venv
Time (mean ± σ): 22.5 ms ± 0.3 ms [User: 17.0 ms, System: 4.9 ms]
Range (min … max): 22.0 ms … 23.2 ms 20 runs
Summary
'./target/release/puffin venv' ran
4.92 ± 0.20 times faster than 'python -m venv --without-pip .venv'
16.00 ± 0.63 times faster than 'virtualenv --without-pip .venv'
+ hyperfine --runs 20 --warmup 3 --prepare 'rm -rf .venv' './target/release/puffin venv --seed' --prepare 'rm -rf .venv' 'virtualenv .venv' --prepare 'rm -rf .venv' 'python -m venv .venv'
Benchmark 1: ./target/release/puffin venv --seed
Time (mean ± σ): 20.2 ms ± 0.4 ms [User: 8.6 ms, System: 15.7 ms]
Range (min … max): 19.7 ms … 21.2 ms 20 runs
Benchmark 2: virtualenv .venv
Time (mean ± σ): 135.1 ms ± 2.4 ms [User: 66.7 ms, System: 65.7 ms]
Range (min … max): 133.2 ms … 142.8 ms 20 runs
Benchmark 3: python -m venv .venv
Time (mean ± σ): 1.656 s ± 0.014 s [User: 1.447 s, System: 0.186 s]
Range (min … max): 1.641 s … 1.697 s 20 runs
Summary
'./target/release/puffin venv --seed' ran
6.67 ± 0.17 times faster than 'virtualenv .venv'
81.79 ± 1.70 times faster than 'python -m venv .venv'
```
## Summary
Our current setup uses the legacy `setup.py`-based builds if a
`pyproject.toml` file isn't present. This matches pip's behavior.
However, `pypa/build` uses PEP 517-based builds in such cases, and it
looks like pip plans to make that the default
(https://github.com/pypa/pip/issues/9175), with the limiting factor
being performance issues related to isolated builds.
This is now the default behavior, but the `--legacy-setup-py` flag
allows users to opt-in to using `setup.py` directly for distributions
that lack a `pyproject.toml`.
## Summary
This PR adds support for `prepare_metadata_for_build_wheel`, which
allows us to determine source distribution metadata without building the
source distribution. This represents an optimization for the resolver,
as we can skip the expensive build phase for build backends that support
it.
For reference, `prepare_metadata_for_build_wheel` seems to be supported
by:
- `hatchling` (as of
[1.9.0](https://hatch.pypa.io/latest/history/hatchling/#hatchling-v1.9.0)).
- `flit`
- `setuptools`
In fact, it seems to work for every backend _except_ those using legacy
`setup.py`.
Closes #599.
Refactoring split out from find-links support: find-links files can be
represented as `Dist`, but not really as `File`, since they have neither
a URL nor hashes.
`DistRequiresPython` is somewhat odd as an in-between type.
Changes `File::size` from a `usize` to a `u64`.
The motivation is that with tensorflow wheels weighing in at 475 MB
(https://pypi.org/project/tensorflow/2.15.0.post1/#files), we're already
only one order of magnitude away from the 32-bit `usize` limit, and
using `u64` avoids target-dependent failures.
## Summary
If pre-releases are available for a package that we otherwise couldn't
resolve, we now show a hint that includes one of the example versions.
Closes https://github.com/astral-sh/puffin/issues/811.
Instead of trying to fix up _all_ the invalid version specifiers on PyPI
and elsewhere, this filters out distributions whose `requires-python`
version specifiers even `LenientVersionSpecifiers` couldn't parse, as
opposed to failing entirely, which we currently do.
It would be nicer to model this through an invalid-distribution pubgrub
type, together with e.g. source dists with an unknown extension, so that
the version itself still shows up in the error trace.
At the same time, we reduce the log level for fixups from warning to
trace, as they are not actionable for the user.
Adjusts the display of "no versions available" in error messages to be
consistent with other package/range pairings, i.e., we usually display
`<package-name><range>`.
## Summary
Right now, both the callback _and_ the "We have no compatible wheel"
paths have a lot of repeated code. This PR changes the callback to
_just_ remove all the wheels and handle the download, and the rest of
the method following the callback is responsible for finding and
building any wheels.
## Summary
I'm unable to run `puffin-cli` on `main`, as the
`tracing-durations-export` dependency is marked as optional, but the
crate actually depends on it to compile. Further, without
`tracing-durations-export`, there are `Option` types that can't resolve
to a concrete type.
This PR fixes compilation with and without the feature.
Noticed these when working on something unrelated. Generally (a sketch combining both points follows the list):
- Prefer `entry.file_type()` over `entry.path().is_file()` or similar,
as the former is almost always free on Unix.
- Call `entry.path()` once, since it allocates internally (returns a
`PathBuf`).
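A combined sketch (hypothetical helper; the points above are general `std::fs` behavior):
```rust
use std::fs;
use std::io;
use std::path::{Path, PathBuf};

fn list_files(dir: &Path) -> io::Result<Vec<PathBuf>> {
    let mut files = Vec::new();
    for entry in fs::read_dir(dir)? {
        let entry = entry?;
        // `file_type()` is almost always free on Unix (readdir already
        // returned it), whereas `entry.path().is_file()` issues a stat.
        if entry.file_type()?.is_file() {
            // `path()` allocates a new PathBuf, so call it once and keep it.
            files.push(entry.path());
        }
    }
    Ok(files)
}
```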