-
-
Notifications
You must be signed in to change notification settings - Fork 14.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add yarn-berry3 and yarn-berry4 to fetch and use version 2 yarn.lock files #355053
base: master
Are you sure you want to change the base?
Conversation
975cc57
to
e3d06a8
Compare
Note also the CI failure. |
This is caused by not adding the
|
Hmm seems like a false negative to me. Let's ask the advice of @NixOS/nixpkgs-vet . |
e3d06a8
to
5f0eecf
Compare
Think this recommendation applies here: https://github.com/NixOS/nixpkgs/tree/master/pkgs/by-name#recommendation-for-new-packages-with-multiple-versions |
5f0eecf
to
e2615ea
Compare
Awesome, thanks for the pointer! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks pretty good, though I wouldn't feel comfortable to merge this because I don't know the yarn config set
tings in yarnBerryConfigHook
but they looks pretty reasonable.
76670ba
to
51531b2
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is a no-go in its current state.
mkdir -p $out | ||
'' | ||
+ lib.optionalString (yarnVersion > 1) '' | ||
yarn install --immutable --mode skip-build |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This will break the FODs almost instantly if Yarn changes anything in how they pull deps, and also is probably not reproducible between platforms when platform-specific dependencies are used.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Valid point. This is partly why there are two yarn versions.
Also this could be said for all external package managers, not just yarn-berry.
I'm uncertain about your point about platform specific dependencies. I'll have to look into it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Platform specific dependencies should be made reproducible by explicitly setting supportedArchitectures
. Your point about Yarn breaking how they fetch deps may be valid depending on the upstream stability guarantees. Explicitly picking the version of yarn somewhat mitigates that issue, but it's no guarantee yes. This is the same path taken in pnpm.fetchDeps
#290715
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@winterqt what is your reply on the arguments laid out above? Your "request for changes" is blocking the merge of the PR, and this is the only review comment left really open.
86534dd
to
6d5d7f6
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In general, I am very against adding these kinds of fetchers. (I was too burnt out to comment much on the pnpm situation, and I wish I had the spoons to object more there.)
Upstream makes no guarantees as to the stability of this, and as we've seen with Rust, this is a ticking time bomb.
As an alternative, we could try to e.g. create our own PnP-based fetcher or something? It takes a lot of time to do things properly as various edge cases are discovered, though, and time isn't something I have a lot of at the moment.
If this is to be merged in its current state, please address the issues I've commented on in this review. However, I would strongly recommend against doing so.
@@ -0,0 +1,16 @@ | |||
{ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should add a comment somewhere so that people avoid touching this file/are very careful when doing it. Ideally we'd be able to use JSON with comments...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This file means that we can never expand our platform support without mass FOD breaks across the tree? That seems very bad. There are already platforms you can boot NixOS on that would be ruled out by this file. At the very least we should try to list every value already used in npm here so that it’s relatively safe to expand in future?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since JSON and comments don't agree (and we also read the file directly), I added a Readme.md
to the folder
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This file means that we can never expand our platform support without mass FOD breaks across the tree?
yes
That seems very bad. There are already platforms you can boot NixOS on that would be ruled out by this file. At the very least we should try to list every value already used in npm here so that it’s relatively safe to expand in future?
We did add all the relevant platforms, cpu's and libc's (see https://yarnpkg.com/configuration/yarnrc#supportedArchitectures) that are available. The only one missing is win32
and I don't think we need to add this
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If it’s everything Yarn supports then I guess it’s not so bad – assuming we will be able to add additional platforms if Yarn does support more in future without invalidating every FOD?
I do think we should include Windows. You can use Nixpkgs to cross‐build a lot of stuff for Windows today, and people are working on native bring‐up. What’s the harm?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done.
On a sidenode: I did neither have to change the test hashes, nor the hash for the (seriously complex and long) yarn pgadmin offlineCache. (and yes, I obviously invalidated them, re-downloaded the packages and compared the hashes)
So maybe this won't be such a big deal after all
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I thought the idea is that packages can’t include any platforms other than the ones Yarn supports, so no existing FOD should change as long as we keep up with Yarn?
Yes, I think your right. My guess is that adding another platform or compiler will a huge undertaking anyway, so maybe they will publish yet another big version upgrade, which would make it easier for us (e.g. yarn-berry_5
)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Unfortunately, I think I was correct here:
Or can packages on npm reference other platforms and Yarn just can’t support fetching them?
The npm docs imply that the fields are quite freeform; e.g. the cpu
example uses mips
as an example. That means that this is just a limitation of Yarn’s configuration format rather than a constraint on the actual packages in existence, and there are packages that could change in the future if we add new platforms like RISC‐V.
The correct solution here would be to fetch everything independent of platform – theoretically Yarn could add all
values here to opt in to fetching everything, but writing our own deterministic fetcher would make it easy and also obviate the other reproducibility concerns. If neither of those is an option, perhaps we can find a way to check that nothing in the dependency tree references any platform we haven’t included in this file, and otherwise bail out? If we can’t do even that, then it’d be good if someone could write a script to scan the npm repository and collect every value that is used in practice so that we can include them all from the start and reduce the scope of potential changes.
Because we’re adding fundamental packaging infrastructure here that is part of our public interface and will surely be relied upon by out‐of‐tree packages, and changing FOD hashes are very difficult to spot and debug, I don’t think we’ll have the option of just breaking it in the future, unless we can show that the backwards incompatibility would be entirely theoretical in the existing public corpus.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Unfortunately, I think I was correct here:
Or can packages on npm reference other platforms and Yarn just can’t support fetching them?
The npm docs imply that the fields are quite freeform; e.g. the
cpu
example usesmips
as an example. That means that this is just a limitation of Yarn’s configuration format rather than a constraint on the actual packages in existence, and there are packages that could change in the future if we add new platforms like RISC‐V.
Actually, it is not a limitation of the configuration, as it can be adapted to include everything node supports. https://yarnpkg.com/configuration/yarnrc#supportedArchitectures says:
List of CPU architectures to cover.
See https://nodejs.org/docs/latest/api/process.html#processarch for the architectures supported by Node.js
which states:
The operating system CPU architecture for which the Node.js binary was compiled. Possible values are: 'arm', 'arm64', 'ia32', 'loong64', 'mips', 'mipsel', 'ppc', 'ppc64', 'riscv64', 's390', 's390x', and 'x64'.
So we can include all of those for our supported-archs
. Adding this, btw, didn't change the cache for pgadmin, either.
I looked into how yarn-berry
does its comparisons (since these values are for the local .yarnrc.toml
config file) and it compares it to upstreams package.json
file. This file has the same fields (https://yarnpkg.com/configuration/manifest#cpu) and, more importantly, only the libc
and os
fields we already have.
The correct solution here would be to fetch everything independent of platform – theoretically Yarn could add
all
values here to opt in to fetching everything, but writing our own deterministic fetcher would make it easy and also obviate the other reproducibility concerns.
Yes, writing our own fetcher could be a solution. This will be quite a task to parse the yarn.lock
file and generate the cache. I really feel uncomftable reinventing the wheel, though. We do have a package manager, which already parses this file and also controlls the specs about our architectures. I don't want to write yet another solution of yarn2nix
If neither of those is an option, perhaps we can find a way to check that nothing in the dependency tree references any platform we haven’t included in this file, and otherwise bail out?
There seems to be an option (yarn info --all --manifest --json
) which prints all the manifests, but not one had a cpu
, libc
or os
field. There is another option (yarn npm info _packageName_ --json
) which prints the upstream manifest and from the quick peek I took, didn't include any of those, either.
If we can’t do even that, then it’d be good if someone could write a script to scan the npm repository and collect every value that is used in practice so that we can include them all from the start and reduce the scope of potential changes.
Because we’re adding fundamental packaging infrastructure here that is part of our public interface and will surely be relied upon by out‐of‐tree packages, and changing FOD hashes are very difficult to spot and debug, I don’t think we’ll have the option of just breaking it in the future, unless we can show that the backwards incompatibility would be entirely theoretical in the existing public corpus.
Totally agree. This is why we added versions here. Not only because upstream has two supported versions (3 and 4), but also because their yarn.lock
files differ.
If there is a situation where either the file format changes, or the supported architectures, I would suggest adding another version (e.g. yarn-berry_5
or yarn-berry_39
). This will keep all the hashes intakt, be reproducible and still be able to adapt to future changes.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The npm docs imply that the fields are quite freeform; e.g. the
cpu
example usesmips
as an example. That means that this is just a limitation of Yarn’s configuration format rather than a constraint on the actual packages in existence, and there are packages that could change in the future if we add new platforms like RISC‐V.
The documentation says these values are based on Node process.{arch,cpu}
which has a known documented set of values - namely platforms supported by nodejs. I wouldn't exactly call it freeform.
I don't really have a better idea on how to handle adding new platforms other than invalidating the FOD (by changing name
) and fixing up all the failed builds, or introducing a new builder version like @gador suggested above.
I wonder how pnpm
managed to avoid this issue. I don't see it handling supportedArchitectures
at all.
$TMP should be univsal for all derivations and should work on darwin as well, if it ever switches to another build-dir (e.g. /nix/) see NixOS#355053 (comment) Signed-off-by: Florian Brandes <florian.brandes@posteo.de>
a94f6c4
to
d0dc8fa
Compare
$TMP should be univsal for all derivations and should work on darwin as well, if it ever switches to another build-dir (e.g. /nix/) see NixOS#355053 (comment) Signed-off-by: Florian Brandes <florian.brandes@posteo.de>
9312a29
to
7a126ea
Compare
7a126ea
to
cf9593e
Compare
$TMP should be univsal for all derivations and should work on darwin as well, if it ever switches to another build-dir (e.g. /nix/) see NixOS#355053 (comment) Signed-off-by: Florian Brandes <florian.brandes@posteo.de>
@emilazy I know this is a controversial PR, but I'd love to here your thoughts on #355053 (comment) |
Signed-off-by: Florian Brandes <florian.brandes@posteo.de>
$TMP should be univsal for all derivations and should work on darwin as well, if it ever switches to another build-dir (e.g. /nix/) see NixOS#355053 (comment) Signed-off-by: Florian Brandes <florian.brandes@posteo.de>
A new argument has been added to fetch-yarn-deps: `yarnVersion` which will default to `1` which uses the original `yarn`. For newer `yarn-berry` `yarn.lock`, one can set the `yarnVersion` to either `3` or `4` which will use yarn-berry3 or yarn-berry4 respectively. The difference is the `yarn.lock` file, which follows a different format, depending on the version. This also adds the corresponding tests. The added support for yarn.lock files > version 1 has been inspired by @szlend from NixOS#254369 (comment) fixes NixOS#254369 Signed-off-by: Florian Brandes <florian.brandes@posteo.de> Co-authored-by: Doron Behar <doron.behar@gmail.com>
Signed-off-by: Florian Brandes <florian.brandes@posteo.de>
Signed-off-by: Florian Brandes <florian.brandes@posteo.de>
Signed-off-by: Florian Brandes <florian.brandes@posteo.de>
Signed-off-by: Florian Brandes <florian.brandes@posteo.de>
cf9593e
to
bad8ab6
Compare
@emilazy I see that there is no activity from you since end of November. I'd still love to get your feedback to my reply above. In the meantime: @SuperSandro2000 could you also have a look at this? I'd like to get this merged and I believe I addressed most of the issues concerning stability and reliability in my comment above |
Problem
Currently there is no good way to use newer
yarn.lock
files because they rely onyarn-berry
and cannot be used together with pre-existing tooling likefetch-yarn-deps
.There is an (abandoned? ) project which converts
v1
files tov2
files with some caveats. This does require an internet connection and cannot be used in an isolated builder.So one needs to convert the
yarn.lock
file and commit it tonixpkgs
to be used for an offline cache.There is an issue open (#254369) which discusses some ways around it. The most sophisticated way was by @szlend and this PR builds on top of it to extend to be included in our new
yarnConfigHook
from #318015Implementation of a solution
A new argument has been added to fetch-yarn-deps:
yarnVersion
which will default to1
which uses the originalyarn
.For newer
yarn.lock
files, one can set theyarnVersion
to either3
or4
which will use yarn-berry3 or yarn-berry4 respectively. We need to separate both versions, because they have some small changes in the way the lock file is created.This also adds the corresponding tests.
How to use it
For an implementation see the included
pgadmin4
derivation, which was also my main motivation for this. I had to build a rather ugly and complicated update script before and do a lot of custom work to build the frontend. This is now simplified.In short, add
yarnBerry3ConfigHook
tonativeBuildInputs
andto your derivation.
fixes #254369
Once we agree on the implantation, I'll add the documentation.
Things done
nix.conf
? (See Nix manual)sandbox = relaxed
sandbox = true
nix-shell -p nixpkgs-review --run "nixpkgs-review rev HEAD"
. Note: all changes have to be committed, also see nixpkgs-review usage./result/bin/
)Add a 👍 reaction to pull requests you find important.