Additional packages for UW NASA Hackweek 2024 #118

Open
9 tasks done
scottyhq opened this issue Jul 25, 2024 · 10 comments

@scottyhq
Contributor

scottyhq commented Jul 25, 2024

In prep for https://2024.hackweek.io using CryoCloud, we have requests for the latest releases of the following packages in the default environment. This is a tracking issue, and I might add more. We can open separate PRs for each. cc @jomey

@jomey

jomey commented Jul 25, 2024

CC: @micah-prime
To coordinate a snowexsql release before the hackweek

@micah-prime

Linking the issue for getting snowexsql on conda-forge: SnowEx/snowexsql#93

scottyhq added a commit that referenced this issue Aug 1, 2024
This relaxes many exact pins and updates a lot of commonly used
scientific python packages to more recent versions

addresses #118
@scottyhq
Contributor Author

scottyhq commented Aug 6, 2024

Adding a request for https://pixi.sh/latest/

(ICESAT-2HackWeek/website-2024#13)

@meganmason

meganmason commented Aug 8, 2024

Adding a request for itables

@weiji14
Member

weiji14 commented Aug 14, 2024

@scottyhq, do you reckon it's ok to add PyTorch (CPU build) into hub-image so that the Hackweek participants can run the notebook at ICESAT-2HackWeek/website-2024#17 without having to install it using a `conda install -y pytorch` code cell? I'm not sure if we want 10 or 20+ people launching the GPU machine with the Pangeo pytorch-notebook image for an hour-long tutorial.

@scottyhq
Contributor Author

> do you reckon it's ok to add PyTorch (CPU build)

Fine with me! At some point it'll probably be necessary to clarify (maybe here https://book.cryointhecloud.com/content/hub_best_practices.html) whether the default environment is meant to be minimal (plus bring your own environment) or to include everything and the kitchen sink.

@weiji14
Member

weiji14 commented Aug 15, 2024

Ok, started the PR at #125 to add more packages in. Do we still need pixi given that ICESAT-2HackWeek/website-2024#13 was closed?

@tsnow03
Member

tsnow03 commented Aug 15, 2024

PyTorch is kinda huge though isn't it? We've been keeping the ML stuff, especially the more advanced libraries that fewer people use, separate to streamline the time it takes to load the image. So I'm a little more inclined to keep that in the separate image. Our main image should be things that a large swath of our users will be able to use, but kept as small as possible otherwise. Then people can build permanent kernels or, soon, be able to build their own images automatically from an environment/requirements and use those instead.

@spestana

Adding a request for hydroeval

@weiji14
Member

weiji14 commented Aug 16, 2024

> PyTorch is kinda huge though isn't it? We've been keeping the ML stuff, especially the more advanced libraries that fewer people use, separate to streamline the time it takes to load the image. So I'm a little more inclined to keep that in the separate image. Our main image should be things that a large swath of our users will be able to use, but kept as small as possible otherwise. Then people can build permanent kernels or, soon, be able to build their own images automatically from an environment/requirements and use those instead.

The CPU build of PyTorch won't pull in any of the CUDA libraries (which use up a lot of disk space), but the packages (pytorch, libtorch, etc.) will still be 100+ MB, so I can leave it out of the default image. I can point the Hackweek participants to the Pangeo pytorch-notebook image via the 'Bring your own image' option (instead of using the GPU node). It will be a bit more setup at the start, but hopefully ok.
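
As a rough illustration of the CPU-versus-GPU distinction discussed here, the sketch below reports whether PyTorch is importable in the current kernel and, if so, whether it can see a CUDA device. The function name `torch_status` is hypothetical and not part of any image in this thread; `torch.cuda.is_available()` is an existing PyTorch call.

```python
import importlib.util


def torch_status():
    """Return "missing", "cpu", or "gpu" for the current Python environment.

    Hypothetical helper for illustration only.
    """
    # find_spec avoids an ImportError when PyTorch is not installed,
    # which would be the case in an image that leaves it out.
    if importlib.util.find_spec("torch") is None:
        return "missing"

    import torch

    # A CPU-only build (or a node without a GPU) reports False here.
    return "gpu" if torch.cuda.is_available() else "cpu"


print(torch_status())
```

A tutorial notebook could run a check like this in its first cell and tell participants to switch images when the result is "missing".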

weiji14 added a commit that referenced this issue Aug 16, 2024
Keep the docker image small, xref #118 (comment)
scottyhq added a commit that referenced this issue Aug 16, 2024
Updating the conda environment.yml file to include more packages as
requested at #118.

Newly added:
- hydroeval=0.1.0
- itables=2.14
- jupyter-sshd-proxy=0.2.0
- py-xgboost=2.1.1 (cpu)
~- pytorch=2.3.1 (cpu)~
- stackstac=0.5.1

Extensions that are added back since they have new compatible releases:
- jupyter-offlinenotebook=0.3.1

Upgraded:
- code-server from 4.16.1 to 4.23.1
- snowexsql from 0.4.1 to 0.5.0 (so that we can get geopandas 1.0, xref
#121 (comment))
- icepyx from 1.1.0 to 1.2.0
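
One way to sanity-check an environment against a list like the one above is to compare the requested versions with what is actually installed. This is a minimal sketch, assuming the packages expose the same names to `importlib.metadata` as they have on conda-forge, which is not always true (for example, `py-xgboost` installs as `xgboost`). The `REQUESTED` mapping and the `check_pins` helper are hypothetical names, and only a subset of the packages is shown.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions copied from the commit message above (subset only).
# Assumption: distribution names match what importlib.metadata reports.
REQUESTED = {
    "hydroeval": "0.1.0",
    "stackstac": "0.5.1",
    "xgboost": "2.1.1",  # listed as py-xgboost on conda-forge
    "snowexsql": "0.5.0",
    "icepyx": "1.2.0",
}


def check_pins(pins):
    """Map each package name to (wanted, found, matches).

    `found` is None when the package is not installed.
    """
    report = {}
    for name, wanted in pins.items():
        try:
            found = version(name)
        except PackageNotFoundError:
            found = None
        report[name] = (wanted, found, found == wanted)
    return report


for name, (wanted, found, ok) in check_pins(REQUESTED).items():
    print(f"{name}: wanted {wanted}, found {found}, match={ok}")
```

Exact equality is a deliberately strict comparison; since the related commit relaxes many exact pins, a looser check (for example, on the major and minor version only) may fit better in practice.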
7 participants