Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Policy on adding new images #1958

Closed
mathbunnyru opened this issue Aug 2, 2023 · 6 comments · Fixed by #2016
Closed

Policy on adding new images #1958

mathbunnyru opened this issue Aug 2, 2023 · 6 comments · Fixed by #2016
Labels
type:Maintenance A proposed enhancement to how we maintain this project

Comments

@mathbunnyru
Copy link
Member

mathbunnyru commented Aug 2, 2023

It would be nice to come up with a policy when we add new images and when we don't.
#1936

Another question is - do we allow ourselves to remove some images (stop building new versions) for some image.

@consideRatio
Copy link
Collaborator

consideRatio commented Aug 8, 2023

An incomplete history

I was thinking that we could extract some historical decisions for reference going onwards and arrived to this as a starting point.

@mathbunnyru
Copy link
Member Author

Thanks @consideRatio. This is really helpful.

@mathbunnyru
Copy link
Member Author

mathbunnyru commented Aug 11, 2023

So, I have a list of a few things we should consider when adding new images (I also express my own opinion here and people might disagree with it and it's ok):

  1. Popularity. We should not add a separate notebook, which will contain just YOUR_FAVOURITE_RARE_PACKAGE_HERE, just because it's someone's favourite package. If this package is really popular, it's good. The software added should be modern and well-supported.
  2. Added value and potential user database growth.
  3. Consistency with current images. I don't think we should add something completely different, for example, an image with C++ Jupyter Kernel.
  4. Maintenance and especially build time. Since we're using Docker, what's inside the container doesn't affect our build workflows (most of the time). But some images might be heavy to build, especially since we upload/download images as archives.
  5. Any special hardware/equipment - I don't think we should be adding ppc64le images, for example. On the other side, I'm ok adding GPU images stack, if someone is ready to implement this without sacrificing build time (it's possible, but needs some work to do) and if the license allows us to do so.
  6. We should accept the idea that we can't live in a model where "our images fit 95% of needs". I think the correct model - you should choose an image closest to your needs and then install a few packages on top of this image.
  7. I think we should stick to the philosophy of using the latest software available. It worked quite well for us.

@mathbunnyru
Copy link
Member Author

mathbunnyru commented Aug 11, 2023

The message above is my opinion (which might easily change) and not a statement 🙂
Please, feel free ti share your ideas.

@mathbunnyru mathbunnyru pinned this issue Aug 12, 2023
@twalcari
Copy link
Contributor

With regards to point 5: adding GPU-support is non-trivial because the version of CUDA that you bundle is an extra confounding factor. For example, PyTorch has separate package indexes for CUDA 11.7 and 11.8. Do you bundle cuDDN or not? When the version of CUDA bundled into the docker image is newer than what's supported by the Nvidia drivers on the host system, it will crash, etc.

@mathbunnyru
Copy link
Member Author

Thanks, @twalcari, that's a good point.

I think we need a way to build all the images tree in parallel.
And then to be able to tag the whole tree correctly.

For example, we can build the images tree with simple Ubuntu (as now), and also build the images tree for cuda 11.7 (and add some tag prefix like cuda11.7- for all these images) and cuda 11.8.
And have options for cuDDN as well.
Obviously, we can build all this in parallel.

So, if someone wants to add templating based on the root image, even without adding gpu-support, it seems to be a nice feature to have.

We will also have to take a look at licensing of cuda-based images, I think one of the reasons they were not added was not being able to redistribute the images because of the cuda license (I'm not sure about this one).

@mathbunnyru mathbunnyru added the type:Maintenance A proposed enhancement to how we maintain this project label Sep 10, 2023
@mathbunnyru mathbunnyru unpinned this issue Oct 28, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type:Maintenance A proposed enhancement to how we maintain this project
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants