[Testing] postsubmit mkp test failure 2021.3.4 #5236
Comments
Step #1 - "verify": 37s Warning FailedScheduling pod/ml-pipeline-55bbc45946-m7s66 0/3 nodes are available: 2 Insufficient memory, 3 Insufficient cpu.
It seems that the added resource requests make it impossible to schedule the ml-pipeline pod in the cluster.
The default cluster for marketplace has 3 nodes, each with 2 CPUs and 3GB memory allocatable. We need to reduce the requests to fit into the default cluster.
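To make the scheduling failure concrete, here is a minimal sketch of the fit check the scheduler effectively performs: a pod fits on a node only if its requests are no larger than what is left after subtracting the requests of pods already on that node. Only the per-node allocatable figures (2 CPUs, 3GB) come from this issue. The free capacity per node and the request values below are hypothetical, chosen only to reproduce the shape of the event above, and 3GB is approximated as 3072 MiB.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Resources:
    cpu_millis: int   # CPU in millicores (1000m = 1 CPU)
    memory_mib: int   # memory in MiB


def fits(free: Resources, request: Resources) -> bool:
    """A pod fits only if both its CPU and memory requests fit."""
    return (request.cpu_millis <= free.cpu_millis
            and request.memory_mib <= free.memory_mib)


def schedulable_nodes(free_per_node, request):
    """Indices of nodes with enough unrequested capacity for the pod."""
    return [i for i, free in enumerate(free_per_node) if fits(free, request)]


def failure_summary(free_per_node, request):
    """Count nodes short on each resource, like the FailedScheduling event."""
    return {
        "insufficient_cpu": sum(
            request.cpu_millis > n.cpu_millis for n in free_per_node),
        "insufficient_memory": sum(
            request.memory_mib > n.memory_mib for n in free_per_node),
    }


# From the issue: each of the 3 nodes has 2 CPUs and 3GB allocatable.
ALLOCATABLE = Resources(cpu_millis=2000, memory_mib=3072)

# Hypothetical: capacity left on each node after other pods' requests.
FREE = [
    Resources(cpu_millis=300, memory_mib=400),
    Resources(cpu_millis=250, memory_mib=300),
    Resources(cpu_millis=400, memory_mib=1500),
]

# Hypothetical request values, not the ones from the actual manifest.
REQUEST = Resources(cpu_millis=500, memory_mib=500)
REDUCED = Resources(cpu_millis=250, memory_mib=256)

if __name__ == "__main__":
    print(schedulable_nodes(FREE, REQUEST), failure_summary(FREE, REQUEST))
    print(schedulable_nodes(FREE, REDUCED))
```

With these made-up numbers the original request fits nowhere, while the reduced one fits on every node. The real values depend on what else is already running in the default cluster.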
/cc @NikeNano
/assign I'll fix this
Another issue to address: the mkp test should be triggered on presubmit if a PR touches the MKP manifest. Let me add the auto trigger.
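As a rough illustration of the auto trigger, the sketch below runs the test only when a changed file path matches a regex, which is how I understand Prow's `run_if_changed` presubmit field to behave. The path pattern and the function name are assumptions made for illustration, not the actual job configuration.

```python
import re

# Assumption: the Marketplace manifests live under this directory.
# Adjust the pattern to the real repository layout.
MKP_PATTERN = re.compile(r"^manifests/gcp_marketplace/")


def should_run_mkp_test(changed_files):
    """Return True if any changed path matches the MKP manifest pattern."""
    return any(MKP_PATTERN.search(path) for path in changed_files)


if __name__ == "__main__":
    # Hypothetical file lists from two PRs.
    print(should_run_mkp_test(["manifests/gcp_marketplace/chart/values.yaml"]))
    print(should_run_mkp_test(["backend/src/apiserver/main.go"]))
```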
Some reading on requests & limits: https://cloud.google.com/blog/products/containers-kubernetes/kubernetes-best-practices-resource-requests-and-limits
I looked into other OSS projects: Argo doesn't provide default requests/limits for the most part, see https://github.com/argoproj/argo-workflows/tree/master/manifests
I think we'll need operator documentation that tells people how to adjust their resource requests, but maybe as a next step.
I think this sounds good, especially if we could also provide some ballpark figures. I guess, however, that if we set them too high, we will request more resources than most people actually use.
Caused by #5148
First failing mkp test: https://oss-prow.knative.dev/view/gs/oss-prow/logs/kubeflow-pipeline-postsubmit-mkp-e2e-test/1365452157549023232
Root cause seems to be: #5158