
[FR] Default resource requirement/limits for the KFP UI and system services #5148

Closed
Bobgy opened this issue Feb 18, 2021 · 5 comments · Fixed by #5409
Labels: help wanted (The community is welcome to contribute.)

Comments

Bobgy (Contributor) commented Feb 18, 2021

UPDATE: In the end, we decided to add only resource requests (not limits); see the discussion in #5236 (comment).

It's desirable to provide a set of default resource requests & limits for the KFP UI & system services, to make sure their QoS class is Guaranteed by default.

https://kubernetes.io/docs/tasks/configure-pod-container/quality-service-pod/
I'm not exactly sure what values are reasonable, because if they are set too low, the services may stop operating once a workload pushes them up against a limit.
But setting them so that the QoS class is Guaranteed is also important: otherwise, when many other workloads are running, the KFP UI & API server Pods may be evicted, because their default QoS class is BestEffort, and BestEffort Pods are the first ones Kubernetes evicts when a node runs out of resources.
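
For reference, a minimal sketch of what the Kubernetes doc above describes: a Pod only gets the Guaranteed QoS class when every container sets both requests and limits and the two are equal. The container name and values here are purely illustrative, not a proposal for a specific KFP component:

    apiVersion: v1
    kind: Pod
    metadata:
      name: qos-demo            # illustrative name only
    spec:
      containers:
      - name: example
        image: nginx
        resources:
          requests:             # requests == limits => QoS class "Guaranteed"
            cpu: 500m
            memory: 256Mi
          limits:
            cpu: 500m
            memory: 256Mi

If only requests are set (the option chosen in the end, per the UPDATE above), the QoS class becomes Burstable instead of Guaranteed, which still ranks above BestEffort in eviction priority.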

Bobgy added the help wanted (The community is welcome to contribute.) label on Feb 18, 2021
Bobgy (Contributor, Author) commented Feb 19, 2021

Got some help from Sid Palas:

A couple of example request settings:

ml-pipeline (API server)
    requests:
      cpu: '2'
      memory: 4Gi

ml-pipeline-ui
    requests:
      cpu: 10m
      memory: 70Mi

workflow-controller (Argo)
    requests:
      cpu: 200m
      memory: 3Gi

minio
    requests:
      cpu: 20m
      memory: 25Mi

persistent-agent
    requests:
      cpu: 120m
      memory: 2Gi

see thread https://kubeflow.slack.com/archives/CE10KS9M4/p1613655024114300
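
As a hedged illustration of how values like these could be wired into the manifests, a kustomize strategic-merge patch along these lines could set the API-server request; the deployment and container names here are assumptions about the KFP manifests, and the values simply mirror the suggestion above:

    # ml-pipeline-resources-patch.yaml (sketch; reference it from
    # patchesStrategicMerge in the kustomization that deploys KFP)
    apiVersion: apps/v1
    kind: Deployment
    metadata:
      name: ml-pipeline                    # assumed API-server deployment name
    spec:
      template:
        spec:
          containers:
          - name: ml-pipeline-api-server   # assumed container name
            resources:
              requests:
                cpu: '2'
                memory: 4Gi

The other components (ml-pipeline-ui, workflow-controller, minio, persistent-agent) could get analogous patches with their respective values.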

NikeNano (Member) commented

According to the Argo documentation, Argo's memory and CPU usage scale linearly with the number of workflows, so users will probably have to adjust these values if they are running heavier workloads or want to reduce costs (see the patch sketch after this comment).

I would be happy to update this!

/assign
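
A sketch of the kind of per-cluster override described above: a user running heavier Argo workloads could raise the workflow-controller request in their own kustomization. The deployment/container names are assumptions and the values are arbitrary examples, not recommendations:

    # workflow-controller-resources-patch.yaml (illustrative values only)
    apiVersion: apps/v1
    kind: Deployment
    metadata:
      name: workflow-controller        # Argo controller shipped with KFP
    spec:
      template:
        spec:
          containers:
          - name: workflow-controller  # assumed container name
            resources:
              requests:
                cpu: 500m              # raised from the 200m suggested above
                memory: 6Gi            # raised from the 3Gi suggested above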

Bobgy (Contributor, Author) commented Feb 26, 2021

thank you @NikeNano

Bobgy (Contributor, Author) commented Apr 1, 2021

/reopen
after #5273, we need to apply the default resource requests to the Argo pods again
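
If "Argo pods" here also covers the executor containers that Argo injects into every workflow pod, their default requests can be set through the workflow-controller ConfigMap; a sketch under that assumption (the executor key follows the Argo workflow-controller-configmap docs, namespace and values are illustrative):

    apiVersion: v1
    kind: ConfigMap
    metadata:
      name: workflow-controller-configmap
      namespace: kubeflow              # assumed KFP namespace
    data:
      executor: |
        resources:
          requests:                    # defaults for the injected wait/init containers
            cpu: 10m
            memory: 64Mi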

google-oss-robot commented

@Bobgy: Reopened this issue.

In response to this:

/reopen
after #5273, we need to apply the default resource requests to the Argo pods again

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Bobgy added a commit that referenced this issue Apr 1, 2021
* fix(deployment): fix default resource requests

* fix mkp presubmit for rc version