Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ksonnet changes to support deploying the v1alpha2 TFJob operator. #851

Merged
merged 2 commits into from
May 24, 2018

Conversation

jlewi
Copy link
Contributor

@jlewi jlewi commented May 22, 2018

  • K8s doesn't support installing multiple versions of a CRD in the same cluster.

  • So we add an option to choose which version to support.

  • We don't add an E2E test in this PR because its not straightforward.
    Currently the E2E test deploys Kubeflow in a different namespace but not
    a different cluster. This won't allow us to simulatenously test both
    versions.

  • So to test multiple versions we will need to spin up a separate GKE cluster.

Related to: kubeflow/training-operator#599


This change is Reviewable

* K8s doesn't support installing multiple versions of a CRD in the same cluster.
* So we add an option to choose which version to support.
* We don't add an E2E test in this PR because its not straightforward.
  Currently the E2E test deploys Kubeflow in a different namespace but not
  a different cluster. This won't allow us to simulatenously test both
  versions.

* So to test multiple versions we will need to spin up a separate GKE cluster.

Related to: kubeflow#599
@jlewi
Copy link
Contributor Author

jlewi commented May 22, 2018

/assign @gaocegege

@jlewi
Copy link
Contributor Author

jlewi commented May 24, 2018

@gaocegege PTAL

@gaocegege
Copy link
Member

/lgtm
/approve

@gaocegege
Copy link
Member

/lgtm
/approve

@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: gaocegege

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@jlewi
Copy link
Contributor Author

jlewi commented May 24, 2018

Test failure looks like some random networking issue

Cloning into '/mnt/test-data-volume/kubeflow-presubmit-kubeflow-e2e-minikube-851-b67da5b-1642-e8ae/src/kubeflow/tf-operator'...
error: RPC failed; curl 56 GnuTLS recv error (-54): Error in the pull function.
fatal: The remote end hung up unexpectedly
fatal: early EOF
fatal: index-pack failed

@jlewi
Copy link
Contributor Author

jlewi commented May 24, 2018

/retest

@k8s-ci-robot k8s-ci-robot merged commit 01ccdff into kubeflow:master May 24, 2018
saffaalvi pushed a commit to StatCan/kubeflow that referenced this pull request Feb 11, 2021
…beflow#851)

* ksonnet changes to support deploying the v1alpha2 TFJob operator.

* K8s doesn't support installing multiple versions of a CRD in the same cluster.
* So we add an option to choose which version to support.
* We don't add an E2E test in this PR because its not straightforward.
  Currently the E2E test deploys Kubeflow in a different namespace but not
  a different cluster. This won't allow us to simulatenously test both
  versions.

* So to test multiple versions we will need to spin up a separate GKE cluster.

Related to: kubeflow#599

* Autoformat jsonnet.
yanniszark pushed a commit to arrikto/kubeflow that referenced this pull request Feb 15, 2021
* chore: Speedup local build.sh

Signed-off-by: Ce Gao <gaoce@caicloud.io>

* chore: Update liveness probe

Signed-off-by: Ce Gao <gaoce@caicloud.io>

* chore: Ignore frontend build

Signed-off-by: Ce Gao <gaoce@caicloud.io>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants