Develop a prow migration plan #752

Closed
spiffxp opened this issue Apr 15, 2020 · 25 comments
Labels
area/infra Infrastructure management, infrastructure design, code in infra/ priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. sig/k8s-infra Categorizes an issue or PR as relevant to SIG K8s Infra. sig/testing Categorizes an issue or PR as relevant to SIG Testing.

Comments

@spiffxp
Member

spiffxp commented Apr 15, 2020

Break out into issues when we've gotten consensus

Current sketch is:

  • decide whether we want a build cluster, and/or how many we want
  • allow prow.k8s.io to schedule jobs on our build cluster(s)
  • set up a prow instance (in aaa?)
  • figure out how we want to migrate/cut-over github traffic

Some concerns include:

  • jobs that run in the same cluster as prow.k8s.io to take advantage of the ghproxy instance that lives there
  • using a new github bot user/token so as not to conflict with k8s-ci-robot while prow.k8s.io continues to use it
  • setting up boskos in our build cluster(s)
  • setting up greenhouse in our build cluster(s)
  • setting up projects for boskos to manage
  • migration of google-owned buckets like kubernetes-jenkins
  • on-call
@spiffxp
Member Author

spiffxp commented Apr 15, 2020

/wg k8s-infra
/sig testing
/area cluster-infra
/assign

@k8s-ci-robot k8s-ci-robot added wg/k8s-infra sig/testing Categorizes an issue or PR as relevant to SIG Testing. area/infra Infrastructure management, infrastructure design, code in infra/ labels Apr 15, 2020
@spiffxp
Member Author

spiffxp commented Apr 27, 2020

I prototyped a single build cluster here #806

I have it hooked up to my personal prow instance here: https://prow.bashfire.dev

@spiffxp
Member Author

spiffxp commented Apr 28, 2020

Trying to work through what the plan looks like. It's unlikely we'll get to discussing the actual "migrate prow.k8s.io" part in much detail this week.

https://docs.google.com/document/d/1g7RBDhbTAH1VBMl2kJZR71OfzM6cJF277zbysd2AoCQ/edit#

@spiffxp
Member Author

spiffxp commented Apr 29, 2020

Discussed during today's meeting, came away with:

  • We'll do two build clusters, each in their own project (trusted, untrusted)
  • Untrusted build cluster will use same machine type as used by prow.k8s.io's existing build cluster (n1-highmem-8)
  • Will use regional clusters
  • Will start with e2e project pool that is not sig/job-specific
    • Adjust when costs get to unknown/unexplained size
    • Consider per-sig project pools
  • Naming Is Hard:
    • k8s-infra-prow-build - untrusted build cluster (and project, serviceaccount, etc)
    • k8s-infra-prow-build-trusted - trusted build cluster (and project, serviceaccount, etc)
    • k8s-infra-e2e-{suffix} - e2e projects intended to be pinned to a job
    • k8s-infra-e2e-boskos-{nnn} - e2e projects intended to be managed by boskos
  • What I'm aiming for in two weeks:
    • clusters hooked up to prow.k8s.io
    • confirmed migration of:
      • ci-kubernetes-node-kubelet
      • ci-kubernetes-e2e-gci-gce
      • (an image pushing job?)
    • dev/test cycle for migrating jobs
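
To illustrate the naming scheme above, here is a minimal Python sketch that builds and classifies the names. It is not part of any existing tooling: the function names, the category labels, and the readings of `{nnn}` (a zero-padded three-digit index) and `{suffix}` (lowercase letters, digits, and hyphens) are assumptions made for the example.

```python
import re

# Names taken from the list above.
UNTRUSTED_BUILD = "k8s-infra-prow-build"
TRUSTED_BUILD = "k8s-infra-prow-build-trusted"

# Assumption: {nnn} is a zero-padded three-digit index.
BOSKOS_E2E = re.compile(r"^k8s-infra-e2e-boskos-(\d{3})$")
# Assumption: {suffix} is lowercase letters, digits, and hyphens.
PINNED_E2E = re.compile(r"^k8s-infra-e2e-([a-z0-9][a-z0-9-]*)$")


def boskos_project(index: int) -> str:
    """Return the name of a boskos-managed e2e project (hypothetical helper)."""
    if not 0 <= index <= 999:
        raise ValueError("index must fit in three digits")
    return f"k8s-infra-e2e-boskos-{index:03d}"


def classify(name: str) -> str:
    """Map a name to a category label (labels are illustrative only)."""
    if name == UNTRUSTED_BUILD:
        return "untrusted-build"
    if name == TRUSTED_BUILD:
        return "trusted-build"
    # Check the boskos pattern first: such names also match the pinned pattern.
    if BOSKOS_E2E.match(name):
        return "boskos-e2e"
    if PINNED_E2E.match(name):
        return "pinned-e2e"
    return "unknown"
```

Checking the boskos pattern before the pinned pattern matters, because every `k8s-infra-e2e-boskos-{nnn}` name is also a syntactically valid `k8s-infra-e2e-{suffix}` name.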

@spiffxp
Member Author

spiffxp commented May 6, 2020

#830 implements all of the above except dev/test cycle

@spiffxp
Member Author

spiffxp commented May 6, 2020

Opened up some more issues to represent followup work:

@MHBauer

MHBauer commented May 27, 2020

Learning about sig-node ci, can I help with or shadow along changes to ci-kubernetes-node-kubelet?

@spiffxp
Member Author

spiffxp commented Jun 12, 2020

@MHBauer add yourself to the k8s-infra-prow-viewers@kubernetes.io group in groups/groups.yaml and let me know if you can see https://console.cloud.google.com/monitoring/dashboards?project=k8s-infra-prow-build

@helenfeng737
Contributor

I can help with the migration work too if needed. Already added myself in k8s-infra-prow-viewers

@spiffxp
Member Author

spiffxp commented Oct 28, 2020

About ready to enumerate:

  • setting up a staging prow instance to experiment with running in community infra
  • how to shift traffic from prow.k8s.io

@spiffxp
Member Author

spiffxp commented Jan 20, 2021

/assign @ameukam
since also working on staging prow (ref: #1394)

@spiffxp
Member Author

spiffxp commented Jan 22, 2021

/priority important-longterm
I don't know how much time I'll have to work on completing a plan by end of v1.21

@spiffxp
Member Author

spiffxp commented Sep 29, 2021

The proposal linked in this issue likely needs to be refreshed, but dropping a few thoughts before I forget.

Blockers to migrating all jobs over to a community-run build cluster include:

Blockers to shutting down prow.k8s.io and then standing it all up over in kubernetes-public include:

  • all of the existing kubernetes/test-infra tooling / manifests assume deployment to its own cluster vs. a shared cluster
  • allowing a community-owned prow to trigger builds in the google.com-owned "default" build cluster (if we fail to migrate all jobs over)
  • allowing a community-owned prow to write/update gs://k8s-testgrid
  • allowing a community-owned prow to write/update gs://kubernetes-jenkins
  • allowing a community-owned prow to write to the google.com-owned k8s-prow project
  • prow.k8s.io currently runs jobs for more than the kubernetes community (see "Migrate non-Kubernetes repos off of prow.k8s.io", test-infra#12863)
  • (not sure what else I'm missing here...)

@k8s-ci-robot k8s-ci-robot added sig/k8s-infra Categorizes an issue or PR as relevant to SIG K8s Infra. and removed wg/k8s-infra labels Sep 29, 2021
@ameukam
Member

ameukam commented Dec 6, 2021

/milestone v1.24

@k8s-ci-robot k8s-ci-robot modified the milestones: v1.23, v1.24 Dec 6, 2021
@ameukam ameukam removed this from the v1.24 milestone Mar 22, 2022
@ameukam
Member

ameukam commented Mar 3, 2024

/milestone v1.30
/lifecycle clear

Rebooted this effort. See: https://docs.google.com/document/d/1erBhuCwY26d0UfPbzt8lEj6bYT2hOUKzc2j36YHVqfM/edit?usp=sharing

@k8s-ci-robot k8s-ci-robot added this to the v1.30 milestone Mar 3, 2024
@ameukam ameukam removed the lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. label Mar 6, 2024
@ameukam
Member

ameukam commented Apr 19, 2024

/milestone v1.31

@k8s-ci-robot k8s-ci-robot modified the milestones: v1.30, v1.31 Apr 19, 2024
@ameukam ameukam moved this to In Progress in SIG K8S Infra Jul 8, 2024
@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle stale
  • Close this issue with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jul 18, 2024
@ameukam
Member

ameukam commented Jul 18, 2024

/remove-lifecycle stale

We have a timeline for this and an overall plan to do the migration.
Technical requirements need to be discussed.

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jul 18, 2024
@ameukam
Member

ameukam commented Aug 21, 2024

@k8s-ci-robot
Contributor

@ameukam: Closing this issue.

In response to this:

We have a migration plan established: https://docs.google.com/document/d/1erBhuCwY26d0UfPbzt8lEj6bYT2hOUKzc2j36YHVqfM/edit

Execution is planned: https://groups.google.com/a/kubernetes.io/g/dev/c/qzNYpcN5la4

cc @BenTheElder @michelle192837

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

Projects
Status: Done
Development

No branches or pull requests

7 participants