flaking unit test in `TestReconcileMachinePoolMachines` #11070

cahillsf · 2024-08-19T22:04:45Z

Which jobs are flaking?

these failures are apparent in periodic-cluster-api-test-mink8s-main and periodic-cluster-api-test-main

Which tests are flaking?

TestReconcileMachinePoolMachines/Reconcile_MachinePool_Machines/Should_create_two_machines_if_two_infra_machines_exist

Since when has it been flaking?

at least since 20214-07-06: https://storage.googleapis.com/k8s-triage/index.html?date=2024-07-20&text=TestReconcileMachinePoolMachines%2FReconcile_MachinePool_Machines%2FShould_create_two_machines_if_two_infra_machines_exist&job=.*cluster-api.*(test%7Ce2e)-(mink8s-)*main&xjob=.*-provider-.*

Testgrid link

https://prow.k8s.io/view/gs/kubernetes-jenkins/logs/periodic-cluster-api-test-mink8s-main/1824877164462346240

Reason for failure (if possible)

No response

Anything else we need to know?

No response

Label(s) to be applied

/kind flake
One or more /area label. See https://github.com/kubernetes-sigs/cluster-api/labels?q=area for the list of labels.

The text was updated successfully, but these errors were encountered:

cahillsf · 2024-08-19T22:05:23Z

/area machinepool

sbueringer · 2024-08-21T12:34:21Z

Yup. I saw a bunch of flakes around MachinePool unit tests as well

/triage accepted

/help

k8s-ci-robot · 2024-08-21T12:34:23Z

@sbueringer:
This request has been marked as needing help from a contributor.

Guidelines

Please ensure that the issue body includes answers to the following questions:

Why are we solving this issue?
To address this issue, are there any code changes? If there are code changes, what needs to be done in the code and what places can the assignee treat as reference points?
Does this issue have zero to low barrier of entry?
How can the assignee reach out to you for help?

For more details on the requirements of such an issue, please see here and ensure that they are met.

If this request no longer meets these requirements, the label can be removed
by commenting with the /remove-help command.

In response to this:

Yup. I saw a bunch of flakes around MachinePool unit tests as well

/triage accepted

/help

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

cahillsf · 2024-09-01T16:40:30Z

/assign cahillsf

cannot reproduce this issue locally, have opened a draft that seems to use preferred methods in this unit test, see PR for details. hopefully this will improve the stability of this test

sbueringer · 2024-09-02T09:06:06Z

Would be great if some folks familiar with Machine Pools / MachinePool Machines can review #11124

(cc @Jont828 @willie-yao)

sbueringer · 2024-09-02T16:56:08Z

/reopen

I assume we want to keep this issue open for now as we're not sure if the PR will fix all flakes

k8s-ci-robot · 2024-09-02T16:56:12Z

@sbueringer: Reopened this issue.

In response to this:

/reopen

I assume we want to keep this issue open for now as we're not sure if it will fix all flakes

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

cahillsf · 2024-09-02T17:17:18Z

/reopen

I assume we want to keep this issue open for now as we're not sure if the PR will fix all flakes

Yep sounds good, will track the test and revisit

edit: adding k8s-triage link https://storage.googleapis.com/k8s-triage/index.html?text=TestReconcileMachinePoolMachines&job=.*cluster-api-(test%7Ce2e)-(mink8s-)*main

cahillsf · 2024-09-18T16:44:04Z

revisiting this, test hasn't flaked since 9/1 prior to the attempted fix being merged: https://storage.googleapis.com/k8s-triage/index.html?date=2024-09-15&text=TestReconcileMachinePoolMachines&job=.*cluster-api-(test%7Ce2e)-(mink8s-)*main

if we update the date for today the failures are out of the default lookback window: https://storage.googleapis.com/k8s-triage/index.html?date=2024-09-18&text=TestReconcileMachinePoolMachines&job=.*cluster-api-(test%7Ce2e)-(mink8s-)*main

not sure how long we want to wait before closing out this issue @sbueringer ?

sbueringer · 2024-09-19T07:33:51Z

I think we can close the issue, the flake was pretty frequent before, so I think we have enough data to be sure it's fixed.

Thx for fixing this flake!

/close

k8s-ci-robot · 2024-09-19T07:33:56Z

@sbueringer: Closing this issue.

In response to this:

I think we can close the issue, the flake was pretty frequent before, so I think we have enough data to be sure it's fixed.

Thx for fixing this flake!

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

k8s-ci-robot added kind/flake Categorizes issue or PR as related to a flaky test. needs-priority Indicates an issue lacks a `priority/foo` label and requires one. needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Aug 19, 2024

k8s-ci-robot added the area/machinepool Issues or PRs related to machinepools label Aug 19, 2024

sbueringer added the priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. label Aug 21, 2024

k8s-ci-robot removed the needs-priority Indicates an issue lacks a `priority/foo` label and requires one. label Aug 21, 2024

Sunnatillo added this to CAPI v1.9 release improvement tasks Aug 28, 2024

cahillsf mentioned this issue Sep 1, 2024

🌱 Improve TestReconcileMachinePoolMachines unit test #11124

Merged

k8s-ci-robot assigned cahillsf Sep 1, 2024

k8s-ci-robot closed this as completed in #11124 Sep 2, 2024

github-project-automation bot moved this to Done in CAPI v1.9 release improvement tasks Sep 2, 2024

k8s-ci-robot reopened this Sep 2, 2024

k8s-ci-robot closed this as completed Sep 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

flaking unit test in `TestReconcileMachinePoolMachines` #11070

flaking unit test in `TestReconcileMachinePoolMachines` #11070

cahillsf commented Aug 19, 2024

cahillsf commented Aug 19, 2024

sbueringer commented Aug 21, 2024

k8s-ci-robot commented Aug 21, 2024

cahillsf commented Sep 1, 2024

sbueringer commented Sep 2, 2024

sbueringer commented Sep 2, 2024 •

edited

Loading

k8s-ci-robot commented Sep 2, 2024

cahillsf commented Sep 2, 2024 •

edited

Loading

cahillsf commented Sep 18, 2024

sbueringer commented Sep 19, 2024

k8s-ci-robot commented Sep 19, 2024

flaking unit test in TestReconcileMachinePoolMachines #11070

flaking unit test in TestReconcileMachinePoolMachines #11070

Comments

cahillsf commented Aug 19, 2024

Which jobs are flaking?

Which tests are flaking?

Since when has it been flaking?

Testgrid link

Reason for failure (if possible)

Anything else we need to know?

Label(s) to be applied

cahillsf commented Aug 19, 2024

sbueringer commented Aug 21, 2024

k8s-ci-robot commented Aug 21, 2024

Guidelines

cahillsf commented Sep 1, 2024

sbueringer commented Sep 2, 2024

sbueringer commented Sep 2, 2024 • edited Loading

k8s-ci-robot commented Sep 2, 2024

cahillsf commented Sep 2, 2024 • edited Loading

cahillsf commented Sep 18, 2024

sbueringer commented Sep 19, 2024

k8s-ci-robot commented Sep 19, 2024

flaking unit test in `TestReconcileMachinePoolMachines` #11070

flaking unit test in `TestReconcileMachinePoolMachines` #11070

sbueringer commented Sep 2, 2024 •

edited

Loading

cahillsf commented Sep 2, 2024 •

edited

Loading