Testing build fix #2406

mik-dass · 2019-11-21T10:16:35Z

What kind of PR is this?

/kind flake

What does this PR do / why we need it:

Which issue(s) this PR fixes:

Fixes #1981

How to test changes / Special notes to the reviewer:

mik-dass · 2019-11-21T12:23:22Z

[odo]  ✗  waited 4m0s but couldn't find running pod matching selector: 'deploymentconfig=backend-app'

/retest

mik-dass · 2019-11-25T09:56:54Z

Container setup exited with code 1, reason Error

/retest

mik-dass · 2019-11-25T14:00:31Z

error: could not run steps: step integration-e2e-benchmark failed: template pod "integration-e2e-benchmark" failed: the pod ci-op-hgs9sfdi/integration-e2e-benchmark failed after 44m39s (failed containers: setup, test): ContainerFailed one or more containers exited

/retest

mik-dass · 2019-11-26T16:44:41Z

error: could not run steps: step [release:latest] failed: release "release-latest" failed: the pod ci-op-v48j03kz/release-latest failed after 47s (failed containers: release): ContainerFailed one or more containers exited

/retest

mik-dass · 2019-12-05T07:19:54Z

error: could not run steps: step integration-e2e-benchmark failed: template pod "integration-e2e-benchmark" failed: the pod ci-op-3q88t7ms/integration-e2e-benchmark failed after 11m46s (failed containers: setup, test): ContainerFailed one or more containers exited

/retest

mik-dass · 2019-12-09T06:27:04Z

/test v4.3-integration-e2e-benchmark

cdrage · 2019-12-10T20:21:39Z

pkg/occlient/occlient.go

@@ -1682,23 +1683,36 @@ func (c *Client) WaitForBuildToFinish(buildName string) error {
 		return errors.Wrapf(err, "unable to watch build")
 	}
 	defer w.Stop()
+	timeout := time.After(5 * time.Minute)


Shouldn't this be a constant? (high above in occlient.go)

cdrage · 2019-12-10T20:22:09Z

pkg/occlient/occlient.go

@@ -1672,7 +1672,8 @@ func (c *Client) StartBuild(name string) (string, error) {
 }

 // WaitForBuildToFinish block and waits for build to finish. Returns error if build failed or was canceled.
-func (c *Client) WaitForBuildToFinish(buildName string) error {
+func (c *Client) WaitForBuildToFinish(buildName string, stdout io.Writer) error {
+	following := false


Need more description on this, bit confused why we're setting following to false then true when going through the channel loop.

cdrage · 2019-12-10T20:22:19Z

pkg/occlient/occlient.go

+						if err != nil {
+							return err
+						}
+					}


All of this above looks good!

cdrage · 2019-12-10T20:22:28Z

pkg/occlient/occlient.go

@@ -1845,6 +1859,7 @@ func (c *Client) FollowBuildLog(buildName string, stdout io.Writer) error {
 	}

 	rd, err := c.buildClient.RESTClient().Get().
+		Timeout(5*time.Minute).


Should be a constant

mik-dass · 2019-12-11T07:57:49Z

@cdrage Fixed

dharmit

TBH, I don't understand the code in this PR. 😞

dharmit · 2019-12-11T12:08:10Z

pkg/occlient/occlient.go

-func (c *Client) WaitForBuildToFinish(buildName string) error {
+func (c *Client) WaitForBuildToFinish(buildName string, stdout io.Writer) error {
+	// following indicates if we have already setup the following logic
+	following := false
 	glog.V(4).Infof("Waiting for %s  build to finish", buildName)

 	w, err := c.buildClient.Builds(c.Namespace).Watch(metav1.ListOptions{


I understand this is not touched in the proposed PR, but can we add a comment that describes what this is all about? At least I didn't understand a thing here. 😞

dharmit · 2019-12-11T12:24:35Z

pkg/occlient/occlient.go

@@ -1682,23 +1685,37 @@ func (c *Client) WaitForBuildToFinish(buildName string) error {
 		return errors.Wrapf(err, "unable to watch build")
 	}
 	defer w.Stop()
+	timeout := time.After(OcBuildTimeout)
 	for {


Don't understand what's going on in this for loop either. 😞

I can help here a bit. The first thing to look at is the select statement. Quick TL;DR: a select statement is like a channel collector; it listens on multiple channels at once. More here: https://tour.golang.org/concurrency/5.

Builds().Watch() gives us a channel (via w.ResultChan()) that delivers messages as the pod status changes. The change made here is that we wait for the pod to start before we start following the logs.

As for why the following flag is needed: w.ResultChan() can send multiple messages of type BuildPhaseRunning, but we don't have to follow the logs multiple times; the first time is enough. Hence this flag.
Think of it as a singleton.

mik-dass · 2019-12-12T08:03:04Z

@dharmit Updated with more comments

mik-dass · 2019-12-12T10:52:39Z

[odo]  ✗  invalid configuration: [context was not found for specified context: tmeibqyxnu/api-ci-op-t9t1htld-00a90-origin-ci-int-aws-dev-rhcloud-com:6443/developer, cluster has no server defined]
[odo] Please login to your server:

/retest

girishramnani · 2019-12-27T07:10:25Z

/approve

openshift-ci-robot · 2019-12-27T07:10:56Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: girishramnani

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [girishramnani]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

kadel · 2020-01-31T12:58:46Z

/test all

kadel · 2020-01-31T15:18:53Z

/lgtm

mik-dass added the do-not-merge/hold label Nov 21, 2019

openshift-ci-robot added the do-not-merge/work-in-progress, flake, and size/M labels Nov 21, 2019

openshift-ci-robot requested review from dharmit and kadel November 21, 2019 10:17

mik-dass force-pushed the build_timeout_fix branch from fca70cc to da2f222 Compare December 5, 2019 07:02

Testing build fix

377ca01

mik-dass force-pushed the build_timeout_fix branch from da2f222 to 377ca01 Compare December 9, 2019 07:35

mik-dass changed the title ~~[WIP] Testing build fix~~ Testing build fix Dec 10, 2019

openshift-ci-robot removed the do-not-merge/work-in-progress label Dec 10, 2019

mik-dass removed the do-not-merge/hold label Dec 10, 2019

cdrage suggested changes Dec 10, 2019

Adds build timeout constant and some comments

d500359

dharmit reviewed Dec 11, 2019

Adds comments for WaitForBuildToFinish()

2fdd200

openshift-ci-robot added the approved label Dec 27, 2019

openshift-ci-robot assigned kadel Jan 31, 2020

openshift-ci-robot added the lgtm label Jan 31, 2020

openshift-merge-robot merged commit ec73cab into redhat-developer:master Jan 31, 2020

mik-dass deleted the build_timeout_fix branch February 3, 2020 06:08

rm3l added the estimated-size/M (10-20) label Jun 18, 2023
