
CI: Fix retry action #306

Merged: 12 commits into mne-tools:main on Aug 6, 2024

Conversation

larsoner (Member) commented Aug 5, 2024

Closes #301
First part of the suggestion from #301 (comment). Let's see if the default is to have errexit on.

larsoner (Member, Author) commented Aug 5, 2024

Confirmed errexit is on, at least. Now let's see if we can get it to fail three times...

larsoner (Member, Author) commented Aug 5, 2024

Okay, it exited with code 0 even though I passed a command that should fail, I think because of a bug in the attempt-count logic.
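
(For context: this failure mode, a failing command but an overall exit code of 0, is typical of a retry loop that runs out of attempts without re-raising the last exit code. The actual action is a bash composite step; the sketch below is only a hypothetical Python rendering of the corrected control flow, not the repository's code.)

```python
# Hypothetical sketch of the intended retry control flow: re-run the command
# up to `max_attempts` times and, crucially, exit with the *last* return code
# instead of falling through to an implicit exit 0.
import subprocess
import sys

def run_with_retries(cmd: list[str], max_attempts: int = 3) -> int:
    code = 0
    for attempt in range(1, max_attempts + 1):
        code = subprocess.run(cmd).returncode
        if code == 0:
            return 0
        print(f"Command exited with code {code} after {attempt} attempts.")
    return code  # propagate the final failure instead of reporting success

if __name__ == "__main__":
    sys.exit(run_with_retries(sys.argv[1:]))
```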

larsoner (Member, Author) commented Aug 5, 2024

larsoner changed the title from "WIP: Fix retry action" to "CI: Fix retry action" on Aug 5, 2024
larsoner (Member, Author) commented Aug 5, 2024

Appears to have worked on the Pytest step, too.

Command exited with code 1 after 1 attempts.

I guess maybe you also want exit code 1 in retry_error_codes, not just 134,139?

larsoner (Member, Author) commented Aug 5, 2024

mscheltienne (Member) commented:

set -eo pipefail, set +e, and set -e... I completely missed that part, thanks!
Actually, I don't want the CIs to retry on exit code 1, only on segmentation faults and Python fatal errors, which still occur at least once every 20-30 runs, probably during teardown/cleanup. If the tests fail, that should be fixable in the tests, and there is no need to wait for 3 retries to get that information ;)
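
(A minimal sketch of that policy, assuming retry_error_codes holds POSIX-style exit codes: 139 is 128 + SIGSEGV, i.e. a segmentation fault, and 134 is 128 + SIGABRT, which a Python fatal error produces, while an ordinary test failure exits with 1 and should fail fast. The function name is hypothetical, not the action's code.)

```python
# Hypothetical gate for the retry decision: only crash-like exit codes are
# retried; a regular pytest failure (exit 1) is reported immediately.
RETRY_CODES = {134, 139}  # 128 + SIGABRT (fatal error), 128 + SIGSEGV (segfault)

def should_retry(exit_code: int, retry_codes: set[int] = RETRY_CODES) -> bool:
    """Retry only on crash-like exit codes; ordinary failures fail fast."""
    return exit_code in retry_codes

assert should_retry(139)     # segmentation fault -> retry
assert not should_retry(1)   # regular test failure -> report immediately
```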

mscheltienne enabled auto-merge (squash) on August 5, 2024, 19:50
larsoner (Member, Author) commented Aug 5, 2024

Okay, look at 18c2a49 then: it had one or two jobs fail with exit 1.

larsoner (Member, Author) commented Aug 5, 2024

larsoner (Member, Author) commented Aug 5, 2024

Maybe those tests could use https://pypi.org/project/pytest-retry/ since they're flaky but don't actually segfault?
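
(A sketch of what that could look like, assuming pytest-retry's flaky marker as described on its PyPI page; the exact keyword names should be verified against its documentation. pytest-rerunfailures offers a similar @pytest.mark.flaky(reruns=...) marker.)

```python
# Sketch: per-test retries for a known-flaky (but non-crashing) test.
# Assumes pytest-retry is installed (pip install pytest-retry); the marker
# keywords below follow its README and should be double-checked there.
import pytest

@pytest.mark.flaky(retries=2, delay=1)
def test_sometimes_flaky(tmp_path):
    # Placeholder body; the real flaky tests live in the mne-lsl suite.
    assert tmp_path.exists()
```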

mscheltienne merged commit 81bcef3 into mne-tools:main on Aug 6, 2024
18 of 19 checks passed
mscheltienne (Member) commented Aug 6, 2024

> Like this run specifically https://github.com/mne-tools/mne-lsl/actions/runs/10253602827/job/28366606652

Yes, in this one, an LSL stream from other tests was not properly terminated. Probably one of the rarest sources of flakiness.

Anyway, the idea was to have this small action to recover from hard crashes, and eventually pytest-retry, flaky, or pytest-rerunfailures for flaky tests, but that might not even be needed. For instance, it would not have salvaged the test run with this additional LSL stream that was not properly terminated.

mscheltienne (Member) commented:

But this run https://github.com/mne-tools/mne-lsl/actions/runs/10253602827/job/28366606652#step:8:14480 now suggests that when a hard crash occurs, LSL streams are not properly terminated and linger in the background, making the test suite fail on the next iteration when it looks for a fixed number of streams with resolve_streams(). I'll mark those tests/parts as xfail and add a random UUID string at the end of the stream names to guarantee uniqueness.
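
(A hypothetical illustration of both mitigations; the helper and test names are made up, and only the UUID-suffix idea and the xfail marking come from the comment above.)

```python
# 1) Give every test stream a unique name so streams lingering after a hard
#    crash cannot collide with the next run.
# 2) xfail the parts that count streams via resolve_streams().
import uuid

import pytest

def unique_stream_name(base: str) -> str:
    """Append a short random suffix, e.g. 'test-stream-3f9c1a2b'."""
    return f"{base}-{uuid.uuid4().hex[:8]}"

@pytest.mark.xfail(reason="lingering LSL streams after a hard crash", strict=False)
def test_stream_count():
    # Would call mne_lsl.lsl.resolve_streams() and compare against the
    # expected number of streams; xfail because a prior hard crash can leave
    # extra streams around.
    ...
```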

Development

Successfully merging this pull request may close: "GitHub action to rerun a step upon failure with specific exit-codes is doing nothing" (#301)