Use correct module for errors_found_in_log #3119

Flamefire · 2019-12-10T16:55:07Z

The (important) warning message was never printed due to a wrong module being used after a rename/move some time ago

easybuild/framework/easyblock.py

test/framework/utilities.py

ocaisa

For me this looks good, I will take a look tomorrow to see how hard it would be to come up a test

ocaisa · 2019-12-10T17:40:50Z

For me this looks good, I will take a look tomorrow to see how hard it would be to come up a test

Actually the test is there, this was really just incorrect import/usage.

Flamefire · 2019-12-10T18:18:46Z

Hmm, haven't found one by grepping for errors_found_in_log . Probably best to check the log anyway

boegel · 2019-12-10T19:24:03Z

Nice catch @Flamefire...

We indeed overlooked this errors_found_in_log global variable when moving parse_log_for_error from easybuild.tools.filetools to easybuild.tools.run a (very) long time ago (see #842).

I'm a bit reluctant to re-instate the warning like this though, since the pattern used by parse_log_for_error for error seems very generic to me now that I take a closer look at it...

It basically considers any occurrence of error or failed as a clear indication that something went wrong when running a command, while we know now that this often leads to many false positives.

This is from the CMake log file for example, after checking the output of ./configure ...:

== 2019-11-03 23:36:42,996 run.py:589 INFO parse_log_for_error (some may be harmless) regExp (?<![(,-]|\w)(?:error|segmentation fault|failed)(?![(,-]|\.?\w) found:
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Failed                                                                                                                                                         -- Check size of __int64 - failed
-- Check size of unsigned __int64 - failed                                                                                                                                                                  -- Performing Test HAVE_WORKING_EXT2_IOC_GETFLAGS - Failed
-- Performing Test HAVE_STRUCT_VFSCONF - Failed                                                                                                                                                             -- Performing Test HAVE_STRUCT_XVFSCONF - Failed
-- Performing Test MAJOR_IN_MKDEV - Failed                                                                                                                                                                  -- Performing Test HAVE_LZMA_STREAM_ENCODER_MT - Failed
-- Performing Test HAVE_STRUCT_TM___TM_GMTOFF - Failed                                                                                                                                                      -- Performing Test HAVE_STRUCT_STATFS_F_NAMEMAX - Failed
-- Performing Test HAVE_STRUCT_STAT_ST_BIRTHTIME - Failed                                                                                                                                                   -- Performing Test HAVE_STRUCT_STAT_ST_BIRTHTIMESPEC_TV_NSEC - Failed
-- Performing Test HAVE_STRUCT_STAT_ST_MTIMESPEC_TV_NSEC - Failed                                                                                                                                           -- Performing Test HAVE_STRUCT_STAT_ST_MTIME_N - Failed
-- Performing Test HAVE_STRUCT_STAT_ST_UMTIME - Failed                                                                                                                                                      -- Performing Test HAVE_STRUCT_STAT_ST_MTIME_USEC - Failed
-- Performing Test HAVE_STRUCT_STAT_ST_FLAGS - Failed                                                                                                                                                       -- Performing Test HAVE_STRUCT_STATVFS_F_IOSIZE - Failed

So, I'm not sure that re-instating this without doing a better job at filtering out false positives (which can be hard to distinguish from actual problems in practice)...

Flamefire · 2019-12-11T12:30:00Z

Found another bug in the code, so it seems the log message is not covered by any test.

I'm a bit reluctant to re-instate the warning like this though, since the pattern used by parse_log_for_error for error seems very generic to me now that I take a closer look at it...

It basically considers any occurrence of error or failed as a clear indication that something went wrong when running a command, while we know now that this often leads to many false positives.

Then my proposal would be:

Merge this, it fixes a regression either way
Sort out and merge Add check_log_for_errors to detect and handle multiple errors #3118
Use Add check_log_for_errors to detect and handle multiple errors #3118 from parse_log_for_error in the default case to produce less false-positives. Eg. -- Performing Test can be added as an "IGNORE" match
Use Add check_log_for_errors to detect and handle multiple errors #3118 in configure EB to fix hotfixes #157

Flamefire · 2019-12-12T09:20:03Z

Together with #3118 I could use it on our system to install a big package chain and sort out the most common false positives. I expect most of them to be the same, so should be easy to reduce the amount of false-positives considerably

ocaisa

I'm fine with this but since there's a big connection to #3118 I'll leave it to @boegel to merge

boegel · 2020-01-14T12:06:38Z

Reducing the false positives is nice, but I still feel that producing a big fat warning for every occurrence of something that looks like an error but may not be is not a good idea.

In some cases you may not even be able to automatically distinguish false positives from actual errors...

How about changing the print_warning to log.warning, to just log in the log file rather than spit it out in the output of the eb command?

Flamefire · 2020-01-14T12:14:44Z

Sure. This PR was just meant to fix a regression and not question the initial intention of that feature. I'm all for replacing this by something more advanced.

On the other hand: Providing this warnings to someone testing a EC as a "warning" could still be useful. Checking for false-positives can be done by the user and common false-positives can be suppressed by EB (see my other PR). I mean: In the end this is a warning that something might be wrong. We can't know if this is a false-positive or not. For the second case it would be wrong not to show it.

boegel · 2020-01-14T12:27:55Z

I understand your point, but I think there will be a lot more false positives than genuine errors that go totally undetected through other means (like a failing sanity check).

We certainly have had errors that went undetected for a while, but I'd say they are rare, and so I'm reluctant to start raising concerns by highlighting false positives...

Flamefire · 2020-01-14T12:33:18Z

Ok so _log.warning instead

smoors

lgtm

smoors

lgtm

ocaisa reviewed Dec 10, 2019

View reviewed changes

easybuild/framework/easyblock.py Outdated Show resolved Hide resolved

ocaisa added the bug fix label Dec 10, 2019

Flamefire force-pushed the fix_error_display branch from 9fd2b83 to e5950d8 Compare December 10, 2019 17:09

houndci-bot reviewed Dec 10, 2019

View reviewed changes

test/framework/utilities.py Outdated Show resolved Hide resolved

test/framework/utilities.py Outdated Show resolved Hide resolved

test/framework/utilities.py Outdated Show resolved Hide resolved

Flamefire force-pushed the fix_error_display branch from e5950d8 to 70c72c2 Compare December 10, 2019 17:13

ocaisa previously approved these changes Dec 10, 2019

View reviewed changes

boegel added this to the release after 4.1.0 (4.1.1?) milestone Dec 10, 2019

Flamefire dismissed ocaisa’s stale review via d053316 December 11, 2019 12:25

ocaisa previously approved these changes Dec 12, 2019

View reviewed changes

boegel modified the milestones: next release (4.1.1), release after 4.1.1 (4.1.2?) Dec 30, 2019

Flamefire added 3 commits January 14, 2020 13:52

Use correct module for errors_found_in_log

fa2a21b

Fix keyword argument to print_msg

e9e3f3b

Write possible errors warning to log only

1f01afe

Flamefire dismissed ocaisa’s stale review via 1f01afe January 14, 2020 12:56

Flamefire force-pushed the fix_error_display branch from d053316 to 1f01afe Compare January 14, 2020 12:56

smoors previously approved these changes Feb 17, 2020

View reviewed changes

Merge branch 'develop' into fix_error_display

690dff1

smoors dismissed their stale review via 690dff1 February 17, 2020 13:06

smoors approved these changes Feb 19, 2020

View reviewed changes

smoors merged commit e453884 into easybuilders:develop Feb 19, 2020

Flamefire deleted the fix_error_display branch February 19, 2020 16:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use correct module for errors_found_in_log #3119

Use correct module for errors_found_in_log #3119

Flamefire commented Dec 10, 2019

ocaisa left a comment

ocaisa commented Dec 10, 2019

Flamefire commented Dec 10, 2019 •

edited

Loading

boegel commented Dec 10, 2019

Flamefire commented Dec 11, 2019

Flamefire commented Dec 12, 2019

ocaisa left a comment

boegel commented Jan 14, 2020

Flamefire commented Jan 14, 2020

boegel commented Jan 14, 2020

Flamefire commented Jan 14, 2020

smoors left a comment

smoors left a comment

Use correct module for errors_found_in_log #3119

Use correct module for errors_found_in_log #3119

Conversation

Flamefire commented Dec 10, 2019

ocaisa left a comment

Choose a reason for hiding this comment

ocaisa commented Dec 10, 2019

Flamefire commented Dec 10, 2019 • edited Loading

boegel commented Dec 10, 2019

Flamefire commented Dec 11, 2019

Flamefire commented Dec 12, 2019

ocaisa left a comment

Choose a reason for hiding this comment

boegel commented Jan 14, 2020

Flamefire commented Jan 14, 2020

boegel commented Jan 14, 2020

Flamefire commented Jan 14, 2020

smoors left a comment

Choose a reason for hiding this comment

smoors left a comment

Choose a reason for hiding this comment

Flamefire commented Dec 10, 2019 •

edited

Loading