Fix error handling when failing to install a deb package (#11846) #15087

liushilongbuaa · 2023-05-16T06:22:01Z

Why I did it

Fix endless build log issue.
Cherry pick PR#11846

Work item tracking

Microsoft ADO (number only): 19299131

How I did it

The current error handling code for when a deb package fails to be installed currently has a chain of commands linked together by && and ends with exit 1. The assumption is that the commands would succeed, and the last exit 1 would end it with a non-zero return code, thus fully failing the target and causing the build to stop because of bash's -e flag.

However, if one of the commands prior to exit 1 returns a non-zero return code, then bash won't actually treat it as a terminating error. From bash's man page:

-e      Exit immediately if a pipeline (which may consist of a single simple
	command), a list, or a compound command (see SHELL GRAMMAR above),
        exits with a non-zero status.  The shell does not exit if the
        command that fails is part of the  command  list  immediately
        following a while or until keyword, part of the test following the
        if or elif reserved words, part of any command executed in a && or
        || list except the command following the final && or ||, any
        command in a pipeline but the last, or if the command's return
        value is being inverted with !.  If a compound command other than a
        subshell returns a non-zero status because a command failed while
        -e was being ignored, the shell does not exit.

The part part of any command executed in a && or || list except the command following the final && or || says that if the failing command is not the exit 1 that we have at the end, then bash doesn't treat it as an error and exit immediately. Additionally, since this is a compound command, but isn't in a subshell (subshell are marked by ( and ), whereas { and } just tells bash to run the commands in the current environment), bash doesn't exist. The result of this is that in the deb-install target, if a package installation fails, it may be infinitely stuck in that while-loop.

There are two fixes for this: change to using a subshell, or use ; instead of &&. Using a subshell would, I think, require exporting any shell variables used in the subshell, so I chose to change the && to ;. In addition, at the start of the subshell, set +e is added in, which removes the exit-on-error handling of bash. This makes sure that all commands are run (the output of which may help for debugging) and that it still exits with 1, which will then fully fail the target.

How to verify it

Which release branch to backport (provide reason below if selected)

Tested branch (Please provide the tested image version)

Description for the changelog

Link to config_db schema for YANG module changes

A picture of a cute animal (not mandatory but encouraged)

…1846) The current error handling code for when a deb package fails to be installed currently has a chain of commands linked together by && and ends with `exit 1`. The assumption is that the commands would succeed, and the last `exit 1` would end it with a non-zero return code, thus fully failing the target and causing the build to stop because of bash's -e flag. However, if one of the commands prior to `exit 1` returns a non-zero return code, then bash won't actually treat it as a terminating error. From bash's man page: ``` -e Exit immediately if a pipeline (which may consist of a single simple command), a list, or a compound command (see SHELL GRAMMAR above), exits with a non-zero status. The shell does not exit if the command that fails is part of the command list immediately following a while or until keyword, part of the test following the if or elif reserved words, part of any command executed in a && or || list except the command following the final && or ||, any command in a pipeline but the last, or if the command's return value is being inverted with !. If a compound command other than a subshell returns a non-zero status because a command failed while -e was being ignored, the shell does not exit. ``` The part `part of any command executed in a && or || list except the command following the final && or ||` says that if the failing command is not the `exit 1` that we have at the end, then bash doesn't treat it as an error and exit immediately. Additionally, since this is a compound command, but isn't in a subshell (subshell are marked by `(` and `)`, whereas `{` and `}` just tells bash to run the commands in the current environment), bash doesn't exist. The result of this is that in the deb-install target, if a package installation fails, it may be infinitely stuck in that while-loop. There are two fixes for this: change to using a subshell, or use `;` instead of `&&`. Using a subshell would, I think, require exporting any shell variables used in the subshell, so I chose to change the `&&` to `;`. In addition, at the start of the subshell, `set +e` is added in, which removes the exit-on-error handling of bash. This makes sure that all commands are run (the output of which may help for debugging) and that it still exits with 1, which will then fully fail the target. Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com> Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>

xumia · 2023-05-17T03:05:43Z

@qiluo-msft , could you please merge the code?

liushilongbuaa marked this pull request as ready for review May 16, 2023 06:22

liushilongbuaa requested a review from xumia May 16, 2023 06:23

xumia approved these changes May 16, 2023

View reviewed changes

liushilongbuaa requested a review from qiluo-msft May 17, 2023 06:33

lguohan added the Request for 202012 Branch label May 17, 2023

qiluo-msft approved these changes May 25, 2023

View reviewed changes

qiluo-msft merged commit 87e1a0a into sonic-net:202012 May 25, 2023

liuh-80 added the Included in 202012 Branch label Jun 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix error handling when failing to install a deb package (#11846) #15087

Fix error handling when failing to install a deb package (#11846) #15087

liushilongbuaa commented May 16, 2023 •

edited

Loading

xumia commented May 17, 2023

Fix error handling when failing to install a deb package (#11846) #15087

Fix error handling when failing to install a deb package (#11846) #15087

Conversation

liushilongbuaa commented May 16, 2023 • edited Loading

Why I did it

Work item tracking

How I did it

How to verify it

Which release branch to backport (provide reason below if selected)

Tested branch (Please provide the tested image version)

Description for the changelog

Link to config_db schema for YANG module changes

A picture of a cute animal (not mandatory but encouraged)

xumia commented May 17, 2023

liushilongbuaa commented May 16, 2023 •

edited

Loading