fix(process worker): fix the crashing of pod #1119

dhiren-singh-007 · 2024-10-25T13:09:18Z

Description

Handled the system exception and added the Critical/Fatal log type so that we can monitor the logs for criticality.

Why

The unhandled exception was causing issues for other processes as well.
For example, if an OSP customer sets the wrong callback address, or if the address is down, the call fails with an exception type derived from SystemException. That exception crashed the pod, which stopped all other processes.
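
The failure mode described above can be sketched as follows. This is an illustration only, written in Python rather than the project's .NET code, and every name in it (`run_processes`, the logger name, the `(name, callable)` pairs) is hypothetical. It shows one possible way to keep a failing process from taking the others down while still logging the failure at critical level.

```python
import logging

logger = logging.getLogger("process_worker")  # hypothetical logger name


def run_processes(processes):
    """Run each (name, callable) pair in order.

    A failure in one process is logged at CRITICAL and recorded,
    but it does not stop the remaining processes from running.
    """
    failed = []
    for name, step in processes:
        try:
            step()
        except Exception:
            # Log with the traceback so the failure can be monitored,
            # then continue with the next process.
            logger.critical("process %s failed", name, exc_info=True)
            failed.append(name)
    return failed
```

With this shape, a callback address that is down produces one critical log entry and one failed process, and the loop carries on with the rest.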

Issue

#1114

Checklist

Please delete options that are not relevant.

I have followed the contributing guidelines
I have performed a self-review of my own code
I have successfully tested my changes locally
I have checked that new and existing tests pass locally with my changes

Phil91

Please revert the changes in the framework dir, since nothing changed in the framework logic.

sonarqubecloud · 2024-10-29T11:57:42Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarCloud

ntruchsess · 2024-10-29T12:10:39Z

@dhiren-singh-007 @Phil91: I think it makes sense to force a restart of the pod when it runs out of memory; everything else should be logged as critical. I also changed the way SocketException and IOException are handled within the remote API calls. Those wouldn't throw this kind of SystemException but a ServiceException flagged as RecoverOptions.NETWORK (which is mapped to RecoverOptions.INFRASTRUCTURE, which most external system calls treat as recoverable, skipping the process-step so that it is retried later). Nevertheless, we might want to evaluate whether this is the right strategy for all external system calls. For example, when calling a URL that is provided by the customer, it might be better to set the respective process-step to ERROR, while retrying the same call with the next process-worker run would be suitable for calls to 'known as good' URLs, where it's reasonable to assume the network-related issue is temporary.
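
The strategy discussed in this comment can be sketched roughly as below. This is a Python illustration, not the project's .NET implementation. `StepOutcome`, `ServiceError`, `call_external`, `execute_step` and the `url_is_trusted` flag are all hypothetical names. The trusted versus customer-provided distinction is the open question raised in the comment; the PR is not stated to implement it.

```python
import enum
import logging

logger = logging.getLogger("process_worker")  # hypothetical logger name


class StepOutcome(enum.Enum):
    DONE = "done"
    RETRY_LATER = "retry_later"  # skip the step; a later worker run retries it
    ERROR = "error"              # mark the step as failed, no automatic retry


class ServiceError(Exception):
    """Hypothetical stand-in for a service exception with a recoverable flag."""

    def __init__(self, message, recoverable):
        super().__init__(message)
        self.recoverable = recoverable


def call_external(request, url_is_trusted):
    """Wrap network and I/O failures in a ServiceError.

    In Python, OSError covers socket and I/O errors. Assumption: failures
    against a known-good URL are treated as temporary, failures against a
    customer-provided URL are not.
    """
    try:
        return request()
    except OSError as exc:
        raise ServiceError(str(exc), recoverable=url_is_trusted) from exc


def execute_step(request, url_is_trusted):
    """Run one process step and classify the result."""
    try:
        call_external(request, url_is_trusted)
        return StepOutcome.DONE
    except MemoryError:
        # Out of memory is the only case allowed to terminate the worker.
        raise
    except ServiceError as exc:
        return StepOutcome.RETRY_LATER if exc.recoverable else StepOutcome.ERROR
    except Exception:
        # Everything else is logged as critical; the worker keeps running.
        logger.critical("process step failed", exc_info=True)
        return StepOutcome.ERROR
```

The design choice here is that only memory exhaustion propagates out of the step, so a restart is forced in that case alone, while network trouble is turned into a per-step decision.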

* terminate processworker only when running out of memory.
* handle SocketException and IOException explicitly in external system calls.
* update framework version
* update process worker test

---------

Co-authored-by: Norbert Truchsess <norbert.truchsess@t-online.de>

dhiren-singh-007 added 2 commits October 25, 2024 14:52

fix(process worker): fix the crashing of pod

f8270b8

update process worker test

297a030

dhiren-singh-007 requested review from ntruchsess and Phil91 and removed request for ntruchsess October 25, 2024 13:10

updated nuget package versions

88000ec

Phil91 requested changes Oct 28, 2024

revert the framework changes

09f891b

dhiren-singh-007 requested a review from Phil91 October 29, 2024 08:20

handle SocketException and IOException explicitly in external system …

b62273b

…calls. terminate processworker only when running out of memory. update framework version

ntruchsess force-pushed the bugifx/1114-fix-process-worker-crash branch from c9ebf6f to b62273b Compare October 29, 2024 11:53

ntruchsess approved these changes Oct 29, 2024

Phil91 approved these changes Oct 29, 2024

ntruchsess merged commit a6936a5 into eclipse-tractusx:main Oct 29, 2024
11 checks passed

ntruchsess added this to the Release 25.03 milestone Oct 29, 2024

dhiren-singh-007 deleted the bugifx/1114-fix-process-worker-crash branch October 29, 2024 15:26
