-
Notifications
You must be signed in to change notification settings - Fork 304
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DAOS-16330 cart: Reduce error logs during timeouts #14905
DAOS-16330 cart: Reduce error logs during timeouts #14905
Conversation
…me time - Replace error prints with info during state processing, as error is already logged above Required-githooks: true Signed-off-by: Alexander A Oganezov <alexander.a.oganezov@intel.com>
Ticket title is 'cart: reduce flood of error prints when rpcs time out' |
RPC_ERROR(rpc_priv, | ||
"d_binheap_insert failed, rc: %d\n", | ||
rc); | ||
RPC_ERROR(rpc_priv, "d_binheap_insert failed, rc: %d\n", rc); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
May I suggest we start using DL_ERROR internally here (or add another version of RPC_ERROR that takes rc as a parameter). It will print string version of error
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I dont think we ever hit this particular error; this would be something fatal. I agree though we should use DL_ERROR in places where it makes sense
Required-githooks: true Signed-off-by: Alexander A Oganezov <alexander.a.oganezov@intel.com>
Required-githooks: true Signed-off-by: Alexander A Oganezov <alexander.a.oganezov@intel.com>
Test stage Unit Test on EL 8.8 completed with status FAILURE. https://build.hpdd.intel.com/job/daos-stack/job/daos/job/PR-14905/2/display/redirect |
Required-githooks: true Signed-off-by: Alexander A Oganezov <alexander.a.oganezov@intel.com>
src/cart/crt_internal_types.h
Outdated
/** whether metrics are used */ | ||
cg_use_sensors : 1, | ||
/** whether we are on a primary provider */ | ||
cg_provider_is_primary : 1; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it would be great if you were using unsigned int
on each field to fix the format or just start using bool instead :)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it would be great if you were using
unsigned int
on each field to fix the format or just start using bool instead :)
we use bitfields here as we have to be careful about the size of this rpc_priv struct. its annoying for formatting though
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maybe I was not clear, what I'm saying is that you should not separate each field with ,
but instead repeat unsigned int
for each field so that they remain aligned.
/** whether metrics are used */
unsigned int cg_use_sensors : 1;
/** whether we are on a primary provider */
unsigned int cg_provider_is_primary : 1;
does that make sense ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
also fwiw this is the struct crt_gdata
not the rpc_priv struct...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maybe I was not clear, what I'm saying is that you should not separate each field with
,
but instead repeatunsigned int
for each field so that they remain aligned./** whether metrics are used */ unsigned int cg_use_sensors : 1; /** whether we are on a primary provider */ unsigned int cg_provider_is_primary : 1;does that make sense ?
Neat, i didnt know it allowed such style
Required-githooks: true Signed-off-by: Alexander A Oganezov <alexander.a.oganezov@intel.com>
Test stage Build on EL 8 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-14905/8/execution/node/384/log |
Test stage Build on Leap 15.5 with Intel-C and TARGET_PREFIX completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-14905/8/execution/node/388/log |
Test stage Build RPM on EL 9 completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-14905/8/execution/node/367/log |
- Reduce burst timeout error prints when multiple rpcs fail at the same time - Replace error prints with info during state processing, as error is already logged above - nuke 'No more free indices left' debug message - cleanup of startup logs Required-githooks: true Signed-off-by: Alexander A Oganezov <alexander.a.oganezov@intel.com>
- Reduce burst timeout error prints when multiple rpcs fail at the same time - Replace error prints with info during state processing, as error is already logged above - nuke 'No more free indices left' debug message - cleanup of startup logs Signed-off-by: Alexander A Oganezov <alexander.a.oganezov@intel.com>
Required-githooks: true
Before requesting gatekeeper:
Features:
(orTest-tag*
) commit pragma was used or there is a reason documented that there are no appropriate tags for this PR.Gatekeeper: