-
Notifications
You must be signed in to change notification settings - Fork 304
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DAOS-16487 control: Require hostname for nvme set-faulty & replace #15074
Conversation
Ticket title is 'dmg storage must precisely identify NVMe device(s)' |
Test stage Unit Test on EL 8.8 completed with status UNSTABLE. https://build.hpdd.intel.com/job/daos-stack/job/daos//view/change-requests/job/PR-15074/1/testReport/ |
Features: control Required-githooks: true Signed-off-by: Tom Nabarro <tom.nabarro@intel.com>
Features: control Required-githooks: true Signed-off-by: Tom Nabarro <tom.nabarro@intel.com>
ea480b9
to
2f3ca2f
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall looks good. Ftest dmg infrastructure updates may be needed for the telemetry commands now that the single host command has changed.
Test stage Functional on EL 8.8 completed with status UNSTABLE. https://build.hpdd.intel.com/job/daos-stack/job/daos//view/change-requests/job/PR-15074/2/testReport/ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM.
+1 with kris for adding the telemetry feature before approving.
Features: control telemetry Required-githooks: true Signed-off-by: Tom Nabarro <tom.nabarro@intel.com>
Features: pool telemetry Required-githooks: true Signed-off-by: Tom Nabarro <tom.nabarro@intel.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ftest LGTM
Test stage Functional Hardware Medium completed with status FAILURE. https://build.hpdd.intel.com/job/daos-stack/job/daos/job/PR-15074/4/display/redirect |
Test stage Functional Hardware Medium Verbs Provider completed with status FAILURE. https://build.hpdd.intel.com/job/daos-stack/job/daos/job/PR-15074/4/display/redirect |
Test stage Functional Hardware Medium completed with status FAILURE. https://build.hpdd.intel.com//job/daos-stack/job/daos/view/change-requests/job/PR-15074/5/execution/node/505/log |
src/tests/ftest/util/nvme_utils.py
Outdated
@@ -44,7 +44,7 @@ def set_device_faulty(test, dmg, server, uuid, pool=None, has_sys_xs=False, **kw | |||
Args: | |||
test (Test): avocado test class | |||
dmg (DmgCommand): a DmgCommand class instance | |||
server (NodeSet): host on which to issue the dmg storage set nvme-faulty | |||
server (str): host on which to issue the dmg storage set nvme-faulty |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Minor, but we should probably leave this as a NodeSet input type. This docstring change implies that the user will need to unnecessarily convert a NodeSet object into a string when this is not actually needed. Through the FormattedParameter object the DmgCommand.host.value NodeSet object will be converted into a string when creating the dmg command string. Ideally, we want to keep all host information as a NodeSet object in the test harness as it simplifies other operations we may need to perform.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
fixed
deployment/disk_failure.py and vmd/fault_reintegration.py failures in |
Features: pool telemetry Required-githooks: true Signed-off-by: Tom Nabarro <tom.nabarro@intel.com>
…tfaulty-replace-hostopt Features: pool telemetry Signed-off-by: Tom Nabarro <tom.nabarro@intel.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ftest LGTM
Passed VMD tests because by chance functional hardware medium ran on VMD enabled cluster (PCI addresses begin with non-zero domain) Failed VMD tests because the same test ran on a non-VMD enabled cluster (PCI addresses begin with zero domain) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Changes look good to me, but I'll defer to those with more context into requirements for approvals.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks OK to me as long as telemetry tests pass.
…15074) Makes --host option mandatory for dmg storage set nvme-faulty, dmg storage replace nvme and dmg telemetry metrics. These commands should only be run on a single host which should be explicitly specified. Also remove --no-reint flag from replace nvme command. Signed-off-by: Tom Nabarro <tom.nabarro@intel.com>
…15074) (#15260) Makes --host option mandatory for dmg storage set nvme-faulty, dmg storage replace nvme and dmg telemetry metrics. These commands should only be run on a single host which should be explicitly specified. Also remove --no-reint flag from replace nvme command. Signed-off-by: Tom Nabarro <tom.nabarro@intel.com>
Makes --host option mandatory for dmg storage set nvme-faulty, dmg
storage replace nvme and dmg telemetry metrics. These commands should
only be run on a single host which should be explicitly specified.
Also remove --no-reint flag from replace nvme command.
Features: control
Required-githooks: true
Before requesting gatekeeper:
Features:
(orTest-tag*
) commit pragma was used or there is a reason documented that there are no appropriate tags for this PR.Gatekeeper: