You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Issue:
The CSI Podmon applies taints to all worker nodes when there is a storage disconnection. However, when storage connectivity is restored, it fails to automatically remove the taints. This behavior significantly delays the restoration of production traffic after a storage disconnection event.
Action Taken:
We opened a Dell support request 206899925 and successfully reproduced the issue in the presence of the support team in the lab environment. We also provided all the necessary logs for further analysis and tested the issue with different versions of the CSI.
Dell’s Support team Recommendations:
After analyzing the issue and consulting with Dell engineering team, the support engineer suggested submitting an enhancement request through the account team. They determined that auto-removal of taints is currently not supported in any version of CSI (refer to the attached email for further details).
Enhancement Request:
We recommend that the CSI Podmon solution should be enhanced to automatically remove taints from all worker nodes as soon as storage connectivity is restored. This improvement is crucial for large environments where more than 300 pods are running at a single site. It will ensure that production traffic is restored immediately after any storage disconnection, minimizing downtime and optimizing efficiency.
The text was updated successfully, but these errors were encountered:
@JWilsonDell: Thank you for submitting this issue!
The issue is currently awaiting triage. Please make sure you have given us as much context as possible.
If the maintainers determine this is a relevant issue, they will remove the needs-triage label and respond appropriately.
We want your feedback! If you have any questions or suggestions regarding our contributing process/workflow, please reach out to us at container.storage.modules@dell.com.
Issue:
The CSI Podmon applies taints to all worker nodes when there is a storage disconnection. However, when storage connectivity is restored, it fails to automatically remove the taints. This behavior significantly delays the restoration of production traffic after a storage disconnection event.
Action Taken:
We opened a Dell support request 206899925 and successfully reproduced the issue in the presence of the support team in the lab environment. We also provided all the necessary logs for further analysis and tested the issue with different versions of the CSI.
Dell’s Support team Recommendations:
After analyzing the issue and consulting with Dell engineering team, the support engineer suggested submitting an enhancement request through the account team. They determined that auto-removal of taints is currently not supported in any version of CSI (refer to the attached email for further details).
Enhancement Request:
We recommend that the CSI Podmon solution should be enhanced to automatically remove taints from all worker nodes as soon as storage connectivity is restored. This improvement is crucial for large environments where more than 300 pods are running at a single site. It will ensure that production traffic is restored immediately after any storage disconnection, minimizing downtime and optimizing efficiency.
The text was updated successfully, but these errors were encountered: