-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remedy Azure disk ConflictingUserInput errors #1
Comments
@stoyanr, Jun 15 I tried to reproduce the issue and found that it's already fixed when using the in-tree volume attacher:
After about 2 minutes, the disk should automatically be deattached from the other node and attached to the correct one, and the pod should start. Manually deattaching the pod from the other node would also work, but is not necessary. The fix came with kubernetes/kubernetes#81266 and should be available in all releases 1.16 onwards, and also in patches for older releases. It came as a fix for kubernetes/kubernetes#81079. It seems however that the fix is still not part of the Azure CSI driver, so if we build an Azure cluster using the CSI driver (disabled by default, @d062553 do you know how to enable?), the original issue should still be reproducible. The relevant code can be found in https://github.com/kubernetes-sigs/azuredisk-csi-driver/blob/master/pkg/azuredisk/controllerserver.go, and there is also this issue: kubernetes-sigs/azuredisk-csi-driver#288 and it can't be implemented because of kubernetes/kubernetes#80488. In any case, I would suggest not to proactively try to solve this issue in the remedy controller, but instead push for a resolution in the CSI driver. @stoyanr, Jun 16 The issue is definitely reproducible when the Azure CSI driver is enabled. The error we are getting is different from the description in the ticket, due to the change introduced with kubernetes/kubernetes#81266:
However, with the CSI driver the automatic deattachment of the disk from the other node doesn't happen. Instead, the pod remains in @stoyanr, Jun 16 The same issue is also reproducible on GCP when the CSI driver is enabled (the default with 1.18). The error is the following:
@stoyanr, Jun 19 Short summary from meeting on 19.06:
|
As described above, we will not implement a remedy for this issue. Closing this epic. |
Issue
Detect Azure disks which can't be attached because they are still attached to another vm on the infrastructure.
Proposed solution
2.1. If Field
reason
equalsFailedAttachVolume
and FieldCode
equalsConflictingUserInput
2.2 Then read Field
message
and extract the disk uriEvent example
Tasks
The text was updated successfully, but these errors were encountered: