
Have Terraform stop the to-be-destroyed instance first when moving an attached EBS volume #2084

Closed
cnoffsin opened this issue Oct 27, 2017 · 6 comments · Fixed by #21144
Labels
enhancement: Requests to existing resources that expand the functionality or scope.
service/ec2: Issues and PRs that pertain to the ec2 service.
stale: Old or inactive issues managed by automation; if no further action is taken, these will be closed.
Milestone
v3.62.0

Comments

@cnoffsin

When destroying an instance and moving an attached volume to another instance, it would be nice if Terraform sent a Stop to the instance being destroyed first.

Often the volume doesn't detach in time, and I end up manually stopping the old instance and re-running the apply. The old instance is being destroyed anyway.

Terraform Version

10.7 and before

Affected Resource(s)

aws_volume_attachment

Error Output

1 error(s) occurred:

  • module.sre_apps.aws_volume_attachment.az1_collector_data (destroy): 1 error(s) occurred:

  • aws_volume_attachment.az1_collector_data: Error waiting for Volume (vol-0ec7ae950309d165a) to detach from Instance: i-058b7f34f0a545f77

Terraform does not automatically rollback in the face of errors.
Instead, your Terraform state file has been partially updated with
any resources that successfully completed. Please address the error

Expected Behavior

Volume moves to other instance quickly

Actual Behavior

Terraform times out waiting for the volume to detach from the still-running instance, and the apply fails with the error shown above.

Steps to Reproduce

  1. Move an attached EBS volume from one running instance to another by changing instance_id on the aws_volume_attachment, then run terraform apply.
@njam

njam commented Oct 28, 2017

There are a bunch of tickets about this problem:
https://github.com/terraform-providers/terraform-provider-aws/search?q=%22Error+waiting+for+Volume%22&type=Issues
And also hashicorp/terraform#2957, which is closed but I think still relevant.

The problem seems to be that the EBS volume is detached from the EC2 instance while it's still mounted.

Shutting down the EC2 instance first is probably the only sane thing to do in general. I can do that by adding a destroy provisioner to the EBS attachment:

  provisioner "remote-exec" {
    when   = "destroy"
    inline = ["sudo poweroff"]
  }

But there's no way to start the instance again. So when changing the attachment of an EC2 instance from EBS volume A to volume B, I then get this error:

Error waiting for instance (i-07416cee66e784c04) to become ready: unexpected state 'stopped', wanted target 'running'. last error: %!s(<nil>)

I'm not sure how the Terraform provider could handle this. #569 suggests adding a new aws_instance_state resource; maybe that would help.

The best solution would probably be for terraform-aws to stop the instance before detaching and start it again around attaching, both using the AWS APIs.
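
For readers arriving later: the aws_instance_state idea from #569 did eventually ship as the aws_ec2_instance_state resource in the 4.x series of the provider, which manages an instance's power state as its own resource. A minimal sketch, in modern HCL, with placeholder names and AMI, assuming a provider version that includes that resource:

  # Sketch only: manage the power state of an existing instance as a
  # separate resource, so Terraform can stop/start it around other changes.
  resource "aws_instance" "collector" {
    ami           = "ami-00000000000000000" # placeholder AMI
    instance_type = "t3.micro"
  }

  resource "aws_ec2_instance_state" "collector" {
    instance_id = aws_instance.collector.id
    state       = "running" # set to "stopped" to power the instance down
  }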

@duality72

Here's my eventual solution. It only works for us because the instance and volume are both ephemeral and are always created/destroyed together:

resource "aws_volume_attachment" "volume_attachment" {
  device_name  = "${module.common.device_name_label}"
  volume_id    = "${var.volume_id}"
  instance_id  = "${module.instance.id}"

  # Fix for https://github.com/terraform-providers/terraform-provider-aws/issues/2084
  provisioner "remote-exec" {
    inline     = ["sudo poweroff"]
    when       = "destroy"
    on_failure = "continue"

    connection {
      type        = "ssh"
      host        = "${module.instance.private_ip}"
      user        = "${lookup(module.common.linux_user_map, var.os)}"
      private_key = "${file("${var.local_key_file}")}"
      agent       = false
    }
  }

  # Make sure instance has had some time to power down before attempting volume detachment
  provisioner "local-exec" {
    command = "sleep 30"
    when    = "destroy"
  }
}
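
A variant of the same idea that avoids SSH entirely, and so also works for instances Terraform can't reach over the network, is to stop the instance through the AWS API from a local-exec destroy provisioner. A sketch under the assumption that the AWS CLI is installed and credentialed wherever Terraform runs:

resource "aws_volume_attachment" "volume_attachment" {
  device_name = "${module.common.device_name_label}"
  volume_id   = "${var.volume_id}"
  instance_id = "${module.instance.id}"

  # Stop the instance via the AWS API instead of over SSH, then wait
  # until it is fully stopped before Terraform detaches the volume.
  provisioner "local-exec" {
    when       = "destroy"
    on_failure = "continue"
    command    = "aws ec2 stop-instances --instance-ids ${self.instance_id} && aws ec2 wait instance-stopped --instance-ids ${self.instance_id}"
  }
}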

@radeksimko added the enhancement label Nov 14, 2017
@GarrisonD

@duality72 can I use your solution for instances in private subnets?

@radeksimko added the service/ec2 label Jan 28, 2018
@duality72

@GarrisonD Very late reply here, but I don't see why not.

@github-actions

Marking this issue as stale due to inactivity. This helps our maintainers find and focus on the active issues. If this issue receives no comments in the next 30 days it will automatically be closed. Maintainers can also remove the stale label.

If this issue was automatically closed and you feel this issue should be reopened, we encourage creating a new issue linking back to this one for added context. Thank you!

@github-actions bot added the stale label May 21, 2020
@ghost

ghost commented Jul 21, 2020

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.

If you feel this issue should be reopened, we encourage creating a new issue linking back to this one for added context. Thanks!

@ghost locked and limited the conversation to collaborators Jul 21, 2020
YakDriver pushed a commit that referenced this issue Oct 4, 2021
…lume

By stopping the instance, the volume is unmounted inside the instance,
and detaching the volume no longer runs into a timeout

fixes #6673
fixes #2084
fixes #2957
fixes #4770
fixes #288
fixes #1017
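
For the record, the fix referenced above shipped in provider v3.62.0 as an opt-in stop_instance_before_detaching argument on aws_volume_attachment (argument name as I understand it from the provider docs; resource names below are placeholders):

resource "aws_volume_attachment" "az1_collector_data" {
  device_name = "/dev/sdh" # placeholder device name
  volume_id   = aws_ebs_volume.collector_data.id
  instance_id = aws_instance.collector.id

  # With AWS provider >= 3.62.0: stop the instance before detaching so
  # the volume is unmounted cleanly and the detach no longer times out.
  stop_instance_before_detaching = true
}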
@github-actions bot added this to the v3.62.0 milestone Oct 5, 2021