Forked from https://github.com/mumoshu/kube-spot-termination-notice-handler
This fork adds some small changes, such as a cpu_usage function that pulls the requested CPU and memory from spot nodes and injects them into CloudWatch; those metrics can later be used by an autoscaler (my recommendation is Spot Fleet Advisor).
With those metrics, you are able to run "over-provisioned", which is important for a spot instance setup.
A new variable, NODE_GROUP_NAME, was added to segregate CPU and memory usage by Auto Scaling Group and/or Spot Fleet.
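As a rough sketch, this kind of metric push can be done with the AWS CLI; the metric namespace, metric name, and dimension key below are illustrative assumptions, not necessarily what the cpu_usage function actually uses:

```sh
# Hypothetical sketch: publish the node group's requested CPU to CloudWatch.
# The namespace, metric name, and dimension key are assumptions for illustration.
aws cloudwatch put-metric-data \
  --namespace "Kubernetes/SpotNodes" \
  --metric-name "RequestedCPU" \
  --dimensions NodeGroupName="$NODE_GROUP_NAME" \
  --value "$REQUESTED_CPU" \
  --unit Count
```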
A Kubernetes DaemonSet that runs one container per node to periodically poll the EC2 Spot Instance Termination Notices endpoint. Once a termination notice is received, it will try to gracefully stop all the pods running on the Kubernetes node, up to 2 minutes before the EC2 Spot Instance backing the node is terminated.
A Helm chart has been created for this tool; at the time of writing it was in the incubator repository.
$ helm install incubator/kube-spot-termination-notice-handler
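For instance, the chart can be installed into kube-system (where the logs example below expects it); the release name is arbitrary, and any chart-specific values should be checked against the chart's values.yaml:

```sh
# Sketch using Helm 2 syntax; the release name is arbitrary.
helm install incubator/kube-spot-termination-notice-handler \
  --name spot-termination-notice-handler \
  --namespace kube-system
```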
Tags denote Kubernetes/kubectl versions. Using the same version for your Kubernetes cluster and spot-termination-notice-handler is recommended. Note that the -1 suffix (or similar) is the revision of this tool, in case we need versioning.
egeland/spot-termination-notice-handler:1.6.8-1
egeland/spot-termination-notice-handler:1.7.8-1
egeland/spot-termination-notice-handler:1.8.1-1
- So that your Kubernetes jobs backed by spot instances can keep running on other instances (typically on-demand instances)
Each spot-termination-notice-handler pod polls the notice endpoint until it returns an HTTP 200 status. That status means a termination is scheduled for the EC2 spot instance running the handler pod (according to my study).
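A minimal sketch of that polling loop, assuming a shell with curl and kubectl available in the container and the node name injected as NODE_NAME (the actual handler script may differ in its details):

```sh
# Minimal sketch of the handler loop; the real script in this image may differ.
NOTICE_URL="http://169.254.169.254/latest/meta-data/spot/termination-time"

while true; do
  STATUS=$(curl -s -o /dev/null -w '%{http_code}' "$NOTICE_URL")
  echo "$(date): $STATUS"
  if [ "$STATUS" = "200" ]; then
    # Termination scheduled: drain this node so pods reschedule elsewhere.
    kubectl drain "$NODE_NAME" --ignore-daemonsets --force
    break
  fi
  sleep 5
done
```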
Run `kubectl logs` against the handler pod to watch how it works.
$ kubectl logs --namespace kube-system spot-termination-notice-handler-ibyo6
This script polls the "EC2 Spot Instance Termination Notices" endpoint to gracefully stop and then reschedule all the pods running on this Kubernetes node, up to 2 minutes before the EC2 Spot Instance backing the node is terminated.
See https://aws.amazon.com/jp/blogs/aws/new-ec2-spot-instance-termination-notices/ for more information.
`kubectl drain minikubevm` will be executed once a termination notice is made.
Polling http://169.254.169.254/latest/meta-data/spot/termination-time every 5 second(s)
Fri Jul 29 07:38:59 UTC 2016: 404
Fri Jul 29 07:39:04 UTC 2016: 404
Fri Jul 29 07:39:09 UTC 2016: 404
Fri Jul 29 07:39:14 UTC 2016: 404
...
Fri Jul 29 hh:mm:ss UTC 2016: 200
Run `KUBE_VERSION=<your desired k8s version> make build` to specify the version number of k8s/kubectl.
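For example, building an image matching the 1.8.1 tags listed above might look like this (assuming the Makefile reads KUBE_VERSION as described):

```sh
# Example: build against kubectl 1.8.1, matching the egeland/...:1.8.1-1 tag pattern.
KUBE_VERSION=1.8.1 make build
```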
Introduced in version 0.9.2 of this application (the @mumoshu version), you can set up a Slack incoming webhook in order to send notifications to a channel, notifying users that an instance has been terminated.
Incoming WebHooks require that you set the SLACK_URL environment variable as part of your PodSpec.
The URL should look something like: https://hooks.slack.com/services/T67UBFNHQ/B4Q7WQM52/1ctEoFjkjdjwsa22934
Slack Setup:
- Docs: https://api.slack.com/incoming-webhooks
- Setup: https://slack.com/apps/A0F7XDUAZ-incoming-webhooks
Show where things are happening by setting the CLUSTER environment variable to whatever you call your cluster.
Very handy if you have several clusters that report to the same Slack channel.
Example Pod Spec:
env:
  - name: POD_NAME
    valueFrom:
      fieldRef:
        fieldPath: metadata.name
  - name: NAMESPACE
    valueFrom:
      fieldRef:
        fieldPath: metadata.namespace
  - name: SLACK_URL
    value: "https://hooks.slack.com/services/T67UBFNHQ/B4Q7WQM52/1ctEoFjkjdjwsa22934"
  - name: CLUSTER
    value: development
  - name: NODE_GROUP_NAME  # new in this fork
    value: "<name of the ASG or Spot Fleet node group>"
This was forked from @mumoshu's original, and enhanced/simplified.