Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Change horizontal pod autoscaler to use otelcol scale subresource #941

Closed
wants to merge 30 commits into from
Closed
Show file tree
Hide file tree
Changes from 22 commits
Commits
Show all changes
30 commits
Select commit Hold shift + click to select a range
c5f0e1d
Change horizontal pod autoscaler to use otelcol scale subresource
kevinearls Jun 23, 2022
04abf43
Fix name
kevinearls Jun 23, 2022
e6e73da
Change horizontal pod autoscaler to use otelcol scale subresource
kevinearls Jun 23, 2022
6ba5e93
Fix name
kevinearls Jun 23, 2022
23d00f0
Merge branch 'hpa-use-scale-subresource' of github.com:kevinearls/ope…
kevinearls Jun 29, 2022
949a29f
Moved autoscale values into their own spec, added scale up test
kevinearls Jun 29, 2022
f088fea
Added a period to a comment to make the linter stop complaining
kevinearls Jun 29, 2022
34f44f7
Reran make generate after linting
kevinearls Jun 29, 2022
4974844
Fix autoscale test
kevinearls Jun 29, 2022
88575f6
Updated to autoscaling/v2beta2 as v1 is not available in k8s 1.24
kevinearls Jun 30, 2022
1d45856
Install metrics-server on kind clusters for autoscale tests
kevinearls Jul 1, 2022
2149091
Merge branch 'main' into hpa-use-scale-subresource
kevinearls Jul 1, 2022
dc55641
Fix more lint complaints
kevinearls Jul 1, 2022
36d2dda
Only scale up to 5 replicas in test
kevinearls Jul 1, 2022
41b581a
Fix test
kevinearls Jul 1, 2022
c22f6b4
Update test so it will work correctly in CI
kevinearls Jul 4, 2022
b4a72d1
Revert to autoscaling v1
kevinearls Jul 4, 2022
33d9c48
Reduce maxReplicas count to get kuttl tests to work in CI
kevinearls Jul 6, 2022
2f5ade5
Back out addition of AutoScaleSpec to CRD
kevinearls Jul 6, 2022
0bc348b
Fixed api-docs
kevinearls Jul 6, 2022
3ce81aa
Restore accidentally delete test, clean up whitespace
kevinearls Jul 6, 2022
aeb4a97
Document that this script is not needed for minikube
kevinearls Jul 7, 2022
96917a2
Fix scaledown
kevinearls Jul 7, 2022
2ef9467
Update webhook to validate minReplicas when needed
kevinearls Jul 11, 2022
ba0423b
Fix tests
kevinearls Jul 11, 2022
ef318ef
Initial upgrade code for instances with an HPA
kevinearls Jul 13, 2022
27fb26f
Make sure old Deploment based autoscalers get updated
kevinearls Jul 15, 2022
c733b6e
Fix lint error
kevinearls Jul 15, 2022
81aa971
Respond to comments
kevinearls Jul 15, 2022
f3d190b
Fixed api-docs
kevinearls Jul 15, 2022
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 5 additions & 1 deletion Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -152,7 +152,7 @@ e2e:
$(KUTTL) test

.PHONY: prepare-e2e
prepare-e2e: kuttl set-test-image-vars set-image-controller container container-target-allocator start-kind load-image-all
prepare-e2e: kuttl set-test-image-vars set-image-controller container container-target-allocator start-kind install-metrics-server load-image-all
mkdir -p tests/_build/crds tests/_build/manifests
$(KUSTOMIZE) build config/default -o tests/_build/manifests/01-opentelemetry-operator.yaml
$(KUSTOMIZE) build config/crd -o tests/_build/crds/
Expand Down Expand Up @@ -184,6 +184,10 @@ container-target-allocator:
start-kind:
kind create cluster --config $(KIND_CONFIG)

.PHONY: install-metrics-server
install-metrics-server:
./hack/install-metrics-server.sh

.PHONY: load-image-all
load-image-all: load-image-operator load-image-target-allocator

Expand Down
7 changes: 7 additions & 0 deletions hack/install-metrics-server.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,7 @@
#!/bin/bash

# Install metrics-server on kind clusters for autoscale tests. Note: This is not needed for minikube,
# you can just add --addons "metrics-server" to the start command.
kubectl apply -f https://github.com/kubernetes-sigs/metrics-server/releases/latest/download/components.yaml
kubectl patch deployment -n kube-system metrics-server --type "json" -p '[{"op": "add", "path": "/spec/template/spec/containers/0/args/-", "value": --kubelet-insecure-tls}]'
kubectl wait --for=condition=available deployment/metrics-server -n kube-system --timeout=5m
9 changes: 5 additions & 4 deletions pkg/collector/horizontalpodautoscaler.go
Original file line number Diff line number Diff line change
Expand Up @@ -35,17 +35,18 @@ func HorizontalPodAutoscaler(cfg config.Config, logger logr.Logger, otelcol v1al

return autoscalingv1.HorizontalPodAutoscaler{
ObjectMeta: metav1.ObjectMeta{
Name: naming.Collector(otelcol),
Name: naming.HorizontalPodAutoscaler(otelcol),
Namespace: otelcol.Namespace,
Labels: labels,
Annotations: annotations,
},
Spec: autoscalingv1.HorizontalPodAutoscalerSpec{
ScaleTargetRef: autoscalingv1.CrossVersionObjectReference{
APIVersion: "apps/v1",
Kind: "Deployment",
Name: naming.Collector(otelcol),
APIVersion: v1alpha1.GroupVersion.String(),
Kind: "OpenTelemetryCollector",
Name: naming.OpenTelemetryCollector(otelcol),
},

MinReplicas: otelcol.Spec.Replicas,
MaxReplicas: *otelcol.Spec.MaxReplicas,
TargetCPUUtilizationPercentage: &cpuTarget,
Expand Down
5 changes: 0 additions & 5 deletions pkg/collector/reconcile/deployment.go
Original file line number Diff line number Diff line change
Expand Up @@ -99,11 +99,6 @@ func expectedDeployments(ctx context.Context, params Params, expected []appsv1.D
updated.ObjectMeta.Labels[k] = v
}

if params.Instance.Spec.MaxReplicas != nil && desired.Labels["app.kubernetes.io/component"] == "opentelemetry-collector" {
currentReplicas := currentReplicasWithHPA(params.Instance.Spec, existing.Status.Replicas)
updated.Spec.Replicas = &currentReplicas
}

// Selector is an immutable field, if set, we cannot modify it otherwise we will face reconciliation error.
updated.Spec.Selector = existing.Spec.Selector.DeepCopy()

Expand Down
1 change: 1 addition & 0 deletions pkg/collector/reconcile/horizontalpodautoscaler_test.go
Original file line number Diff line number Diff line change
Expand Up @@ -52,6 +52,7 @@ func TestExpectedHPA(t *testing.T) {
maxReplicas := int32(3)
updateParms := paramsWithHPA()
updateParms.Instance.Spec.Replicas = &minReplicas
updateParms.Instance.Spec.Replicas = &minReplicas
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please fix this duplicated code.

updateParms.Instance.Spec.MaxReplicas = &maxReplicas
updatedHPA := collector.HorizontalPodAutoscaler(updateParms.Config, logger, updateParms.Instance)

Expand Down
10 changes: 10 additions & 0 deletions pkg/naming/main.go
Original file line number Diff line number Diff line change
Expand Up @@ -54,6 +54,16 @@ func Collector(otelcol v1alpha1.OpenTelemetryCollector) string {
return DNSName(Truncate("%s-collector", 63, otelcol.Name))
}

// HorizontalPodAutoscaler builds the autoscaler name based on the instance.
func HorizontalPodAutoscaler(otelcol v1alpha1.OpenTelemetryCollector) string {
return DNSName(Truncate("%s-collector", 63, otelcol.Name))
}

// HorizontalPodAutoscaler builds the collector (deployment/daemonset) name based on the instance.
func OpenTelemetryCollector(otelcol v1alpha1.OpenTelemetryCollector) string {
return DNSName(Truncate("%s", 63, otelcol.Name))
}

// TargetAllocator returns the TargetAllocator deployment resource name.
func TargetAllocator(otelcol v1alpha1.OpenTelemetryCollector) string {
return DNSName(Truncate("%s-targetallocator", 63, otelcol.Name))
Expand Down
16 changes: 16 additions & 0 deletions tests/e2e/autoscale/00-assert.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
apiVersion: apps/v1
kind: Deployment
metadata:
name: simplest-collector
status:
readyReplicas: 1

---
apiVersion: autoscaling/v1
kind: HorizontalPodAutoscaler
metadata:
name: simplest-collector
spec:
minReplicas: 1
maxReplicas: 2

32 changes: 32 additions & 0 deletions tests/e2e/autoscale/00-install.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,32 @@
apiVersion: opentelemetry.io/v1alpha1
kind: OpenTelemetryCollector
metadata:
name: simplest
spec:
replicas: 1
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should this field be automatically filled, once the minReplicas is the default lower bound of that instance?

maxReplicas: 2
resources:
limits:
cpu: 500m
memory: 128Mi
requests:
cpu: 5m
memory: 64Mi

config: |
receivers:
otlp:
protocols:
grpc:
http:
processors:

exporters:
logging:

service:
pipelines:
traces:
receivers: [otlp]
processors: []
exporters: [logging]
18 changes: 18 additions & 0 deletions tests/e2e/autoscale/01-assert.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,18 @@
apiVersion: batch/v1
kind: Job
metadata:
name: tracegen
status:
conditions:
- status: "True"
type: Complete

---
apiVersion: opentelemetry.io/v1alpha1
kind: OpenTelemetryCollector

metadata:
name: simplest
status:
scale:
replicas: 2
19 changes: 19 additions & 0 deletions tests/e2e/autoscale/01-install.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,19 @@
apiVersion: batch/v1
kind: Job
metadata:
name: tracegen
spec:
template:
spec:
containers:
- name: tracegen
image: ghcr.io/open-telemetry/opentelemetry-collector-contrib/tracegen:latest
command:
- "./tracegen"
args:
- -otlp-endpoint=simplest-collector-headless:4317
- -otlp-insecure
- -duration=1m
- -workers=20
restartPolicy: Never
backoffLimit: 4