feat(docs): add documentation for HPA-based autoscaling #5935

lc525 · 2024-09-25T13:43:45Z

This describes a solution for scaling both Models and Servers based on HPA for
the case of single-model serving. In the example described in the docs, the
scaling is done based on Model RPS metrics fetched from Prometheus.

Which issue(s) this PR fixes:
Fixes #INFRA-1190 (docs)

This describes a solution for scaling both Models and Servers based on HPA for the case of single-model serving. In the example described in the docs, the scaling is done based on Model RPS metrics fetched from Prometheus.

lc525 · 2024-09-25T14:24:28Z

Reviewed and approved by Sherif here: #5932

New format compatible with GItBook * moved docs out of the source directory and removed spnix-related files * APIs section completed * changing the configuration section in the getting started guide * getting started sectionc completed * rearranged models directory and enhanced different docs * added most images in the dos to the images directory * moved outliers and drift docs to its own file in the root directory * deleted servers directory and moved servers.md to the root directory with enhancements * deleted pipelines dir and moved pipelines.md to the root directory * deleted inference dir and moved inference.md to the root directory * deleted explainers dir and moved explainers.md to the root directory * deleted performance-tests dir and moved .md to the root directory * deleted experiments dir and moved .md to the root directory * updated about section to match gitbook's expected format * updated FAQs section to match gitbook's expected format * updated pandas query section with choice1.yaml * mostly moved and renamed files and directories * updated SUMMARY.md for GitBook * adding additional images * restructured development dir * restructured and reformatted examples dir to match GitBook's md flavor * added gitbook format to metrics dir * restructured k8s directory to match GitBooks expected md flavor * reformatted cli dir * typos and links fixed * typos and links fixed * tentative structured added to the root of the docs * fixed names in kubernetes section * GITBOOK-1: changed hard-coded reference to scheduler.proto * added reference to chainer.proto instead of hard-coded version * removed hard-coded references and added GitHub Gist pointing to v2 branch * fixed format and broken links feat(docs): adding a mention of per component labels and annotations to the docs (SeldonIO#5931) feat(docs): add documentation for HPA-based autoscaling (SeldonIO#5935) This describes a solution for scaling both Models and Servers based on HPA for the case of single-model serving. In the example described in the docs, the scaling is done based on Model RPS metrics fetched from Prometheus.

This describes a solution for scaling both Models and Servers based on HPA for the case of single-model serving. In the example described in the docs, the scaling is done based on Model RPS metrics fetched from Prometheus.

feat(docs): add documentation for HPA-based autoscaling

8669fe0

This describes a solution for scaling both Models and Servers based on HPA for the case of single-model serving. In the example described in the docs, the scaling is done based on Model RPS metrics fetched from Prometheus.

lc525 requested a review from sakoush as a code owner September 25, 2024 13:43

lc525 added the v2 label Sep 25, 2024

lc525 merged commit 8db956d into SeldonIO:v2-docs-master Sep 25, 2024
2 of 3 checks passed

lc525 deleted the INFRA-1190/hpa-rps-docs branch September 25, 2024 14:24

Rajakavitha1 mentioned this pull request Sep 26, 2024

seldon core v2: improved autoscaling #5096

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(docs): add documentation for HPA-based autoscaling #5935

feat(docs): add documentation for HPA-based autoscaling #5935

lc525 commented Sep 25, 2024

lc525 commented Sep 25, 2024

feat(docs): add documentation for HPA-based autoscaling #5935

feat(docs): add documentation for HPA-based autoscaling #5935

Conversation

lc525 commented Sep 25, 2024

lc525 commented Sep 25, 2024