From 10b931fb6b3d59f3cc743b66d9da9545c44697bc Mon Sep 17 00:00:00 2001 From: syaseen-rh Date: Fri, 14 Jun 2024 09:50:02 -0400 Subject: [PATCH] minor fixes --- .../deploying-models-on-the-single-model-serving-platform.adoc | 2 +- .../viewing-metrics-for-the-single-model-serving-platform.adoc | 1 + 2 files changed, 2 insertions(+), 1 deletion(-) diff --git a/modules/deploying-models-on-the-single-model-serving-platform.adoc b/modules/deploying-models-on-the-single-model-serving-platform.adoc index 557b0fc8..e6eaa315 100644 --- a/modules/deploying-models-on-the-single-model-serving-platform.adoc +++ b/modules/deploying-models-on-the-single-model-serving-platform.adoc @@ -30,7 +30,7 @@ endif::[] * To use the Caikit-TGIS runtime, you have converted your model to Caikit format. For an example, see link:https://github.com/opendatahub-io/caikit-tgis-serving/blob/main/demo/kserve/built-tip.md#bootstrap-process[Converting Hugging Face Hub models to Caikit format^] in the link:https://github.com/opendatahub-io/caikit-tgis-serving/tree/main[caikit-tgis-serving^] repository. ifndef::upstream[] * If you want to use graphics processing units (GPUs) with your model server, you have enabled GPU support in {productname-short}. See link:{rhoaidocshome}{default-format-url}/managing_resources/managing-cluster-resources_cluster-mgmt#enabling-gpu-support_cluster-mgmt[Enabling GPU support in {productname-short}^]. -* To use the vLLM runtime, you have enabled GPU support in {productname-short} and have installed and configured the Node Feature Discovery operator on your cluster. For more information, see link:https://docs.openshift.com/container-platform/{ocp-latest-version}/hardware_enablement/psap-node-feature-discovery-operator.html#installing-the-node-feature-discovery-operator_node-feature-discovery-operator[^Installing the Node Feature Discovery operator] and link:{rhoaidocshome}{default-format-url}/managing_resources/managing-cluster-resources_cluster-mgmt#enabling-gpu-support_cluster-mgmt[Enabling GPU support in {productname-short}^] +* To use the vLLM runtime, you have enabled GPU support in {productname-short} and have installed and configured the Node Feature Discovery operator on your cluster. For more information, see link:https://docs.openshift.com/container-platform/{ocp-latest-version}/hardware_enablement/psap-node-feature-discovery-operator.html#installing-the-node-feature-discovery-operator_node-feature-discovery-operator[Installing the Node Feature Discovery operator] and link:{rhoaidocshome}{default-format-url}/managing_resources/managing-cluster-resources_cluster-mgmt#enabling-gpu-support_cluster-mgmt[Enabling GPU support in {productname-short}^] endif::[] ifdef::upstream[] * To use the vLLM runtime or use graphics processing units (GPUs) with your model server, you have enabled GPU support. This includes installing the Node Feature Discovery and GPU Operators. For more information, see https://docs.nvidia.com/datacenter/cloud-native/openshift/latest/index.html[NVIDIA GPU Operator on {org-name} OpenShift Container Platform^] in the NVIDIA documentation. diff --git a/modules/viewing-metrics-for-the-single-model-serving-platform.adoc b/modules/viewing-metrics-for-the-single-model-serving-platform.adoc index 288b80e9..f1a3f7f8 100644 --- a/modules/viewing-metrics-for-the-single-model-serving-platform.adoc +++ b/modules/viewing-metrics-for-the-single-model-serving-platform.adoc @@ -72,6 +72,7 @@ sum(increase(tgi_request_success{namespace=${namespace}, pod=~'${model_name}-pre [source,subs="+quotes"] ---- sum(increase(ovms_requests_success{namespace='${namespace}',name='${model_name}'}[${rate_interval}])) +---- endif::[]