remove vllm-check/tgi-check init-container #1605

robinliubin · 2024-05-08T16:06:36Z

add wget vllm into h2ogpt container

fixing: #1603

…tainer

helm/h2ogpt-chart/templates/deployment.yaml

joby-h20 · 2024-05-08T17:19:04Z

Tested by injecting the changes directly to deployment manifest.

https://h2oai.slack.com/archives/C06R0QAP54N/p1715188293152829?thread_ts=1715178088.028339&cid=C06R0QAP54N

joby-h20

lgtm

achraf-mer

added one comment about using a list or arguments, but if tested and works fine then LGTM, thanks.

EshamAaqib

LGTM

ozahavi · 2024-05-08T19:22:16Z

This change may cause issues if we ever want to introduce liveness\readiness probes to h2oGPT.

EshamAaqib · 2024-05-08T19:48:29Z

^ +1, If the inference service is vLLM and if Model Lock is used the check will not be needed, it can be disabled completely

robinliubin · 2024-05-08T19:49:23Z

This change may cause issues if we ever want to introduce liveness\readiness probes to h2oGPT.

thanks @ozahavi, would you elaborate more about moving wget into h2ogpt container would be an issue for liveness/readiness?

robinliubin · 2024-05-08T19:55:01Z

^ +1, If the inference service is vLLM and if Model Lock is used the check will not be needed, it can be disabled completely

like this:

{{- if and .Values.vllm.enabled (not .Values.h2ogpt.externalLLM.enabled) }}

EshamAaqib · 2024-05-08T19:59:00Z

^ +1, If the inference service is vLLM and if Model Lock is used the check will not be needed, it can be disabled completely

like this:
{{- if and .Values.vllm.enabled (not .Values.h2ogpt.externalLLM.enabled) }}

Yes, and also you can stick with Model Lock itself for locally hosted LLMs and if the inference is vLLM the wget check is not needed (if you use Model Lock)

Ex: https://github.com/h2oai/public-cloud-infrastructure/blob/develop/hamc/hadc-customer-account/terraform/environment/applications/main-h2ogpt.tf#L5

robinliubin · 2024-05-08T20:05:58Z

^ +1, If the inference service is vLLM and if Model Lock is used the check will not be needed, it can be disabled completely

like this:
{{- if and .Values.vllm.enabled (not .Values.h2ogpt.externalLLM.enabled) }}
Yes, and also you can stick with Model Lock itself for locally hosted LLMs and if the inference is vLLM the wget check is not needed (if you use Model Lock)

Ex: https://github.com/h2oai/public-cloud-infrastructure/blob/develop/hamc/hadc-customer-account/terraform/environment/applications/main-h2ogpt.tf#L5

like this:

{{- if and .Values.vllm.enabled (not .Values.h2ogpt.externalLLM.modelLock) }}

robinliubin · 2024-05-08T20:07:44Z

tested with: helm template h2oai helm/h2ogpt-chart --set vllm.enabled=true --set h2ogpt.externalLLM.modelLocak=null
it rendered to:

       args:
          - >
            until wget -O- http://h2oai-h2ogpt-vllm-inference:5000/v1/models >/dev/null 2>&1;
              do
                echo "Waiting for inference service to become ready...";
                sleep 5;
              done
              
            python3 /workspace/generate.py

EshamAaqib · 2024-05-08T20:13:05Z

^ +1, If the inference service is vLLM and if Model Lock is used the check will not be needed, it can be disabled completely

like this:
{{- if and .Values.vllm.enabled (not .Values.h2ogpt.externalLLM.enabled) }}
Yes, and also you can stick with Model Lock itself for locally hosted LLMs and if the inference is vLLM the wget check is not needed (if you use Model Lock)

Ex: https://github.com/h2oai/public-cloud-infrastructure/blob/develop/hamc/hadc-customer-account/terraform/environment/applications/main-h2ogpt.tf#L5

like this:
{{- if and .Values.vllm.enabled (not .Values.h2ogpt.externalLLM.modelLock) }}

Yup lgtm

remove vllm-check, tgi-check init-container, add wget into h2ogpt con…

d3f9f8d

…tainer

robinliubin requested review from achraf-mer and EshamAaqib May 8, 2024 16:07

robinliubin linked an issue May 8, 2024 that may be closed by this pull request

h2ogpt vllm-check init-container stuck when istio injection #1603

Closed

robinliubin requested a review from joby-h20 May 8, 2024 16:13

achraf-mer reviewed May 8, 2024

View reviewed changes

helm/h2ogpt-chart/templates/deployment.yaml Show resolved Hide resolved

fix typo

c21db6b

joby-h20 approved these changes May 8, 2024

View reviewed changes

achraf-mer approved these changes May 8, 2024

View reviewed changes

This comment was marked as resolved.

Sign in to view

EshamAaqib requested review from ozahavi and EshamAaqib May 8, 2024 19:15

EshamAaqib approved these changes May 8, 2024

View reviewed changes

skip wget if using modelLock

653eb26

robinliubin merged commit fbafbe4 into main May 8, 2024
2 checks passed

robinliubin deleted the robin/fix_vllm_check_when_istio branch May 8, 2024 20:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

remove vllm-check/tgi-check init-container #1605

remove vllm-check/tgi-check init-container #1605

robinliubin commented May 8, 2024

joby-h20 commented May 8, 2024

joby-h20 left a comment

achraf-mer left a comment

This comment was marked as resolved.

EshamAaqib left a comment

ozahavi commented May 8, 2024

EshamAaqib commented May 8, 2024 •

edited

Loading

robinliubin commented May 8, 2024

robinliubin commented May 8, 2024 •

edited

Loading

EshamAaqib commented May 8, 2024 •

edited

Loading

robinliubin commented May 8, 2024

robinliubin commented May 8, 2024

EshamAaqib commented May 8, 2024

remove vllm-check/tgi-check init-container #1605

remove vllm-check/tgi-check init-container #1605

Conversation

robinliubin commented May 8, 2024

joby-h20 commented May 8, 2024

joby-h20 left a comment

Choose a reason for hiding this comment

achraf-mer left a comment

Choose a reason for hiding this comment

This comment was marked as resolved.

EshamAaqib left a comment

Choose a reason for hiding this comment

ozahavi commented May 8, 2024

EshamAaqib commented May 8, 2024 • edited Loading

robinliubin commented May 8, 2024

robinliubin commented May 8, 2024 • edited Loading

EshamAaqib commented May 8, 2024 • edited Loading

robinliubin commented May 8, 2024

robinliubin commented May 8, 2024

EshamAaqib commented May 8, 2024

EshamAaqib commented May 8, 2024 •

edited

Loading

robinliubin commented May 8, 2024 •

edited

Loading

EshamAaqib commented May 8, 2024 •

edited

Loading