[ChatQnA]Update manifests (#716)
* update manifests for v0.9
Zhenzhong1 authored Sep 3, 2024
1 parent 8c40204 commit 3563f5d
Showing 4 changed files with 7 additions and 7 deletions.
2 changes: 1 addition & 1 deletion ChatQnA/benchmark/four_gaudi/chatqna_config_map.yaml
@@ -15,7 +15,7 @@ data:
  TGI_LLM_ENDPOINT: http://llm-dependency-svc.default.svc.cluster.local:9009
  REDIS_URL: redis://vector-db.default.svc.cluster.local:6379
  INDEX_NAME: rag-redis
- HUGGINGFACEHUB_API_TOKEN: {HF_TOKEN}
+ HUGGINGFACEHUB_API_TOKEN: ${HF_TOKEN}
  EMBEDDING_SERVICE_HOST_IP: embedding-svc
  RETRIEVER_SERVICE_HOST_IP: retriever-svc
  RERANK_SERVICE_HOST_IP: reranking-svc
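Note on the placeholder change: the commit does not say how `HF_TOKEN` gets filled in before the ConfigMap is applied. Assuming a shell-style substitution step (for example something `envsubst`-like), only the `${HF_TOKEN}` form would be recognized as a variable. The sketch below uses Python's `string.Template` as a stand-in for that step; the `render` helper and the token value are hypothetical and not part of the repository.

```python
from string import Template
from typing import Mapping


def render(line: str, values: Mapping[str, str]) -> str:
    """Fill shell-style $NAME / ${NAME} placeholders, leaving anything else untouched."""
    return Template(line).safe_substitute(values)


# Hypothetical value, for illustration only.
values = {"HF_TOKEN": "hf_example_token"}

old_line = "HUGGINGFACEHUB_API_TOKEN: {HF_TOKEN}"
new_line = "HUGGINGFACEHUB_API_TOKEN: ${HF_TOKEN}"

# The brace-only form is not a shell-style placeholder, so it passes through as-is.
print(render(old_line, values))
# The ${...} form is recognized and replaced.
print(render(new_line, values))
```

Under that assumption, the old line would have shipped the literal text `{HF_TOKEN}` to the service instead of a real token.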
4 changes: 2 additions & 2 deletions ChatQnA/benchmark/four_gaudi/llm-dependency_run.yaml
@@ -35,9 +35,9 @@ spec:
  - --model-id
  - $(LLM_MODEL_ID)
  - --max-input-length
- - '2048'
+ - '1024'
  - --max-total-tokens
- - '4096'
+ - '2048'
  - --max-batch-total-tokens
  - '65536'
  - --max-batch-prefill-tokens
4 changes: 2 additions & 2 deletions ChatQnA/benchmark/single_gaudi/llm-dependency_run.yaml
@@ -35,9 +35,9 @@ spec:
  - --model-id
  - $(LLM_MODEL_ID)
  - --max-input-length
- - '2048'
+ - '1024'
  - --max-total-tokens
- - '4096'
+ - '2048'
  - --max-batch-total-tokens
  - '65536'
  - --max-batch-prefill-tokens
4 changes: 2 additions & 2 deletions ChatQnA/benchmark/two_gaudi/llm-dependency_run.yaml
@@ -35,9 +35,9 @@ spec:
  - --model-id
  - $(LLM_MODEL_ID)
  - --max-input-length
- - '2048'
+ - '1024'
  - --max-total-tokens
- - '4096'
+ - '2048'
  - --max-batch-total-tokens
  - '65536'
  - --max-batch-prefill-tokens
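Note on the token limits: the same change is applied to all three `llm-dependency_run.yaml` files (`single_gaudi`, `two_gaudi`, `four_gaudi`). The sketch below works through the arithmetic, assuming the usual reading of these flags: `--max-total-tokens` bounds prompt plus generated tokens for one request, and `--max-batch-total-tokens` bounds the tokens held across a batch. The `TokenLimits` class is hypothetical, and the per-batch figure is only a rough upper bound, not a measured result.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenLimits:
    max_input_length: int        # --max-input-length
    max_total_tokens: int        # --max-total-tokens
    max_batch_total_tokens: int  # --max-batch-total-tokens

    def generation_budget(self) -> int:
        """Tokens left for generation when the prompt uses the full input limit."""
        return self.max_total_tokens - self.max_input_length

    def full_length_requests_per_batch(self) -> int:
        """Rough upper bound on maximum-length requests that fit in one batch."""
        return self.max_batch_total_tokens // self.max_total_tokens


before = TokenLimits(max_input_length=2048, max_total_tokens=4096,
                     max_batch_total_tokens=65536)
after = TokenLimits(max_input_length=1024, max_total_tokens=2048,
                    max_batch_total_tokens=65536)

for label, limits in (("before", before), ("after", after)):
    # The input limit has to leave room for at least one generated token.
    assert limits.max_input_length < limits.max_total_tokens
    print(label, limits.generation_budget(), limits.full_length_requests_per_batch())
# prints:
# before 2048 16
# after 1024 32
```

Halving both limits halves the per-request generation budget (2048 to 1024 tokens) while leaving room for roughly twice as many maximum-length requests under the unchanged 65536-token batch budget.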