Restrict prometheus_client >= 0.18.0 to prevent errors when importing pkgs #3070
…etrics from prometheus_client failed.
Hi @AllenDou, when I do a
@grandiose-pizza Show me your startup command.
My entrypoint file
It's loading a model from disk. app.routes.text_generation_vllm is the same as api_server.py, except that the chat template doesn't exist and I instead use a custom
@grandiose-pizza Have you registered vLLM-related metrics with the Prometheus registry? Please refer to the Metrics class, e.g. self.gauge_scheduler_running = Gauge(),
I didn't change anything in the make_asgi_app() function. It's using the defaults provided by vLLM. Do I need to add something specifically if I am using a new model? I am using the Jais models.
@grandiose-pizza The type of model and the metrics are unrelated; Jais is fine. Can I see your code?
I have folded my imports into this single file, as I can't upload the files. I am using vllm==0.4.0. As you can see, the only change is that I use
@grandiose-pizza The problem is '--engine-use-ray'. Let me find out why.
Hi @AllenDou, this worked. Thanks a ton. Is this a bug? Should we create a PR to fix this for '--engine-use-ray'? I am not sure how much of a performance difference this parameter makes.
Traceback (most recent call last):
  File "/usr/lib/python3.10/runpy.py", line 187, in _run_module_as_main
    mod_name, mod_spec, code = _get_module_details(mod_name, _Error)
  File "/usr/lib/python3.10/runpy.py", line 110, in _get_module_details
    __import__(pkg_name)
  File "/root/vllm/vllm/__init__.py", line 4, in <module>
    from vllm.engine.async_llm_engine import AsyncLLMEngine
  File "/root/vllm/vllm/engine/async_llm_engine.py", line 10, in <module>
    from vllm.engine.llm_engine import LLMEngine
  File "/root/vllm/vllm/engine/llm_engine.py", line 14, in <module>
    from vllm.engine.metrics import StatLogger, Stats
  File "/root/vllm/vllm/engine/metrics.py", line 2, in <module>
    from prometheus_client import Counter, Gauge, Histogram, REGISTRY, disable_created_metrics
ImportError: cannot import name 'disable_created_metrics' from 'prometheus_client' (/usr/local/lib/python3.10/dist-packages/prometheus_client/__init__.py)
Restrict prometheus_client to >= 0.18.0 to prevent the ImportError above when importing vllm.
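To check an environment for the same problem before importing vllm, a startup check along the following lines may help. This is only a sketch and not part of this PR: the helper names `parse_version`, `meets_minimum`, and `check_prometheus_client` are hypothetical, the version parsing is deliberately simplistic (it ignores pre-release suffixes), and it assumes the package is installed under the distribution name `prometheus_client`. The only value taken from this PR is the 0.18.0 minimum.

```python
from importlib import metadata

# Minimum version named in this PR.
MIN_PROMETHEUS_CLIENT = (0, 18, 0)


def parse_version(version: str) -> tuple:
    """Turn '0.18.0' into (0, 18, 0); non-numeric suffixes such as 'rc1' are ignored."""
    parts = []
    for piece in version.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits) if digits else 0)
    # Pad short versions such as '0.18' to three components.
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def meets_minimum(installed: str, minimum: tuple = MIN_PROMETHEUS_CLIENT) -> bool:
    """Return True if the installed version string is at least the minimum."""
    return parse_version(installed) >= minimum


def check_prometheus_client() -> None:
    """Raise a readable error instead of the ImportError shown in the traceback."""
    try:
        installed = metadata.version("prometheus_client")
    except metadata.PackageNotFoundError:
        raise RuntimeError("prometheus_client is not installed")
    if not meets_minimum(installed):
        raise RuntimeError(
            f"prometheus_client {installed} is too old; "
            "this PR requires >= 0.18.0"
        )
```

Calling `check_prometheus_client()` before `import vllm` would surface the version mismatch with an explicit message. In practice, pinning `prometheus_client >= 0.18.0` in the requirements, as this PR does, makes such a check unnecessary.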