-
-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug]: vLLM throws error when sampling from Cerebras GPT Models #11224
Comments
|
Although HF does keep mappings from |
To solve this, I suggest you pass the architecture name explicitly via |
@DarkLight1337 could you please explain how to modify my above minimal example? When I try something like:
I hit the error: `TypeError: EngineArgs.init() got an unexpected keyword argument 'hf_overrides' |
What is your vLLM version? You might have to update it for this to be supported. |
0.5.4 |
Yeah, pretty sure you need to update vLLM. |
Is there any guarantee of backwards consistency? I've been generating data for a couple months and I need to make sure there's no distribution shift if I change the vllm version |
|
I've updated to |
I'm currently trying:
But this throws:
|
To ask a related but separate follow up question, when I try:
I receive the following error: Since I believe all of the Cerebras models are based on GPT2, what would you advise? |
The "architecture" field should be class name of the model that's implemented in vLLM. In this case, it should be |
Can you please provide a correctly functioning minimal working example? |
throws the error:
|
Also, the key should be "architectures" (plural) and you need to pass a list to it. It is basically the same format as HF |
Can you please give a functioning minimal working example? |
|
I'm currently testing this. If it works, I'll close the issue. |
Your current environment
The output of `python collect_env.py`
Model Input Dumps
No response
🐛 Describe the bug
vLLM throws an error when attempting to use Cerebras's models. Here is a minimal reproduction:
The error is:
TypeError: 'NoneType' object is not iterable
It arises here:
Before submitting a new issue...
The text was updated successfully, but these errors were encountered: