
KeyError: lm_head.weight in GemmaForCausalLM.load_weights when loading finetuned Gemma 2B #3323

Closed
patleeman opened this issue Mar 11, 2024 · 6 comments · Fixed by #3553

Comments

@patleeman

patleeman commented Mar 11, 2024

Hello,

I finetuned Gemma 2B with Unsloth. It uses LoRA and merges the weights back into the base model.
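
For context, the merge step is roughly the standard PEFT merge-and-save flow. The sketch below is only an approximation with placeholder paths, not Unsloth's actual code:

```python
# Rough sketch of the LoRA merge-and-save step (placeholder paths, not
# Unsloth's actual implementation).
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("google/gemma-2b")
model = PeftModel.from_pretrained(base, "path/to/lora-adapter")

# Fold the LoRA deltas into the base weights and drop the adapter modules.
merged = model.merge_and_unload()

# Saving the merged model writes a full checkpoint; depending on how the
# exporter handles the tied output head, lm_head.weight may be written out
# as its own tensor, which is what shows up in the index below.
merged.save_pretrained("gemma-2b-merged")
AutoTokenizer.from_pretrained("google/gemma-2b").save_pretrained("gemma-2b-merged")
```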

When I try to load this model, it gives me the following error:

...
  File "/home/ubuntu/projects/cql-ml/.venv/lib/python3.10/site-packages/vllm/model_executor/model_loader.py", line 86, in get_model
    model.load_weights(model_config.model, model_config.download_dir,
  File "/home/ubuntu/projects/cql-ml/.venv/lib/python3.10/site-packages/vllm/model_executor/models/gemma.py", line 339, in load_weights
    param = params_dict[name]
KeyError: 'lm_head.weight'

My pytorch_model.bin.index.json looks like this:

{
  "metadata": {
    "total_size": 6060920832
  },
  "weight_map": {
    "lm_head.weight": "pytorch_model-00002-of-00002.bin",
    "model.embed_tokens.weight": "pytorch_model-00001-of-00002.bin",
    "model.layers.0.input_layernorm.weight": "pytorch_model-00001-of-00002.bin",
    "model.layers.0.mlp.down_proj.weight": "pytorch_model-00001-of-00002.bin",
    "model.layers.0.mlp.gate_proj.weight": "pytorch_model-00001-of-00002.bin",
...
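
A quick way to sanity-check whether that lm_head.weight entry is just a materialized copy of the tied embedding (a rough sketch using the shard names from the index above):

```python
# Hedged diagnostic: compare the exported lm_head.weight against the embedding
# table. Shard filenames are taken from the index above.
import torch

shard1 = torch.load("pytorch_model-00001-of-00002.bin", map_location="cpu")
shard2 = torch.load("pytorch_model-00002-of-00002.bin", map_location="cpu")

embed = shard1["model.embed_tokens.weight"]
lm_head = shard2["lm_head.weight"]

# Gemma ties the output head to the input embedding, so if the exporter simply
# materialized the tie, the two tensors should be identical.
print(torch.equal(embed, lm_head))
```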

I saw a similar check for lm_head.weight in a few of the other model classes, so I replicated it in Gemma's load_weights, and the model loads correctly and works as intended.

The modified load_weights function:

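Roughly (a sketch of the relevant loop, assuming the vLLM 0.3.x weight-loading helpers; the rest of the function is unchanged):

```python
# Minimal sketch of the change inside GemmaForCausalLM.load_weights, assuming
# the vLLM 0.3.x module layout; the rest of the loop is abbreviated.
from vllm.model_executor.weight_utils import (default_weight_loader,
                                              hf_model_weights_iterator)


def load_weights(self, model_name_or_path, cache_dir=None,
                 load_format="auto", revision=None):
    params_dict = dict(self.named_parameters())
    for name, loaded_weight in hf_model_weights_iterator(
            model_name_or_path, cache_dir, load_format, revision):
        # lm_head is tied to embed_tokens in vLLM's Gemma, so the checkpoint's
        # lm_head.weight has no matching parameter; skip it instead of
        # raising KeyError.
        if "lm_head.weight" in name:
            continue
        ...  # existing stacked-param and RMSNorm handling unchanged
        param = params_dict[name]
        weight_loader = getattr(param, "weight_loader", default_weight_loader)
        weight_loader(param, loaded_weight)
```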

I'm not sure if this is an issue with vllm, or an issue with the output of Unsloth. The model works correctly when load_weights is modified. I don't know what the internals of the model should look like. Any help would be appreciated!

I'm unsure if this is related to #2816

My model is private, so unfortunately I can't share it. However, I found this other model on Hugging Face that was trained with the same tool and has lm_head.weight in its index.

If the modified load_weights function is the desired fix, I'm happy to submit a PR.

Thank you for the help!

@lcw99

lcw99 commented Mar 11, 2024

The same error occurs when using gemma-7b quantized with AutoAWQ.

@KelleyYin

After fine-tuning gemma-7b, the same error occurs.

@SparkJiao

This should be related to PR #3050.

I'm not sure whether adding the two deleted lines back directly would break the loading of LoRA weights.

@WoosukKwon

@uRENu

uRENu commented Mar 19, 2024

After fine-tuning gemma-7b, the same error occurs. +1

@kvikk

kvikk commented Mar 21, 2024

Same here. I used Axolotl and LoRA to finetune gemma-7b. +1
vllm 0.3.3

@patleeman
Author

Thank you @taeminlee for sending the PR!
