Unable to load trained model #8

YukinoshitaKaren · 2024-11-26T11:24:46Z

I have tried to train llama2-7b-chat-hf, and I got 6 safetensors. But when I tried to load them using AutoModelForCausalLM.from_pretrained, something went wrong:

RuntimeError: Error(s) in loading state_dict for LlamaForCausalLM:
        size mismatch for model.embed_tokens.weight: copying a param with shape torch.Size([65537024]) from checkpoint, the shape in current model is torch.Size([32000, 4096]).
        size mismatch for model.norm.weight: copying a param with shape torch.Size([0]) from checkpoint, the shape in current model is torch.Size([4096]).
        size mismatch for lm_head.weight: copying a param with shape torch.Size([0]) from checkpoint, the shape in current model is torch.Size([32000, 4096]).
        You may consider adding `ignore_mismatched_sizes=True` in the model `from_pretrained` method.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unable to load trained model #8

Unable to load trained model #8

YukinoshitaKaren commented Nov 26, 2024

Unable to load trained model #8

Unable to load trained model #8

Comments

YukinoshitaKaren commented Nov 26, 2024