
Add conversion support for CodeLlama 34B #25737

Closed
wants to merge 1 commit

Conversation

AlpinDale
Contributor

There haven't been any architectural changes, and I was able to convert the 34B simply by adding the model's intermediate size. The converted 34B model appears to work well.

What does this PR do?

This PR should make it possible to convert the CodeLlama models recently released by Meta AI.

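For context, Llama-style conversion scripts (such as `convert_llama_weights_to_hf.py` in transformers) derive the feed-forward (intermediate) size from the hidden size. The sketch below shows that calculation under the assumption that CodeLlama 34B keeps Llama's defaults (`ffn_dim_multiplier=1`, `multiple_of=256`) with a hidden size of 8192; the specific numbers are taken from the released checkpoints, not from this PR.

```python
# Sketch of the intermediate-size calculation used by Llama-style
# conversion scripts. The 34B hidden size of 8192 is an assumption
# based on the released CodeLlama checkpoints.

def compute_intermediate_size(hidden_size: int,
                              ffn_dim_multiplier: float = 1.0,
                              multiple_of: int = 256) -> int:
    """Round (8/3) * hidden_size up to the nearest multiple_of."""
    raw = int(ffn_dim_multiplier * int(8 * hidden_size / 3))
    return multiple_of * ((raw + multiple_of - 1) // multiple_of)

if __name__ == "__main__":
    # CodeLlama 34B: hidden size 8192 -> intermediate size 22016
    print(compute_intermediate_size(8192))  # 22016
    # Llama 7B: hidden size 4096 -> intermediate size 11008
    print(compute_intermediate_size(4096))  # 11008
```

With this mapping in place, the 34B weights convert cleanly because nothing else in the architecture changed.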
@AlpinDale
Contributor Author

This doesn't seem to work for the 7B and 13B variants.

@AlpinDale
Contributor Author

The conversion doesn't account for the modified tokenizer or the new rope theta value. Closing the PR until someone else implements these changes.
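For context on the rope theta issue: "theta" is the base of the rotary position embedding (RoPE) frequencies, and CodeLlama raises it from Llama 2's 10,000 to 1,000,000 to support long contexts. A minimal sketch of how the base affects the inverse frequencies (illustrative only, not code from this PR):

```python
# Illustrative sketch: how the RoPE base ("theta") shapes the inverse
# frequencies. The values 10_000 (Llama 2) and 1_000_000 (CodeLlama's
# rope_theta) come from the released model configs, not from this PR.

def rope_inv_freq(dim: int, theta: float = 10_000.0) -> list[float]:
    """Inverse frequencies for rotary embeddings: theta^(-2i/dim)."""
    return [theta ** (-(2 * i) / dim) for i in range(dim // 2)]

llama_freqs = rope_inv_freq(128, theta=10_000.0)        # Llama 2 default
codellama_freqs = rope_inv_freq(128, theta=1_000_000.0)  # CodeLlama
# A larger theta shrinks the higher-index frequencies, stretching the
# positional wavelengths so longer contexts remain distinguishable.
```

A conversion script that hard-codes the old base would silently produce a model with wrong positional behavior, which is why the PR was closed rather than merged as-is.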

@AlpinDale AlpinDale closed this Aug 24, 2023
@ArthurZucker
Collaborator

See #25740
