-
Notifications
You must be signed in to change notification settings - Fork 101
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update doc on vLLM support #981
Update doc on vLLM support #981
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for updating this! LGTM! Just a nit comment then we should be good to go.
**Step 2: Convert XLFormer checkpoint to HF checkpoint** | ||
|
||
After obtaining the XLFormer checkpoint, the next step is to convert it to the Hugging Face format. Please refer to the official `HF script`_. | ||
Architecture ``--model`` must exist and be defined in `e.g.` :meth:`fairseq2.models.llama.archs.register_archs`. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
per the latest refactoring, we should update the :meth:xxx
to this new one: https://github.com/facebookresearch/fairseq2/blob/main/src/fairseq2/models/llama/_config.py#L105
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you @zyaoj , fixed this.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! We can probably wait a bit for @cbalioglu to land the refactoring then to merge this together with #993.
adf453d
into
facebookresearch:zyaoj/doc-update-sprint-4
What does this PR do? Please describe:
Updates the vLLM part of the end-to-end finetuning tutorial.
Does your PR introduce any breaking changes? If yes, please list them:
No
Check list: