Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add QwQ-32B-Preview #1843

Closed
wants to merge 21 commits into from
Closed

Add QwQ-32B-Preview #1843

wants to merge 21 commits into from

Conversation

ysjprojects
Copy link
Contributor

https://qwenlm.github.io/blog/qwq-32b-preview/

Reason for adding:

  • Latest Qwen reasoning model that beats O1
  • Currently it's a 32B preview version, but there will definitely be more QwQ releases in the near future.

Note: QwQ is based off Qwen2.5 architecture.

@Andrei-Aksionov
Copy link
Collaborator

Hey @ysjprojects
Thanks for the contribution 👍

Could you also update parametrized tests for Qwen model in test_model.py and in test_convert_lit_checkpoint.py, so it also includes QwQ.

Also, it looks like you created a branch from the one that you used for adding support for Qwen models.
It would be nice if you can create a branch from the current main and cherry-pick changes to it.

@ysjprojects
Copy link
Contributor Author

Also, it looks like you created a branch from the one that you used for adding support for Qwen models. It would be nice if you can create a branch from the current main and cherry-pick changes to it.

Made a new PR #1844

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants