Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[GPU] Fix per-token dynamic quantization #27332

Merged

Conversation

sshlyapn
Copy link
Contributor

Details:

  • Allow the DynamicQuantizeKernelOpt kernel to be selected with the default scales order
  • Relax DynamicQuantizeKernelRef kernel validation function

@sshlyapn sshlyapn added category: GPU OpenVINO GPU plugin Code Freeze labels Oct 30, 2024
@sshlyapn sshlyapn added this to the 2024.5 milestone Oct 30, 2024
@sshlyapn sshlyapn requested review from a team as code owners October 30, 2024 09:28
@isanghao isanghao added this pull request to the merge queue Oct 31, 2024
Merged via the queue into openvinotoolkit:master with commit 9ec63be Oct 31, 2024
150 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: GPU OpenVINO GPU plugin Code Freeze
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants