Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Q8_0 quantization for intermediate results #951

Merged
merged 7 commits into from
Apr 15, 2023
Merged

Add Q8_0 quantization for intermediate results #951

merged 7 commits into from
Apr 15, 2023

Commits on Apr 15, 2023

  1. Configuration menu
    Copy the full SHA
    3b894ec View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    19e7a65 View commit details
    Browse the repository at this point in the history
  3. Q8: use int8_t, AVX/AVX2 optimizations

    sw authored and ggerganov committed Apr 15, 2023
    Configuration menu
    Copy the full SHA
    2c4f9b6 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    312a927 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    3a111ab View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    01de5c5 View commit details
    Browse the repository at this point in the history
  7. ggml : fix q4_1 dot func

    ggerganov committed Apr 15, 2023
    Configuration menu
    Copy the full SHA
    60f27ed View commit details
    Browse the repository at this point in the history