Skip to content

Commit

Permalink
ggml : add Q8_0 quantization for intermediate results (#951)
Browse files Browse the repository at this point in the history
* ggml : add Q8_0 quantization for intermediate results

* quantize-stats : fix test + add it to Makefile default

* Q8: use int8_t, AVX/AVX2 optimizations

* ggml : fix quantize_row_q8_0() ARM_NEON rounding

* minor : updates after rebase to latest master

* quantize-stats : delete obsolete strings

* ggml : fix q4_1 dot func

---------

Co-authored-by: Stephan Walter <stephan@walter.name>
  • Loading branch information
ggerganov and sw authored Apr 15, 2023
1 parent aa485ce commit e95b655
Show file tree
Hide file tree
Showing 3 changed files with 442 additions and 18 deletions.
2 changes: 1 addition & 1 deletion Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -133,7 +133,7 @@ $(info I CC: $(CCV))
$(info I CXX: $(CXXV))
$(info )

default: main quantize perplexity embedding
default: main quantize quantize-stats perplexity embedding

#
# Build library
Expand Down
Loading

0 comments on commit e95b655

Please sign in to comment.