Moved to Sampling v2 (along with all other relevant commits from llama.cpp) and reworked message regeneration.
- DRY is not reimplemented yet
- The naming scheme now reflects not only the backend, but also the use of llamafile, OpenMP, and OpenBLAS