[Question] PandaGPT with llama.cpp #12

Open
ningshanwutuobang opened this issue Jul 1, 2023 · 0 comments
I have tried to use llama.cpp for PandaGPT in panda_gpt_llama_cpp.
The script gets poor performance. Is there anything wrong with the procedure, or is it just a limitation of the model or of the q4_1 precision?

The following are my steps.

  1. Obtain vicuna v0: use FastChat@v0.1.10 to merge llama-13b-hf and vicuna-13b-delta-v0.
  2. Merge the LoRA weights into vicuna v0 (steps 1 and 2 are sketched below).
  3. Convert the merged model to ggml format and quantize it to q4_1 (second sketch below). The result is ggml-pandagpt-vicuna-merge.
  4. Run the script located in panda_gpt_llama_cpp.
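
For reference, steps 1 and 2 look roughly like this. This is only a sketch: all paths are placeholders, and the FastChat flag names may differ between versions.

```python
# Rough sketch of steps 1 and 2; all paths are placeholders.
#
# Step 1: apply the vicuna-13b-delta-v0 delta to llama-13b-hf.
# With FastChat@v0.1.10 this is a CLI call along the lines of:
#   python3 -m fastchat.model.apply_delta \
#       --base ./llama-13b-hf \
#       --target ./vicuna-13b-v0 \
#       --delta lmsys/vicuna-13b-delta-v0

# Step 2: fold the PandaGPT LoRA adapter into the vicuna v0 weights.
import torch
from transformers import LlamaForCausalLM, LlamaTokenizer
from peft import PeftModel

base = LlamaForCausalLM.from_pretrained("./vicuna-13b-v0", torch_dtype=torch.float16)
model = PeftModel.from_pretrained(base, "./pandagpt-lora")  # LoRA adapter directory
model = model.merge_and_unload()  # merge the LoRA deltas into the base weights
model.save_pretrained("./vicuna-13b-v0-pandagpt")

tokenizer = LlamaTokenizer.from_pretrained("./vicuna-13b-v0")
tokenizer.save_pretrained("./vicuna-13b-v0-pandagpt")
```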

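And step 3, assuming a mid-2023 llama.cpp checkout; the convert script name, its output filename, and the quantize type argument vary between llama.cpp versions.

```python
# Rough sketch of step 3, run from the llama.cpp source directory (assumes a
# mid-2023 checkout where convert.py writes ggml-model-f16.bin into the model dir).
import subprocess

# Convert the merged HF checkpoint to a ggml f16 file.
subprocess.run(["python3", "convert.py", "./vicuna-13b-v0-pandagpt"], check=True)

# Quantize the f16 file to q4_1.
subprocess.run(
    [
        "./quantize",
        "./vicuna-13b-v0-pandagpt/ggml-model-f16.bin",
        "./ggml-pandagpt-vicuna-merge.bin",
        "q4_1",
    ],
    check=True,
)
```
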
The model seems to recognize the <Img>...</Img> tags.
