When I compile with -DGGML_CUBLAS=ON, the gpt-neox example still runs only on the CPU.
Answered by
ggerganov
May 25, 2023
Answer selected by
ArturK-85
It's possible. You have to offload the tensors used for matrix multiplication to the GPU.
Something like this:
https://github.com/ggerganov/llama.cpp/blob/905d87b70aa189623d500a28602d7a3a755a4769/llama.cpp#L1030-L1056
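The linked llama.cpp code loops over the model's layers and marks the large weight matrices for GPU offload. A minimal sketch of that pattern, assuming the ggml CUDA API of that era — the `ggml_cuda_transform_tensor` helper, its single-argument signature, and the `offload_layer_weights` wrapper shown here are taken from or modeled on llama.cpp and may differ in the ggml gpt-neox example:

```c
// Sketch only: assumes ggml built with -DGGML_CUBLAS=ON and the
// llama.cpp-era CUDA helpers; names and signatures may differ in the
// ggml gpt-neox example.
#include "ggml.h"
#include "ggml-cuda.h"

// Hypothetical helper: move the matrix-multiplication weights of one
// layer to VRAM so ggml dispatches their mat-muls to the GPU.
static void offload_layer_weights(struct ggml_tensor ** mats, int n) {
    for (int i = 0; i < n; ++i) {
        // Copies the tensor's data to GPU memory and tags it so that
        // subsequent ggml_mul_mat calls involving it run on CUDA.
        ggml_cuda_transform_tensor(mats[i]);
    }
}
```

In llama.cpp this is driven by an `n_gpu_layers` parameter: only the first N layers' weight matrices are offloaded, which lets a model larger than VRAM still run partially on the GPU. The gpt-neox example would need an analogous loop over its own layer tensors.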