Is it possible to compile gpt-neox with CUDA? #196

Answered by ggerganov
ArturK-85 asked this question in Q&A
Yes, it's possible. You have to offload the tensors used for matrix multiplication to the GPU.
Something like what llama.cpp does here:

https://github.com/ggerganov/llama.cpp/blob/905d87b70aa189623d500a28602d7a3a755a4769/llama.cpp#L1030-L1056
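
To make the idea concrete, here is a minimal, self-contained sketch of the offloading pattern: walk the first N layers, hand each weight matrix that feeds a matrix multiplication to the GPU, and keep a running total of the VRAM used. Everything in it is an assumption for illustration. `Tensor`, `Layer`, the weight field names, `offload_to_gpu`, and `offload_matmul_weights` are hypothetical stand-ins, not the ggml or llama.cpp API, and the "upload" only flips a flag so the sketch compiles without CUDA. In a real build that stub would be replaced by whatever upload helper the CUDA backend provides.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for a ggml tensor: only the fields this sketch needs.
struct Tensor {
    std::string name;
    std::size_t nbytes = 0;  // size of the weight data in bytes
    bool        on_gpu = false;  // true once the data has been "moved" to VRAM
};

// Hypothetical per-layer weights of a GPT-NeoX-style block.
// Only the 2-D weights used in matrix multiplications are listed;
// biases and layer-norm parameters stay on the CPU in this sketch.
struct Layer {
    Tensor attn_qkv_w;   // fused query/key/value projection
    Tensor attn_proj_w;  // attention output projection
    Tensor mlp_fc_w;     // MLP up projection
    Tensor mlp_proj_w;   // MLP down projection
};

// Placeholder for the real device upload (assumption: the CUDA backend
// exposes some call that copies a tensor's data to the GPU).
// Returns the number of bytes the tensor occupies.
static std::size_t offload_to_gpu(Tensor & t) {
    t.on_gpu = true;
    return t.nbytes;
}

// Offload the matmul weights of the first n_gpu_layers layers.
// Requests larger than the layer count are clamped.
// Returns the total number of bytes placed on the GPU.
static std::size_t offload_matmul_weights(std::vector<Layer> & layers, int n_gpu_layers) {
    assert(n_gpu_layers >= 0);
    const std::size_t n_gpu =
        std::min(static_cast<std::size_t>(n_gpu_layers), layers.size());

    std::size_t vram_total = 0;
    for (std::size_t i = 0; i < n_gpu; ++i) {
        Layer & layer = layers[i];
        vram_total += offload_to_gpu(layer.attn_qkv_w);
        vram_total += offload_to_gpu(layer.attn_proj_w);
        vram_total += offload_to_gpu(layer.mlp_fc_w);
        vram_total += offload_to_gpu(layer.mlp_proj_w);
    }
    return vram_total;
}
```

In a model loader this would be called once after the weights are read, for example `offload_matmul_weights(model_layers, n_gpu_layers)`, with the whole step guarded by the build flag that enables the CUDA backend. Offloading per layer lets you trade VRAM for speed by choosing how many layers to move.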

Answer selected by ArturK-85