Describe the bug
ModuleNotFoundError: No module named 'llama_inference_offload'

I searched, and someone had the same problem before, but they were using an AMD graphics card, while mine is an Nvidia 3080.
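For context, llama_inference_offload is not a package installed from PyPI; it comes from the GPTQ-for-LLaMa repository, which the webui imports from a local checkout. A minimal sketch of what the failing import at GPTQ_loader.py line 14 appears to rely on (the repositories/GPTQ-for-LLaMa path is an assumption about the standard layout, not something the traceback confirms):

import sys
from pathlib import Path

# Assumed layout: text-generation-webui/repositories/GPTQ-for-LLaMa must
# exist and contain llama_inference_offload.py for the import to succeed.
sys.path.insert(0, str(Path("repositories/GPTQ-for-LLaMa")))

import llama_inference_offload  # raises ModuleNotFoundError if the checkout is missing

If that assumption holds, the error usually means the checkout is missing or empty rather than anything GPU-specific.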
Is there an existing issue for this?
I have searched the existing issues
Reproduction
My graphics card is an Nvidia 3080. In a conda environment with PyTorch/CUDA, run:
pip install -r requirements.txt
Then in this repository:
pip install -e .

The install completes, but loading the model still fails with the same error.
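A quick way to check whether the module is visible to the environment at all, before launching the server (a minimal sketch using only the module name from the traceback):

import importlib.util

# find_spec returns None when Python cannot locate the module on sys.path,
# which is the condition that raises the ModuleNotFoundError above.
spec = importlib.util.find_spec("llama_inference_offload")
print("importable" if spec is not None else "not importable")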
Screenshot
No response
Logs
Welcome to bitsandbytes. For bug reports, please submit your error trace to: https://github.com/TimDettmers/bitsandbytes/issues
================================================================================
CUDA SETUP: CUDA runtime path found: D:\GPT-fast\oobabooga-windows\oobabooga-windows\installer_files\env\bin\cudart64_110.dll
CUDA SETUP: Highest compute capability among GPUs detected: 8.6
CUDA SETUP: Detected CUDA version 117
CUDA SETUP: Loading binary D:\GPT-fast\oobabooga-windows\oobabooga-windows\installer_files\env\lib\site-packages\bitsandbytes\libbitsandbytes_cuda117.dll...
The following models are available:
1. opt-6.7b
2. vicuna-13b-GPTQ-4bit-128g
Which one do you want to load? 1-2
2
Loading vicuna-13b-GPTQ-4bit-128g...
Traceback (most recent call last):
File "D:\GPT-fast\oobabooga-windows\oobabooga-windows\text-generation-webui\server.py", line 293, in<module>
shared.model, shared.tokenizer = load_model(shared.model_name)
File "D:\GPT-fast\oobabooga-windows\oobabooga-windows\text-generation-webui\modules\models.py", line 100, in load_model
from modules.GPTQ_loader import load_quantized
File "D:\GPT-fast\oobabooga-windows\oobabooga-windows\text-generation-webui\modules\GPTQ_loader.py", line 14, in<module>
import llama_inference_offload
ModuleNotFoundError: No module named 'llama_inference_offload'
System Info
Nvidia 3080
I resolved this problem in my installation by adding llama-cpp-python==0.1.23 to requirements.txt and then running the install.bat one-click installer.
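For anyone applying the same workaround, the change is one pinned line appended to the one-click installer's requirements.txt (a sketch; the rest of the file stays as shipped):

llama-cpp-python==0.1.23

Then re-run install.bat so the pinned package is installed into the bundled environment.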