make llama3.1_8b use SanctumAI weights rather than attempt local irpa
renxida committed Jan 30, 2025
1 parent 99b2c5a commit 6de4446
Showing 1 changed file with 3 additions and 5 deletions.
8 changes: 3 additions & 5 deletions app_tests/integration_tests/llm/shortfin/conftest.py
@@ -26,11 +26,9 @@
         device_settings=device_settings.CPU,
     ),
     "llama3.1_8b": ModelConfig(
-        source=ModelSource.LOCAL,
-        local_path=Path(
-            "/data/llama3.1/weights/8b/fp16/llama3.1_8b_fp16_instruct.irpa"
-        ),
-        model_file="llama3.1_8b_fp16_instruct.irpa",
+        source=ModelSource.HUGGINGFACE,
+        repo_id="SanctumAI/Meta-Llama-3.1-8B-Instruct-GGUF",
+        model_file="meta-llama-3.1-8b-instruct.f16.gguf",
         tokenizer_id="NousResearch/Meta-Llama-3.1-8B",
         batch_sizes=(1, 4),
         device_settings=device_settings.CPU,
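
The change swaps a machine-specific `.irpa` path for a Hugging Face repo and file name, so the test no longer depends on `/data/...` existing on the runner. The sketch below illustrates that difference only. `ModelSource` and `ModelConfig` here are hypothetical stand-ins that model just the fields visible in the diff (`device_settings` is omitted), and `describe_weights_location` is an illustrative helper, not a function from the repository. The URL layout assumes the Hub's `resolve/main` convention and the default `main` revision.

```python
from dataclasses import dataclass
from enum import Enum, auto
from pathlib import Path
from typing import Optional, Tuple


class ModelSource(Enum):
    # Hypothetical stand-in for the enum referenced in conftest.py.
    LOCAL = auto()
    HUGGINGFACE = auto()


@dataclass(frozen=True)
class ModelConfig:
    # Hypothetical stand-in; only fields visible in the diff are modeled.
    source: ModelSource
    model_file: str
    tokenizer_id: str
    batch_sizes: Tuple[int, ...]
    repo_id: Optional[str] = None
    local_path: Optional[Path] = None


def describe_weights_location(config: ModelConfig) -> str:
    """Report where the weights would come from, without touching disk or network."""
    if config.source is ModelSource.HUGGINGFACE:
        if not config.repo_id:
            raise ValueError("HUGGINGFACE source requires repo_id")
        # Assumed Hub file URL layout for the default "main" revision.
        return (
            f"https://huggingface.co/{config.repo_id}"
            f"/resolve/main/{config.model_file}"
        )
    if config.source is ModelSource.LOCAL:
        if config.local_path is None:
            raise ValueError("LOCAL source requires local_path")
        return str(config.local_path)
    raise ValueError(f"unsupported source: {config.source}")


# Values copied from the added lines of the diff.
llama3_1_8b = ModelConfig(
    source=ModelSource.HUGGINGFACE,
    repo_id="SanctumAI/Meta-Llama-3.1-8B-Instruct-GGUF",
    model_file="meta-llama-3.1-8b-instruct.f16.gguf",
    tokenizer_id="NousResearch/Meta-Llama-3.1-8B",
    batch_sizes=(1, 4),
)

if __name__ == "__main__":
    print(describe_weights_location(llama3_1_8b))
```

With the old `LOCAL` configuration the same helper would return the absolute `.irpa` path, which resolves only on hosts where that file has been provisioned.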
