1 parent 06e3a04 commit 049b810
tools/llm/run_llm.py
@@ -68,7 +68,7 @@ def get_model(args):
         else:
             model = model.to(torch.float32)

-    return model.cuda()
+    return model


 def compile_torchtrt(model, input_ids, args):