Skip to content

Commit

Permalink
[Bugfix] Fix TP > 1 for new granite (vllm-project#8544)
Browse files Browse the repository at this point in the history
Signed-off-by: Joe Runde <Joseph.Runde@ibm.com>
  • Loading branch information
joerunde authored and Jeffwan committed Sep 19, 2024
1 parent 35ede69 commit 9d9c360
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion vllm/model_executor/models/granite.py
Original file line number Diff line number Diff line change
Expand Up @@ -428,7 +428,8 @@ def compute_logits(
sampling_metadata: SamplingMetadata) -> Optional[torch.Tensor]:
logits = self.logits_processor(self.lm_head, hidden_states,
sampling_metadata)
logits /= self.config.logits_scaling
if logits is not None:
logits /= self.config.logits_scaling
return logits

def sample(
Expand Down

0 comments on commit 9d9c360

Please sign in to comment.