Skip to content

Reduce the memory usage of logits from O(context_length) to O(1) (#4688) #3825

Reduce the memory usage of logits from O(context_length) to O(1) (#4688)

Reduce the memory usage of logits from O(context_length) to O(1) (#4688) #3825

Annotations

1 warning

test-llama-runner-mac (fp32, cmake, portable)  /  macos-job

succeeded Aug 23, 2024 in 12m 37s