Skip to content

Reduce the memory usage of logits from O(context_length) to O(1) #19529

Reduce the memory usage of logits from O(context_length) to O(1)

Reduce the memory usage of logits from O(context_length) to O(1) #19529

Annotations

1 warning

test-models-linux (buck2, mv3, xnnpack-quantization-delegation, linux.2xlarge, 90)  /  linux-job

succeeded Aug 22, 2024 in 7m 52s