Skip to content

llama : custom attention mask + parallel decoding + no context swaps#3228

Merged
ggerganov merged 57 commits intomasterfrom custom-attention-maskSep 28, 2023

Commits

Commits on Sep 21, 2023

Commits on Sep 27, 2023