Skip to content
This repository has been archived by the owner on Oct 25, 2024. It is now read-only.

[LLM Runtime] enable MHA fusion for gptneox&dolly&starcoder&llama2-70b#567

Merged
VincyZhang merged 17 commits intomainfrom mha_fusionNov 1, 2023

Commits

Commits on Oct 27, 2023

Commits on Oct 31, 2023

Commits on Nov 1, 2023