This repository has been archived by the owner on Oct 25, 2024. It is now read-only.

[LLM Runtime] enable MHA fusion for gptneox & dolly & starcoder & llama2-70b #567
