[Fix/Inference] Fix GQA Triton and Support Llama3 #5624

Merged
yuanheng-zhao merged 9 commits into hpcaitech:feature/colossal-infer from yuanheng-zhao:fix/inference/Yi34B on Apr 23, 2024

Commits

Commits on Apr 23, 2024