[Fix/Inference] Fix GQA Triton and Support Llama3 #5624
Merged
yuanheng-zhao merged 9 commits into hpcaitech:feature/colossal-infer from yuanheng-zhao:fix/inference/Yi34B on Apr 23, 2024
Commits
9 commits on Apr 23, 2024