Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Update on "Reordered TP parallel plan to follow execution order"
- Llama uses pre-norm (norm before attention and before FFN), so we can move these up. - The root norm is before output, so we can swap this order too. [ghstack-poisoned]
- Loading branch information