Commit
Mixtral to NeMo conversion script. (NVIDIA#8155)
* HF-Mixtral to NeMo conversion script.
  Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* Pass MoE options from NeMo config to TransformerConfig.
  Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* Add version check for get_gpt_layer_with_transformer_engine_spec
  Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* Determine MoE support by attempting to import MoETokenDispatcher.
  Using importlib.metadata.version would be an alternative, however,
  a) it requires having mcore installed via pip (not always the case) and
  b) one might override megatron's location (e.g. via PYTHONPATH) and as a result would get inaccurate version from importlib.metadata.
  Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci

* Mixtral-NeMo to Mixtral-HF converter.
  Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* fixup: Update mcore_supports_moe due to file rename in upcoming MoE
  Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci

* Mixtral-converters: use `set_expert_model_parallel_world_size` to specify MoE world size.
  Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* Fix import
  Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci
  Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* Jenkins: install lightning.
  Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* Match latest MoE parameter names.
  Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks
  for more information, see https://pre-commit.ci

---------

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eric Harper <complex451@gmail.com>
Signed-off-by: Sasha Meister <ameister@nvidia.com>
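
The import-based feature check described in the message (preferring an attempted import over `importlib.metadata.version`) can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the helper `can_import` is hypothetical, and the module path `megatron.core.transformer.moe.token_dispatcher` is an assumption, since the message itself notes that the file was being renamed upstream.

```python
import importlib


def can_import(module_path: str, symbol: str) -> bool:
    """Return True if `symbol` can be found in `module_path` (hypothetical helper)."""
    try:
        module = importlib.import_module(module_path)
    except ImportError:
        # Module missing or not importable: treat the feature as unsupported.
        return False
    return hasattr(module, symbol)


def mcore_supports_moe() -> bool:
    # Assumed module path; it may differ between megatron-core versions.
    # The check probes whatever megatron is actually importable (including one
    # placed on PYTHONPATH), which is why it avoids pip metadata entirely.
    return can_import(
        "megatron.core.transformer.moe.token_dispatcher", "MoETokenDispatcher"
    )
```

Because the check imports from whatever `megatron` resolves to at runtime, it still works when megatron-core was not installed via pip or when its location is overridden, the two cases the message gives for not relying on package metadata.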