forked from NVIDIA/Megatron-LM
-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Upstream sync #6
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
ci: Bump reference sha See merge request ADLR/megatron-lm!2092
ci: Disable broken test See merge request ADLR/megatron-lm!2093
…nto 'main' Multimodal sequence length optimizations See merge request ADLR/megatron-lm!1985
tests: Disable flaky test See merge request ADLR/megatron-lm!2094
tests: Repeat MRs 5 times See merge request ADLR/megatron-lm!2004
…t_process_group, it causes hangs Co-authored-by: Szymon Migacz <1934379+szmigacz@users.noreply.github.com>
Don't pass device_id to torch.distributed.init_process_group, it causes hangs See merge request ADLR/megatron-lm!2091
ci: Add release tests for 0.9 See merge request ADLR/megatron-lm!2059
… branches to fail
fix: allow merge request CI for non-protected branches to fail See merge request ADLR/megatron-lm!2106
chore: Fix autoformatter for release branches See merge request ADLR/megatron-lm!2107
Co-authored-by: Shanmugam Ramasamy <shanmugamr@shanmugamr-mlt.client.nvidia.com>
Fixing broken links See merge request ADLR/megatron-lm!2104
Add video handling into multimodal mcore See merge request ADLR/megatron-lm!2072
Enable optional kwargs with CUDA graph See merge request ADLR/megatron-lm!1715
Resolve "Fix TE version in TELinear" Closes NVIDIA#318 See merge request ADLR/megatron-lm!2077
Update path to MMMU to use new repos structure See merge request ADLR/megatron-lm!2112
…STIC_ALGO Co-authored-by: Shanmugam Ramasamy <shanmugamr@shanmugamr-mlt.client.nvidia.com>
Removing env variable NVTE_ALLOW_NONDETERMINISTIC_ALGO See merge request ADLR/megatron-lm!1880
Fix upcycling issues. See merge request ADLR/megatron-lm!2089
tests: Fix ENV export See merge request ADLR/megatron-lm!2189
tests: Fix ENV export See merge request ADLR/megatron-lm!2194
…ChainedOptimizer Support for distributed checkpointing
GroupedMLP DistOpt Resharding and add UTs to ChainedOptimizer Support for distributed checkpointing See merge request ADLR/megatron-lm!1790
ci: Always upload artifacts See merge request ADLR/megatron-lm!2197
Data parallel inference See merge request ADLR/megatron-lm!2141
Remove CUDA requirement from cpu test. See merge request ADLR/megatron-lm!2199
Support padding between subsequences of Packed Sequence See merge request ADLR/megatron-lm!2096
Revert "Merge branch 'vitalyk/testfix' into 'main'" See merge request ADLR/megatron-lm!2206
Standard interface for getting offsets from tokenizers See merge request ADLR/megatron-lm!1909
tests: Use flaky instead of skip marker See merge request ADLR/megatron-lm!2208
@gurpreet-dhami Can we re-run the Unit tests once again, just to see if we are not regressing on anything. |
Able to run llama2 70b on this. |
Verbally approved by Chaitanya @lcskrishna . |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
No description provided.