-
Notifications
You must be signed in to change notification settings - Fork 519
Issues: allenai/OLMo
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
High CrossEntropy and Z Loss variance after loading from checkpoint
type/bug
An issue about a bug
#776
opened Jan 6, 2025 by
abhijangda
Generating training mix of OLMo2 from dolmino-mix
type/question
An issue that's a question
#775
opened Jan 5, 2025 by
Cy-47
OLMo2 checkpoints for continued pretraining (non-HF)
type/question
An issue that's a question
#773
opened Dec 30, 2024 by
SimonSuster
Sudden data error during training
type/bug
An issue about a bug
#766
opened Dec 16, 2024 by
faresobeid
tokenizer.encode function`s param add_special_tokens=False not work.
type/bug
An issue about a bug
#765
opened Dec 12, 2024 by
xiaohan2909
Difference Between DDP and FSDP Modes
type/question
An issue that's a question
#762
opened Dec 6, 2024 by
lllabmaster
About eos_token_id in config file (20M, 1B)
type/question
An issue that's a question
#757
opened Nov 29, 2024 by
lllabmaster
Fail to load tokenizer for checkpoints
type/bug
An issue about a bug
#741
opened Oct 24, 2024 by
tresiwald
Error Encountered During Multi-Node Pretraining with Torchrun
type/bug
An issue about a bug
#737
opened Oct 21, 2024 by
Zehui127
8-bit allgather support
type/question
An issue that's a question
#722
opened Sep 19, 2024 by
yaroslavvb
Which mmlu validation setting is recommend?
type/question
An issue that's a question
#714
opened Aug 27, 2024 by
mathfinder
[Quick question]: How do I turn off FSDP?
type/question
An issue that's a question
#703
opened Aug 15, 2024 by
candygocandy
RuntimeError: Triton Error [CUDA]: invalid device context
type/bug
An issue about a bug
#700
opened Aug 13, 2024 by
andymvp2018
slurm script for: configs/official/OLMo-7B.yaml
type/question
An issue that's a question
#699
opened Aug 13, 2024 by
andymvp2018
Gflops computation is faulty for FSDP due to bug in
OLMo.num_params()
#695
opened Aug 7, 2024 by
AkshitaB
why CrossEntropyLoss is zero,i
type/question
An issue that's a question
#692
opened Aug 6, 2024 by
aizhweiwei
Olmo 0724 An issue about a bug
-hf
checkpoints don't load the proper config when instantiating with OLMoForCausalLM
type/bug
#689
opened Aug 5, 2024 by
sarahwie
Model ladder has no documentation
type/documentation
An issue or pull request related to documentation
#683
opened Jul 31, 2024 by
IanMagnusson
mlp_ratio not adjusted in config if mlp_hidden_size is set
type/bug
An issue about a bug
#673
opened Jul 21, 2024 by
Muennighoff
Does global_train_batch_size support gradient accumulation?
type/question
An issue that's a question
#672
opened Jul 21, 2024 by
jinzhuoran
Is there explicitly instruction-following data in the version of Dolma used to train v1?
type/question
An issue that's a question
#658
opened Jul 15, 2024 by
john-hewitt
Can long text be splitted into short texts?
type/question
An issue that's a question
#655
opened Jul 12, 2024 by
CoinCheung
Cannot convert internal OLMo checkpoint to HF
type/bug
An issue about a bug
#654
opened Jul 11, 2024 by
viking-sudo-rm
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.