allenai / OLMo Public

Issues: allenai/OLMo

53 Open 172 Closed

Issues list

Failed to resolve dependency while using uv type/bug

An issue about a bug

#798 opened Feb 18, 2025 by aztecher

Activations Exploding Across Layers type/question

An issue that's a question

#797 opened Feb 15, 2025 by c3-utsavdutta98

Optimizer and trainer states for OLMo-7B (Feb. 2024) type/question

#796 opened Feb 13, 2025 by rahuln

Request for Checkpoint for Mid-stage Training type/question

#794 opened Feb 4, 2025 by liziniu

Tokenizer to be used for generation of data to .npy files type/question

#791 opened Jan 29, 2025 by WenJett

Merging new tokens into parts type/question

#778 opened Jan 9, 2025 by RitwikGupta

High CrossEntropy and Z Loss variance after loading from checkpoint type/bug

#776 opened Jan 6, 2025 by abhijangda

Generating training mix of OLMo2 from dolmino-mix type/question

#775 opened Jan 5, 2025 by Cy-47

tokenizer.encode function`s param add_special_tokens=False not work. type/bug

#765 opened Dec 12, 2024 by xiaohan2909

About eos_token_id in config file (20M, 1B) type/question

#757 opened Nov 29, 2024 by lllabmaster

Fail to load tokenizer for checkpoints type/bug

#741 opened Oct 24, 2024 by tresiwald

Error Encountered During Multi-Node Pretraining with Torchrun type/bug

#737 opened Oct 21, 2024 by Zehui127

Missing OLMo checkpoints

#726 opened Oct 3, 2024 by mirandrom

Expected Data Format type/question

#715 opened Aug 27, 2024 by aflah02

Which mmlu validation setting is recommend? type/question

#714 opened Aug 27, 2024 by mathfinder

[Quick question]: How do I turn off FSDP? type/question

#703 opened Aug 15, 2024 by candygocandy

RuntimeError: Triton Error [CUDA]: invalid device context type/bug

#700 opened Aug 13, 2024 by andymvp2018

slurm script for: configs/official/OLMo-7B.yaml type/question

#699 opened Aug 13, 2024 by andymvp2018

Gflops computation is faulty for FSDP due to bug in OLMo.num_params()

#695 opened Aug 7, 2024 by AkshitaB

Olmo 0724 -hf checkpoints don't load the proper config when instantiating with OLMoForCausalLM type/bug

#689 opened Aug 5, 2024 by sarahwie

Model ladder has no documentation type/documentation

An issue or pull request related to documentation

#683 opened Jul 31, 2024 by IanMagnusson

mlp_ratio not adjusted in config if mlp_hidden_size is set type/bug

#673 opened Jul 21, 2024 by Muennighoff

Does global_train_batch_size support gradient accumulation? type/question

#672 opened Jul 21, 2024 by jinzhuoran

Is there explicitly instruction-following data in the version of Dolma used to train v1? type/question

#658 opened Jul 15, 2024 by john-hewitt

Can long text be splitted into short texts? type/question

#655 opened Jul 12, 2024 by CoinCheung
