Skip to content

Pinned Loading

  1. OLMo OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5.8k 639

  2. dolma dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.3k 147

  3. ai2thor ai2thor Public

    An open-source platform for Visual AI.

    C# 1.5k 251

  4. olmocr olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 13.6k 995

  5. OLMoE OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 829 78

Repositories

Showing 10 of 509 repositories
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    allenai/OLMo-core’s past year of commit activity
    Python 269 Apache-2.0 51 0 32 Updated Aug 5, 2025
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    allenai/olmo-cookbook’s past year of commit activity
    Python 37 Apache-2.0 8 0 27 Updated Aug 5, 2025
  • agent-eval Public
    allenai/agent-eval’s past year of commit activity
    Python 2 1 0 2 Updated Aug 5, 2025
  • open-instruct Public

    AllenAI's post-training codebase

    allenai/open-instruct’s past year of commit activity
    Python 3,085 Apache-2.0 426 22 16 Updated Aug 5, 2025
  • regmixer Public
    allenai/regmixer’s past year of commit activity
    Jupyter Notebook 6 0 0 2 Updated Aug 5, 2025
  • panda Public

    Panda ("plan-and-act") agent for Autonomous Scientific Discovery

    allenai/panda’s past year of commit activity
    Python 3 Apache-2.0 0 0 0 Updated Aug 4, 2025
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    allenai/olmocr’s past year of commit activity
    Python 13,580 Apache-2.0 994 13 3 Updated Aug 4, 2025
  • ai2thor Public

    An open-source platform for Visual AI.

    allenai/ai2thor’s past year of commit activity
    C# 1,474 Apache-2.0 251 266 5 Updated Aug 4, 2025
  • dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    allenai/dolma’s past year of commit activity
    Python 1,283 Apache-2.0 147 6 17 Updated Aug 4, 2025
  • OLMo-in-loop-evals Public

    Code for in-loop evaluation tasks used by the OLMo training team

    allenai/OLMo-in-loop-evals’s past year of commit activity
    Python 6 Apache-2.0 4 0 0 Updated Aug 4, 2025