Skip to content

Pinned Loading

  1. OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5.8k 638

  2. dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.3k 147

  3. ai2thor Public

    An open-source platform for Visual AI.

    C# 1.5k 251

  4. olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 13.5k 992

  5. OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 826 77

Repositories

Showing 10 of 509 repositories
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    Python 269 Apache-2.0 51 0 32 Updated Aug 3, 2025
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    Python 37 Apache-2.0 8 0 27 Updated Aug 2, 2025
  • open-instruct Public

    AllenAI's post-training codebase

    Python 3,083 Apache-2.0 426 22 14 Updated Aug 3, 2025
  • understanding_mcqa Public

    Code for the arXiv preprint "Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions"

    Python 11 Apache-2.0 2 0 0 Updated Aug 2, 2025
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    Python 40 Apache-2.0 5 11 8 Updated Aug 2, 2025
  • datamap-rs Public

    Data mapping framework for rust stuff

    Rust 4 1 0 2 Updated Aug 1, 2025
  • dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1,283 Apache-2.0 147 5 17 Updated Aug 1, 2025
  • beaker-gantry Public

    Gantry streamlines running Python experiments in Beaker by managing containers and boilerplate for you

    Python 25 Apache-2.0 7 2 2 Updated Aug 1, 2025
  • ai2thor Public

    An open-source platform for Visual AI.

    C# 1,465 Apache-2.0 251 266 5 Updated Aug 1, 2025
  • agent-eval Public
    Python 2 1 0 1 Updated Aug 1, 2025