Pinned

  1. OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 6.1k 671

  2. dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.3k 154

  3. ai2thor Public

    An open-source platform for Visual AI.

    C# 1.6k 262

  4. olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 16k 1.2k

  5. OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 904 83

Repositories

Showing 10 of 530 repositories
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    Python 52 Apache-2.0 10 1 32 Updated Nov 17, 2025
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    Python 319 Apache-2.0 58 3 44 Updated Nov 17, 2025
  • open-instruct Public

    AllenAI's post-training codebase

    Python 3,300 Apache-2.0 457 9 (1 issue needs help) 50 Updated Nov 16, 2025
  • FlexOlmo Public

    Code and training scripts for FlexOlmo

    Python 113 Apache-2.0 13 3 11 Updated Nov 16, 2025
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    Python 56 Apache-2.0 7 27 2 Updated Nov 16, 2025
  • olmoearth_projects Public

    OlmoEarth projects

    Python 35 3 1 1 Updated Nov 15, 2025
  • Python 81 Apache-2.0 10 5 4 Updated Nov 15, 2025
  • decon Public

    decontamination

    Rust 6 Apache-2.0 0 0 0 Updated Nov 15, 2025
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 15,962 Apache-2.0 1,216 30 11 Updated Nov 14, 2025
  • scirepeval Public

    SciRepEval benchmark training and evaluation scripts

    Python 76 Apache-2.0 10 3 1 Updated Nov 14, 2025