Skip to content

v0.9.1: Many Vision Models, Qwen2.5 Coder, Gradient Fix

Latest
Compare
Choose a tag to compare
@hiyouga hiyouga released this 24 Nov 17:17
· 96 commits to main since this release
18daf10

New features

Note: now you can install transformers>=4.46.0,<=4.46.1 to make the gradient accumulation fix enabled.

New models

  • Base models
    • Qwen2.5 (0.5B/1.5B/3B/7B/14B/32B/72B) 📄
    • Qwen2.5-Coder (0.5B/1.5B/3B/7B/14B/32B) 📄🖥️
    • Llama-3.2 (1B/3B) 📄
    • OpenCoder (1.5B/8B) 📄🖥️
    • Index (1.9B) 📄
  • Instruct/Chat models
    • Qwen2.5-Instruct (0.5B/1.5B/3B/7B/14B/32B/72B) 📄🤖
    • Qwen2.5-Coder-Instruct (0.5B/1.5B/3B/7B/14B/32B) 📄🤖🖥️
    • Llama-3.2-Instruct (1B/3B) 📄🤖
    • OpenCoder-Instruct (1.5B/8B) 📄🤖🖥️
    • Index-Chat (1.9B) 📄🤖
    • LLaVA-NeXT (7B/8B/13B/34B/72B/110B) 📄🤖🖼️
    • LLaVA-NeXT-Video (7B/34B) 📄🤖🖼️
    • Video-LLaVA (7B) 📄🤖🖼️
    • Pixtral (12B) 📄🤖🖼️
    • EXAONE-3.0-Instruct (8B) 📄🤖

Security fix

Bug fix

Full Changelog: v0.9.0...v0.9.1