Add YaRN rope adjustment + DeepSeekV3 rope_interleave by ysjprojects · Pull Request #2202 · Lightning-AI/litgpt

ysjprojects · 2026-02-15T23:04:21Z

This pull request introduces comprehensive support for YaRN (Yet another RoPE extensioN) rotary position embedding (RoPE) scaling to the codebase, enabling advanced context extension and compatibility with models like DeepSeek V3. The changes include new configuration options, robust parameter validation, updated RoPE cache construction, and new tensor operations to support interleaved RoPE layouts. Additionally, a new test suite is added to validate LitGPT’s YaRN implementation against HuggingFace’s DeepSeek V3.

Added a new rope_interleave boolean and extended rope_adjustments in the Config class to support YaRN and DeepSeekV3 specific RoPE logic
Implemented YaRN-specific scaling logic in the build_rope_cache function, including attention scaling computation, frequency blending, and smooth ramping between extrapolation and interpolation regimes.
Added tests/test_yarn.py, a comprehensive test comparing LitGPT’s DeepSeek V3 block with YaRN RoPE scaling against HuggingFace’s implementation.

Motivation

DeepSeekV3 and many other modern architectures use YaRN

ysjprojects added 2 commits February 15, 2026 17:49

yarn

e74fb7e

add rope_interleave to config.py

d7c5dbe

ysjprojects requested review from KaelanDt, andyland, k223kim, lantiga, lianakoleva and t-vi as code owners February 15, 2026 23:04

ysjprojects added 4 commits February 15, 2026 19:23

fix: test_yarn

504bed4

fix

3390ddd

Merge branch 'main' into sj/yarn_rope

74bae77

Merge branch 'main' into sj/yarn_rope

e880793

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add YaRN rope adjustment + DeepSeekV3 rope_interleave#2202

Add YaRN rope adjustment + DeepSeekV3 rope_interleave#2202
ysjprojects wants to merge 6 commits intomainfrom
sj/yarn_rope

ysjprojects commented Feb 15, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ysjprojects commented Feb 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ysjprojects commented Feb 15, 2026 •

edited

Loading