Add Entropy Adaptive Fine Tuning to SFT Trainer #4802

electroglyph · 2026-01-10T10:11:55Z

from the paper: https://huggingface.co/papers/2601.02151

the implementation is 100% based on (and credited to):
hiyouga/LlamaFactory@main...ymxyll:LlamaFactory-EAFT:feature/eaft

i added a test =)

closes #4795

use like so:

from trl import SFTConfig, SFTTrainer

trainer = SFTTrainer(
    # model and train_dataset omitted for brevity
    args=SFTConfig(
        loss_type="eaft",
        eaft_alpha=1.0,  # default
    ),
)
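
For readers who want an intuition for what `eaft_alpha` controls, here is a minimal pure-Python sketch. It assumes the loss scales each token's cross-entropy by that token's normalized predictive entropy raised to `alpha`; that is a reading of the general idea, not the code in this PR. The function names and the normalization by `log(vocab_size)` are hypothetical, and the actual implementation may differ (for example, in how the entropy is estimated or normalized).

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy_weighted_loss(logits_per_token, targets, alpha=1.0):
    """Mean cross-entropy with each token scaled by (H / log V) ** alpha.

    Illustrative assumption: H is the entropy of the predicted distribution
    and V is the vocabulary size, so the weight lies in [0, 1].
    """
    total = 0.0
    for logits, target in zip(logits_per_token, targets):
        probs = softmax(logits)
        ce = -math.log(probs[target])  # standard per-token cross-entropy
        entropy = -sum(p * math.log(p) for p in probs if p > 0.0)
        weight = (entropy / math.log(len(logits))) ** alpha
        total += weight * ce
    return total / len(targets)


# A confidently wrong prediction contributes less than under plain cross-entropy.
confident_wrong = [[10.0, 0.0, 0.0, 0.0]]
plain = entropy_weighted_loss(confident_wrong, [1], alpha=0.0)
weighted = entropy_weighted_loss(confident_wrong, [1], alpha=1.0)
print(plain > weighted)
```

Under these assumptions, `alpha=0.0` reduces to plain cross-entropy, and larger values shrink the contribution of low-entropy (confident) tokens more aggressively.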

qgallouedec · 2026-01-12T13:42:54Z

Thanks! Can you add a short subsection here: https://github.com/huggingface/trl/blob/main/docs/source/paper_index.md#supervised-fine-tuning

electroglyph · 2026-01-21T00:51:50Z

> Thanks! Can you add a short subsection here: https://github.com/huggingface/trl/blob/main/docs/source/paper_index.md#supervised-fine-tuning

just letting you know i added the section

electroglyph added 2 commits January 10, 2026 02:05

feat: EAFT

9a64fcb

Merge branch 'main' into EAFT

84cd1e6

electroglyph added 2 commits January 13, 2026 22:32

Merge branch 'huggingface:main' into EAFT

317705a

add info to paper index

643f97f

Merge branch 'main' into EAFT

467da3f
