GOLD training speed up #4888

141forever · 2026-01-22T13:36:36Z

What does this PR do?

In the GOLD algorithm, there is a mapping relationship between the student and the teacher when computing the loss. In the original implementation, this part involved frequent switching between the CPU and GPU, which not only incurred significant time overhead but also easily led to GPU memory fragmentation. This PR fixes this issue by keeping the mapping relationship on the GPU in advance.

Fixes #4864

Before submitting

[YES] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
[YES] Did you read the contributor guideline,
Pull Request section?
[YES] Was this discussed/approved via a GitHub issue? Please add a link
to it if that's the case. If there are any training acceleration techniques? #4864
[YES] Did you make sure to update the documentation with your changes?
[NO] Did you write any new necessary tests?

Who can review?

People from HuggingFace.

training speed up

f0fa38e

141forever changed the title ~~training speed up~~ GOLD training speed up Jan 22, 2026

141forever mentioned this pull request Jan 22, 2026

If there are any training acceleration techniques? #4864

Closed

5 tasks

use self.accelerator.device

02e61c3

kashif approved these changes Jan 24, 2026

View reviewed changes

Merge branch 'main' into gold-train-opt

f69803a

kashif merged commit e106972 into huggingface:main Jan 26, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GOLD training speed up #4888

GOLD training speed up #4888

Uh oh!

141forever commented Jan 22, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

GOLD training speed up #4888

GOLD training speed up #4888

Uh oh!

Conversation

141forever commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

Who can review?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

141forever commented Jan 22, 2026 •

edited

Loading