fix "Cannot copy out of meta tensor; no data!" issue for BartForConditionalGeneration model #36572

Open

yao-matrix wants to merge 4 commits into base: main
Conversation


@yao-matrix commented Mar 6, 2025

Fixes #36247.

Cause of this issue:

The bart-large-cnn checkpoint only contains `model.decoder.embed_tokens.weight` at `load_state_dict` time, but in the `BartModel` implementation both `model.decoder.embed_tokens.weight` and `model.encoder.embed_tokens.weight` are tied to `shared.weight`, which has no corresponding real weights in the checkpoint. When `tie_weights` is called, both embeddings end up pointing back at the empty `shared.weight`, which still lives on the meta device, and the subsequent `model.to(...)` then fails.
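As a rough illustration of the failure path (a sketch based on the description above; the `low_cpu_mem_usage` loading path is one way to hit it, see issue #36247 for the exact reproduction):

```python
import torch
from transformers import BartForConditionalGeneration

# Parameters are first created on the meta device, then filled in from the
# checkpoint; shared.weight has no checkpoint entry, so it stays on meta.
model = BartForConditionalGeneration.from_pretrained(
    "facebook/bart-large-cnn",
    low_cpu_mem_usage=True,
)

# After tie_weights(), both embed_tokens point back at the still-meta
# shared.weight, so moving the model raises:
# NotImplementedError: Cannot copy out of meta tensor; no data!
model.to("cuda" if torch.cuda.is_available() else "cpu")
```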

@SunMarc @Rocketknight1, I'm not sure whether this PR meets your standards, since it changes modeling code. Please let me know your thoughts, thanks.


github-actions bot commented Mar 6, 2025

Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. When it is ready for review, please click the Ready for review button (at the bottom of the PR page).

@github-actions bot marked this pull request as draft March 6, 2025 06:11
@yao-matrix marked this pull request as ready for review March 6, 2025 06:15
@Rocketknight1
Member

Hi @yao-matrix, this causes several other BART tests to fail! You can see the failures in the CI at the bottom of this post.

Some of the failures are just code style; these are simple and can be fixed with `make style` or `make fixup` after `pip install transformers[quality]`. However, the failures in `tests_torch` indicate that this PR breaks BART weight loading, probably because the old shared tensor doesn't match up with the architecture after your PR. We won't be able to merge the PR unless those errors are resolved, and I'm not sure that will be possible while `self.shared` is removed.

@SunMarc
Member

I don't think we want to change the modeling code to fix one specific checkpoint, even though I know this is one of the most used ones; other checkpoints may ship the `shared` module's weights instead. Maybe we can instead update `_tie_weights` to tie `self.shared` to `self.decoder.embed_tokens` when `self.decoder.embed_tokens` is not on the meta device?
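A sketch of what that suggestion could look like (my reading of it, not the actual diff in this PR; the attribute names follow the existing `BartModel`):

```python
# Hypothetical shape of BartModel._tie_weights after the suggested change.
def _tie_weights(self):
    if self.config.tie_word_embeddings:
        if (
            self.shared.weight.device.type == "meta"
            and self.decoder.embed_tokens.weight.device.type != "meta"
        ):
            # Checkpoints like bart-large-cnn only ship decoder.embed_tokens,
            # so tie shared (and the encoder) to the tensor that has real data.
            self._tie_or_clone_weights(self.shared, self.decoder.embed_tokens)
            self._tie_or_clone_weights(self.encoder.embed_tokens, self.decoder.embed_tokens)
        else:
            # Default direction: everything ties back to self.shared.
            self._tie_or_clone_weights(self.encoder.embed_tokens, self.shared)
            self._tie_or_clone_weights(self.decoder.embed_tokens, self.shared)
```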

@yao-matrix
Author

@Rocketknight1 @SunMarc, Marc's approach is better; I used it to fix the issue. Please help review and comment, thanks.

@SunMarc
Member

Thanks for fixing this! Really appreciate it!

@SunMarc requested a review from ArthurZucker March 7, 2025 14:03
@yao-matrix
Author

@ArthurZucker, please take a look, thanks.
