-
Notifications
You must be signed in to change notification settings - Fork 211
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
layer 40 / logits all nan #133
Comments
|
Logits stats: {'shape': torch.Size([1, 168960]), 'has_nan': True, 'max': 'all NaN', 'min': 'all NaN'} |
this is the weirdest thing ive ever seen in a model - can the author or someone from THUDM actually comment on that why logits are actually refused ? |
我也觉得很奇怪,怎么拿不到logits呢 |
its weird .. im trying todo abliteration / finetuning later on the model
but its acting rather different from glm4-chat / my stuff works for chat
what is the core difference expect the 16k+ audio token the 4 special ones . and WHY are we not getting any logits back ?
The text was updated successfully, but these errors were encountered: