
Pull requests: huggingface/transformers


Pull requests list

Reject assisted generation for LFM2 and LFM2-MoE (set _is_stateful)
#46937 opened Jun 27, 2026 by Sunt-ing Contributor
3 of 6 tasks
Fix crash in greedy assisted generation with different tokenizers
#46936 opened Jun 27, 2026 by Sunt-ing Contributor
1 task done
Fix TrackioCallback fails to log evaluation metrics after training ends
#46935 opened Jun 27, 2026 by lewtun Member
2 of 6 tasks
[Gemma4] Fix dtype casting for quantized vision/audio embedders
#46933 opened Jun 27, 2026 by sharmax-vikas Contributor
1 of 3 tasks
Bump minimum torch version to 2.5 and clean up torch<2.5 workarounds
#46930 opened Jun 27, 2026 by cyyever Contributor
1 of 6 tasks
Fix torch.export compatibility for Mixtral MoE experts
#46929 opened Jun 27, 2026 by Sarimsaljook
3 of 6 tasks
Fix yolos
#46922 opened Jun 26, 2026 by molbap Collaborator
docs: add prepare_for_model() v4→v5 migration example
#46921 opened Jun 26, 2026 by MushiSenpai
6 tasks
Adapt voxtral_realtime to mistral-common 1.11.5 audio API
#46920 opened Jun 26, 2026 by juliendenize Contributor
1 of 6 tasks
Fix RagTokenizer attribute delegation
#46919 opened Jun 26, 2026 by Pruthvi226
5 of 6 tasks
fix(deepspeed): chunk ZeRO-3 missing-key param gather to avoid OOM
#46918 opened Jun 26, 2026 by itxsamad1
1 of 2 tasks
Remove deprecated training args
#46917 opened Jun 26, 2026 by cyyever Contributor
1 of 6 tasks
Better and more extensive tests for RoPE
#46912 opened Jun 26, 2026 by zucchini-nlp Member
[Olmo3] different RoPE per layer type
#46911 opened Jun 26, 2026 by zucchini-nlp Member