Issues: huggingface/transformers
Issues list
Implementation Issue of Phi3SuScaledRotaryEmbedding
Feature request
Request for a new feature
#31339
opened Jun 10, 2024 by
ryan-minato
Support saving models trained with DeepSpeed in Trainer callbacks
Feature request
Request for a new feature
#31338
opened Jun 9, 2024 by
dwyatte
model_kwargs is None when generation_config is passed as a dict instead of generation.GenerationConfig
#31328
opened Jun 8, 2024 by
AADeLucia
MixtralFlashAttention2 subscripts position_ids before checking if it is None
#31326
opened Jun 7, 2024 by
Luke20000429
2 of 4 tasks
Using a single 'RecurrentGemmaRglru' layer - "Trying to backward through the graph a second time" Error
#31324
opened Jun 7, 2024 by
talrub
2 of 4 tasks
Language modeling examples do not show how to do multi-gpu training / fine-tuning
#31323
opened Jun 7, 2024 by
csiefer2
2 of 4 tasks
[GGUF] Support new architectures/ quantisation schemes in Transformers
contributions-welcome
#31314
opened Jun 7, 2024 by
Vaibhavs10
AutoModelForCausalLM.from_pretrained silently fails
#31306
opened Jun 7, 2024 by
gpetters-amd
4 tasks
merge_and_unload for a quantized model ruins its quality
Quantization
#31293
opened Jun 6, 2024 by
Aktsvigun
2 of 4 tasks
Having a function to verify if checkpoint is valid
Feature request
Request for a new feature
#31283
opened Jun 6, 2024 by
Bfault
Constraints in constrained beam search can be satisfied by the inputs.
Generation
#31281
opened Jun 6, 2024 by
zawedcvg
2 of 4 tasks
Stuck on Initializing Transformers Model with FSDP (Fully Sharded Data Parallel) using meta device
#31278
opened Jun 6, 2024 by
jiangjiadi
2 of 4 tasks
While using the integration of bitsandbytes, Error shows: name 'torch' is not defined
#31273
opened Jun 6, 2024 by
46319943
2 of 4 tasks
'FastSpeech2ConformerConfig' object has no attribute 'model_config'
Audio
#31270
opened Jun 6, 2024 by
spencerchubb
1 of 4 tasks
bf16 is more unstable than fp16, when looking at the difference of generation logprobs and forward logprobs
#31267
opened Jun 5, 2024 by
vwxyzjn
2 of 4 tasks
Flaky test - tests/models/mobilenet_v1/test_modeling_mobilenet_v1.py::MobileNetV1ModelTest::test_batching_equivalence
#31257
opened Jun 5, 2024 by
amyeroberts
4 tasks
Adaptive Decoding Support
Feature request
Request for a new feature
Generation
#31250
opened Jun 5, 2024 by
zwhong714
Intel/dpt-swinv2-tiny-256: TypeError: unsupported operand type(s) for //: 'NoneType' and 'NoneType'
Vision
#31249
opened Jun 5, 2024 by
yurithefury
2 of 4 tasks
Add support for non-CUDA architectures at the same time Bitsandbytes is doing it
Feature request
Request for a new feature
#31248
opened Jun 4, 2024 by
sealad886
We Need Compile Support For Mamba!
Compilation
Issues related to torchdynamo and torchinductor
Feature request
Request for a new feature
#31246
opened Jun 4, 2024 by
zhenglongjiepheonix