Issues: huggingface/transformers
ModernBERT inference fails on CPU: ValueError: Pointer argument (at 0) cannot be accessed from Triton (cpu tensor?) [bug] #35388, opened Dec 21, 2024 by umarbutler (4 tasks)
Potentially incorrect calculation of total_updates on >=4.46.0 since #34198 affecting multi gpu training [bug] #35387, opened Dec 21, 2024 by chiragjn (2 of 4 tasks)
modernbert logits do not have gradient [bug] #35386, opened Dec 21, 2024 by andersonbcdefg (3 of 4 tasks)
Support modernBERT for encoder-decoder models [Feature request] #35385, opened Dec 21, 2024 by Bachstelze
MultiModalityCausalLM does not support Flash Attention 2.0 yet #35383, opened Dec 21, 2024 by AlanPonnachan
RuntimeError: self and mat2 must have the same dtype, but got Float and BFloat16 when training with torch_compile [bug] #35382, opened Dec 21, 2024 by umarbutler (2 of 4 tasks)
is_causal arg appears twice in FAttention call from GPT2Attention.forward() [bug] #35380, opened Dec 21, 2024 by poedator (2 of 4 tasks)
'do_sample' model default cannot be overridden [bug] #35372, opened Dec 20, 2024 by Zoher15 (2 of 4 tasks)
Model loaded with PretrainedModel.from_pretrained and with torch.device("cuda"): decorator leads to unexpected errors compared to .to("cuda") [bug] #35371, opened Dec 20, 2024 by fxmarty-amd (2 of 4 tasks)
Default value for mean_resizing in resize_token_embeddings should be False [bug] #35357, opened Dec 20, 2024 by cyr0930 (4 tasks)
Maybe the way SequenceClassification Model calculates the last non-pad token is not reasonable [bug] #35352, opened Dec 20, 2024 by liangxuZhang (4 tasks)
SinkCache (StreamLLM) implemented over Post-RoPE Key cache might result in confused position for inference [bug] #35350, opened Dec 19, 2024 by wangguangtao0722 (4 tasks)
A warning message showing that MultiScaleDeformableAttention.so is not found in /root/.cache/torch_extensions if ninja is installed with transformers [bug] #35349, opened Dec 19, 2024 by cainmagi (1 of 4 tasks)
[Mamba2] Varlen implementation [Feature request] #35346, opened Dec 19, 2024 by vasqu
Llama model, torch.compile output for custom device does not match with eager/cpu when generation_config.use_cache set to True [bug] #35343, opened Dec 19, 2024 by vpandya-quic (4 tasks)
Option to Disable Model Caching When Using "pipeline" [Feature request] #35337, opened Dec 19, 2024 by FadiAmon
Default arguments in DebertaConfig disable relative attention, contrary to the docs and deberta-base [bug] #35335, opened Dec 19, 2024 by bauwenst (4 tasks)
DeBERTa's DisentangledSelfAttention hardcodes float dtype, which causes bfloat16 overflow error [bug] #35332, opened Dec 19, 2024 by bauwenst (2 of 4 tasks)
tokenizer decode with timestamp fails for extended vocabulary [bug] #35330, opened Dec 18, 2024 by bnestor (2 of 4 tasks)
InternVL is ExecuTorch Compatible [ExecuTorch, Feature request] #35327, opened Dec 18, 2024 by guangy10