-
Notifications
You must be signed in to change notification settings - Fork 28.9k
Issues: huggingface/transformers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Attention mask for multi-image input in gemma3
bug
#38053
opened May 9, 2025 by
deval281shah
1 of 4 tasks
Removing GenerateMixin inheritance from PreTrainedModel class results in Phi4 load fail
bug
#38050
opened May 9, 2025 by
yatindrav
1 of 4 tasks
Modernbert 3D attention mask
Feature request
Request for a new feature
#38040
opened May 9, 2025 by
meetdoshi-iitb
Trainer API doesnt stop after the training has been completed
bug
#38039
opened May 9, 2025 by
Awaisn25
2 of 4 tasks
transformers require torch >= 2.1.0 to run fp8 model, but im using 2.7.0
bug
#38034
opened May 9, 2025 by
O5-7
2 of 4 tasks
RuntimeError when loading InternVL3-14B model: Embedding size mismatch
#38033
opened May 9, 2025 by
wkzcml-1
Removing the modification of loss value due to rounding off to 4 digits
bug
#38032
opened May 9, 2025 by
harish6696
2 of 4 tasks
TimeSformer assumes a fixed number of frames in its layers even though it interpolates temporal embeddings based on the input
bug
#38027
opened May 8, 2025 by
kamila-chay
1 of 4 tasks
while using trainer to train mnist model, 'ValueError: Found input variables with inconsistent numbers of samples: [10000, 8750]'
bug
#38024
opened May 8, 2025 by
HaoyaWHL
2 of 4 tasks
Maybe the vocab_size can be duplicated to the mainconfig for PEFT to pick up
#38017
opened May 8, 2025 by
lancercat
Trainer Stuck at 0% Progress during Training on Multi-GPU Setup
bug
#38008
opened May 8, 2025 by
yanho824
2 of 4 tasks
Does Qwen_2_5_VL support variable length attention computation?
Feature request
Request for a new feature
#38007
opened May 8, 2025 by
yingtongxiong
[bug]
use_sliding_window
doesn't work as expected
bug
#38002
opened May 7, 2025 by
ZhiyuLi-Nvidia
1 of 4 tasks
RuntimeError when converting and saving Flax ViT model to PyTorch
bug
Flax
#37999
opened May 7, 2025 by
nobodyPerfecZ
4 tasks
Versions greater than 4.49 are not compatible with Ascend NPU
bug
#37992
opened May 7, 2025 by
1737686924
4 tasks
Bug Report: Unexpected Keyword Argument 'padding_side' in PreTrainedTokenizerFast
bug
#37989
opened May 7, 2025 by
yunqianluo
1 of 4 tasks
Support saving tensors to a file in Model addition debuggers
Feature request
Request for a new feature
#37983
opened May 6, 2025 by
RyanMullins
Add Request for a new feature
pruna
integration for loading model through transmorfers.from_pretrained
/ pipeline
.
Feature request
#37971
opened May 6, 2025 by
davidberenstein1957
Inconsistency in installation instructions for
venv
and uv
#37956
opened May 5, 2025 by
arjunaskykok
Previous Next
ProTip!
Follow long discussions with comments:>50.