Issues: huggingface/transformers
GTrXL: Stabilizing Transformers for Reinforcement Learning
New model
#36220
opened Feb 16, 2025 by
ashok-arora
1 of 2 tasks
[bug] use_gather_object is not respected after the first eval in trainer
#36213
opened Feb 15, 2025 by
ducha-aiki
Token healing throws error with "Qwen/Qwen2.5-Coder-7B-Instruct"
bug
#36210
opened Feb 15, 2025 by
desaxce
2 of 4 tasks
modular_model_converter cannot handle local imports with return
bug
#36208
opened Feb 15, 2025 by
xiuqhou
2 of 4 tasks
Request to add DINO object detector
contributions-welcome
New model
Vision
#36205
opened Feb 14, 2025 by
tcourat
2 tasks done
Request to add DEIM object detector
contributions-welcome
New model
Vision
#36204
opened Feb 14, 2025 by
tcourat
2 tasks done
checkpoint will be saved twice at the end of training when save_strategy is epoch
bug
#36203
opened Feb 14, 2025 by
AaronZLT
Dedicated tokenizer for byte level transformers
Feature request
Request for a new feature
#36202
opened Feb 14, 2025 by
apehex
ValueError: Unrecognized image processor in Qwen/Qwen2.5-VL-3B-Instruct.
bug
#36193
opened Feb 14, 2025 by
SkalskiP
Recent Qwen2VL merge request (#35837) break compatibility with DeepSpeed
#36187
opened Feb 14, 2025 by
ArdalanM
Whisper .generate() function not respecting max_new_tokens or max_length
bug
#36183
opened Feb 13, 2025 by
mitchelldehaven
4 tasks
add Flash Attention Support for Helsinki-NLP/opus models
Feature request
Request for a new feature
#36169
opened Feb 13, 2025 by
AghaDurrani
TFViTModel and interpolate_pos_encoding=True
bug
TensorFlow
#36155
opened Feb 13, 2025 by
carlosg-m
2 of 4 tasks
SDPA is_causal=False has no effect due to LlamaModel._prepare_4d_causal_attention_mask_with_cache_position
bug
#36150
opened Feb 12, 2025 by
ringohoffman
4 tasks
Torchao int4_weight_only save error when passing layout
bug
#36147
opened Feb 12, 2025 by
jiqing-feng
4 tasks
Add the support for deepseek architecture .gguf
Feature request
Request for a new feature
#36144
opened Feb 12, 2025 by
zh-jp
'MERTConfig' object has no attribute 'conv_pos_batch_norm'
bug
#36134
opened Feb 11, 2025 by
Timothy-John
1 of 4 tasks
Speaker Verification: All Speakers Getting Perfect 1.000 Similarity Scores
bug
#36124
opened Feb 10, 2025 by
misterpathologist
1 of 4 tasks