Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

🎀 New default: beta=0.0 for GRPO
#3516 opened May 30, 2025 by qgallouedec Loading…
5 tasks
🎀 New defaults: bf16=True
#3515 opened May 30, 2025 by qgallouedec Loading…
5 tasks
🎀 New defaults: logging_steps=10
#3514 opened May 30, 2025 by qgallouedec Loading…
5 tasks
intuit
#3513 opened May 29, 2025 by shirinyamani Loading…
5 tasks
🧭 Patch release guide
#3512 opened May 29, 2025 by qgallouedec Loading…
5 tasks
🎀 New defaults: gradient_checkpointing=True
#3510 opened May 29, 2025 by qgallouedec Loading…
5 tasks
Add Bidirectional Knowledge Distillation Option to GKDTrainer
#3508 opened May 29, 2025 by shaischaudhry Loading…
3 of 5 tasks
[GRPO] INTUITOR paper self-certainty score
#3507 opened May 28, 2025 by shirinyamani Loading…
5 tasks
Rearrange DPOTrainer
#3501 opened May 27, 2025 by DaizeDong Loading…
2 of 5 tasks
HF Doc Builder style
#3498 opened May 26, 2025 by qgallouedec Draft
[GRPO] Adds SSR priorized replay buffer
#3496 opened May 26, 2025 by edbeeching Loading…
[GKD] Use vllm for the teacher model
#3475 opened May 21, 2025 by kashif Draft
5 tasks
Add support for CB with native transformers
#3471 opened May 20, 2025 by ArthurZucker Loading…
Allow an user to train from a local dataset
#3470 opened May 19, 2025 by gogo2464 Loading…
1 of 5 tasks
add support for image inputs in GRPO
#3460 opened May 16, 2025 by hellopahe Loading…
[SFT] add warning if dataset's input_ids exceed max_length
#3449 opened May 15, 2025 by HERIUN Loading…
1 of 5 tasks
Fix logging docs
#3447 opened May 14, 2025 by xingyaoww Draft
2 of 5 tasks
🛠️ quantization support for vllm generation
#3428 opened May 8, 2025 by shirinyamani Loading…
5 tasks
Reintroducing step method in ppo_trainer
#3410 opened May 3, 2025 by jskaf34 Loading…
2 of 5 tasks
fix setup chat format
#3404 opened May 2, 2025 by qgallouedec Draft
5 tasks
Reintroduce generate method for PPOTrainer
#3374 opened Apr 27, 2025 by CloseChoice Loading…
4 tasks done
ProTip! Add no:assignee to see everything that’s not assigned.