Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add dataset mixer
#3791 opened Jul 28, 2025 by lewtun Loading…
5 tasks
Fix bug when train_dataset = None in SFTTrainer
#3789 opened Jul 28, 2025 by h-tonywu Loading…
1 of 5 tasks
[GRPO] update transformer version for CB
#3786 opened Jul 28, 2025 by kashif Loading…
Add vLLM server mode support to OnlineDPOTrainer
#3783 opened Jul 27, 2025 by vaelev Loading…
6 tasks done
Add AlphaPO Trainer
#3776 opened Jul 26, 2025 by qingquansong Loading…
3 of 5 tasks
Add vLLM transformers backend to online methods
#3773 opened Jul 25, 2025 by merveenoyan Loading…
Dynamic sampling option in GRPO trainer based on DAPO paper
#3758 opened Jul 23, 2025 by almeidava93 Loading…
2 of 5 tasks
Support dLLM in GRPO reference model creation
#3743 opened Jul 18, 2025 by xijia-tao Loading…
Add basic support for FSDP/Lora when using TRL/VLLM
#3735 opened Jul 14, 2025 by ojh31 Loading…
5 tasks
[WIP] Fix ppo example accelerator initialization error
#3732 opened Jul 14, 2025 by ccs96307 Draft
2 of 5 tasks
FSDP2+GRPO
#3687 opened Jul 3, 2025 by SalmanMohammadi Loading…
5 tasks
[SFT] Dry up the sft tests
#3657 opened Jun 27, 2025 by kashif Loading…
5 tasks
feat: Initial implementation of RePO trainer and components
#3655 opened Jun 26, 2025 by celsowm Loading…
5 tasks
Ensure Chat Template Safe Prompt Truncation
#3646 opened Jun 25, 2025 by pramodith Loading…
4 of 5 tasks
[WIP] vllm-server-spec-dec-support
#3643 opened Jun 24, 2025 by shirinyamani Loading…
5 tasks
GRPO: Pack Responses within the same group.
#3642 opened Jun 24, 2025 by pramodith Draft
4 of 5 tasks
Add Entropy Control to GRPOTrainer
#3628 opened Jun 22, 2025 by 1485840691 Loading…
Feature: Add SGLang support for GRPO Trainer
#3627 opened Jun 21, 2025 by PrinsYin Draft
5 tasks
📘 SFT doc rewrite
#3619 opened Jun 18, 2025 by qgallouedec Loading…
ClearML logging of visualization in RewardTrainer evaluation
#3602 opened Jun 16, 2025 by ioverho Loading…
2 of 5 tasks
Fix: corrected fsdp in GRPO trainer
#3582 opened Jun 13, 2025 by tryumanshow Loading…
2 of 5 tasks
ProTip! Adding no:label will show everything without a label.