Skip to content

Pull requests: PrimeIntellect-ai/prime-rl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

add correct loss scaling
#674 opened Aug 3, 2025 by samsja Draft
Reorganize configuration files
#670 opened Aug 2, 2025 by kalomaze Loading…
force fp32 logits
#664 opened Aug 1, 2025 by samsja Loading…
refactor logprob
#657 opened Jul 31, 2025 by samsja Loading…
[feat] expert parallel trainer
#650 opened Jul 30, 2025 by Jackmin801 Draft
1 of 12 tasks
Add GSPO
#645 opened Jul 29, 2025 by faresobeid Loading…
Multi-node DP + vLLM v0.10.0
#644 opened Jul 28, 2025 by mikasenghaas Loading…
RLOO and OPO baselines
#640 opened Jul 26, 2025 by faresobeid Draft
2 tasks
Length penalty
#638 opened Jul 26, 2025 by faresobeid Draft
1 task
Duplicating batch trick
#620 opened Jul 22, 2025 by faresobeid Draft
add SWE-RL verifier
#617 opened Jul 20, 2025 by rasdani Loading…
ProTip! no:milestone will show everything without a milestone.