Skip to content

Pull requests: sgl-project/SpecForge

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Feature] Qwen3-VL-30B-A3B-Instruct eagle3 support
#251 opened Oct 10, 2025 by dcw02 Draft
6 tasks
Enable locking GPU in CI
#250 opened Oct 10, 2025 by fzyzcjy Loading…
6 tasks
fix: remove attention mask shift & add pe shift
#244 opened Oct 3, 2025 by Liyuhui-12 Loading…
6 tasks
Fix resume offline train logic. Add loading optimizer state
#243 opened Sep 29, 2025 by hanq-moreh Loading…
6 tasks
Apply FSDP2 to offline training
#242 opened Sep 26, 2025 by j1young Loading…
6 tasks
fix: mid aux hidden layer id calculation in online mode
#240 opened Sep 24, 2025 by Liu-Xue-Song Loading…
6 tasks
support deepseek-v2-lite online train and support yarn rope
#224 opened Sep 8, 2025 by jiapingW Loading…
6 tasks
Added mistral model support
#208 opened Sep 1, 2025 by ValeGian Loading…
3 of 6 tasks
[Feature] VLM model support tp
#206 opened Sep 1, 2025 by KerwinKai Draft
6 tasks
Support Train Eagle-3 By DeepSpeed
#197 opened Sep 1, 2025 by xq25478 Loading…
Adapt Eagle3 for Deepseek architecture
#186 opened Aug 28, 2025 by xuhaojie-2025 Loading…
6 tasks
supported think mode
#182 opened Aug 26, 2025 by jiapingW Loading…
6 tasks
Add Draft LoRA scripts high priority
#138 opened Aug 13, 2025 by shuaills Draft
6 tasks
Added Eagle training support for Kimi-K2
#108 opened Aug 3, 2025 by xuhaojie-2025 Loading…
ProTip! Add no:assignee to see everything that’s not assigned.