Skip to content

Pull requests: Tencent/TencentPretrain

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix the bug that checkpoint saving.
#134 opened Dec 27, 2024 by qing-yuan233 Loading…
add suport for baichuan2_7b
#131 opened Jul 30, 2024 by Yang-Yi20 Loading…
add support for Qwen
#129 opened Jun 3, 2024 by cjw-d Loading…
Modularize convert_tencentpretrain_to_llama.py
#127 opened May 7, 2024 by xlhuang825 Loading…
add runtime inference for mt classifier
#124 opened Mar 5, 2024 by kanson1996 Loading…
support LLaVa
#119 opened Jan 9, 2024 by JINGZIjingzi Loading…
Add CLIP model and scripts
#118 opened Jan 8, 2024 by ydli-ai Loading…
rename argument
#108 opened Oct 26, 2023 by JINGZIjingzi Loading…
add bloom models
#73 opened Jul 1, 2023 by ydli-ai Loading…
add support for Falcon LLM
#72 opened Jun 10, 2023 by ydli-ai Loading…
Add support for deepspeed zero-3
#33 opened Mar 20, 2023 by hhou435 Loading…
ProTip! What’s not been updated in a month: updated:<2025-05-07.