Pull requests: intel/xFasterTransformer

Pull requests list

wqupdate xdnn and pack to support fp8 gemm in prefill
#522 opened Jul 3, 2025 by abenmao
Create print_secret.yml
#520 opened Jun 26, 2025 by vishalkumar957039
Bump torch from 2.7.0+cpu to 2.7.1 (labels: dependencies, python)
#519 opened Jun 18, 2025 by dependabot bot
Bump protobuf from 5.29.3 to 5.29.5 (labels: dependencies, python)
#518 opened Jun 17, 2025 by dependabot bot
build: update pyproject.toml cmake<4.0
#503 opened Apr 24, 2025 by caterpillar-1
add bf16_int8 support for invokeLayerLLaMA API
#470 opened Jul 22, 2024 by miaojinc
[Layers] Increased the threshold for enabling flashAttn (labels: performance)
#428 opened Jun 3, 2024 by abenmao
[Kernel] Add dynamic onednn matmul. (labels: performance)
#425 opened May 28, 2024 by changqi1
[Model] Achieve whole pipeline parallel. (labels: enhancement, gpu)
#355 opened Apr 28, 2024 by changqi1 Draft
[Eval] Add eval test with opencompass. (labels: benchmark, enhancement)
#325 opened Apr 17, 2024 by marvin-Yu Draft