Pull requests: Tencent/hpc-ops
Pull requests list
add block-sparse fp8 prefill attention (dim=128/192, two quant schemes)
#45 opened May 25, 2026 by bcacdwk
3 tasks done
decode bf16 smallm: support arbitrary 1<heads_per_group<=8 via direct Q/Y GMEM when TMA unsuitable
#37 opened Apr 2, 2026 by Religious-J