Skip to content

add dot_product abstraction to reduce preprocessor branching

16cfaba
Select commit
Loading
Failed to load commit list.
Open

vulkan: add v_dot2_f32_f16 support in matrix-matrix multiplication and Flash Attention #24123

add dot_product abstraction to reduce preprocessor branching
16cfaba
Select commit
Loading
Failed to load commit list.