GPU kernel engineer building from first principles | MLSys 2026 FlashInfer competitor
- Seattle, WA
- in/karnbir-khera
Pinned Loading
-
-
KarnbirKhera-MLSys2026-Deepseek-Sparse-Attention-TrackB
KarnbirKhera-MLSys2026-Deepseek-Sparse-Attention-TrackB PublicPython 1
-
-
CUDA-TwoTreeFramework
CUDA-TwoTreeFramework PublicA systematic and pedagogical way to derive the correctness structure of 2D Register Allocated GEMM before coding.
HTML 6
-
Kernel-To-Theory-9-Module-Learning-Plan
Kernel-To-Theory-9-Module-Learning-Plan PublicTo understand the underlying structure of kernels through abstract algebra, category theory and wherever the structure/patterns may takes us
Cuda
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.
