Pull requests · ggml-org/llama.cpp

New pull request New

1,060 Open 11,150 Closed

examples server

#24190 opened Jun 5, 2026 by fiesh

Loading…

ggml Vulkan

#24186 opened Jun 5, 2026 by 0cc4m Contributor

Loading…

examples ggml

#24185 opened Jun 5, 2026 by charlie12345

Loading…

documentation examples ggml python server testing

#24179 opened Jun 5, 2026 by marty1885

Loading…

1 task done

testing

#24178 opened Jun 5, 2026 by tdakhran Contributor

Loading…

ggml Nvidia GPU

#24172 opened Jun 5, 2026 by zihaomu

Loading…

examples python

#24163 opened Jun 5, 2026 by tc-mb Contributor

Loading…

ggml model python script testing

#24162 opened Jun 5, 2026 by am17an Contributor • Draft

ggml OpenCL

#24160 opened Jun 5, 2026 by lhez Contributor • Draft

examples python server

#24154 opened Jun 5, 2026 by Anuj-Attri

Loading…

ggml SYCL

#24152 opened Jun 5, 2026 by Spruill-1

Loading…

examples server

#24150 opened Jun 4, 2026 by ngxson Contributor

Loading…

examples python server

#24143 opened Jun 4, 2026 by alainnothere

Loading…

ggml Nvidia GPU

#24129 opened Jun 4, 2026 by harkgill-amd

Loading…

ggml Nvidia GPU

#24127 opened Jun 4, 2026 by JohannesGaessler Contributor

Loading…

vulkan: add v_dot2_f32_f16 support in matrix-matrix multiplication and Flash Attention ggml Vulkan

#24123 opened Jun 4, 2026 by 0cc4m Contributor

Loading…

examples ggml python

#24122 opened Jun 4, 2026 by Donovoi • Draft

ProTip! Type g p on any issue or pull request to go back to the pull request listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Pull requests: ggml-org/llama.cpp

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Pull requests list