inferernce

Star

Here are 2 public repositories matching this topic...

ShankarKakumani / inference

Star

Zero-setup ML inference for Flutter using Rust engines (Candle, Linfa)

dart rust machine-learning ai ml pytorch neural-networks flutter candle linfa inferernce

Updated Jul 12, 2025
Dart

Hook12aaa / qwen3-mlx

Star

C++23 inference engine for Qwen3-4B on Apple Silicon. 125.77 tok/s (1.62x llama-cli). OpenAI-compatible streaming server. No Python, no llama.cpp, no Ollama.

mlx qwen3 inferernce

Updated May 21, 2026
C++

Improve this page

Add a description, image, and links to the inferernce topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the inferernce topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

inferernce

Here are 2 public repositories matching this topic...

ShankarKakumani / inference

Hook12aaa / qwen3-mlx

Improve this page

Add this topic to your repo