Distributed-Inference-Orchestrator

A lightweight system that discovers Apple Silicon Macs on a network, profiles their hardware (memory, bandwidth, GPU cores), and computes the optimal layer split for running LLM inference across multiple machines. It generates ready-to-run llama.cpp commands with the correct --rpc endpoints and --tensor-split ratios, so you don't have to figure out the distribution yourself. Agents heartbeat their specs to a central registry, and an interactive CLI lets you review the plan before launching.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
agent		agent
common		common
launcher		launcher
planner		planner
registry		registry
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Distributed-Inference-Orchestrator

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Distributed-Inference-Orchestrator

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages