Skip to content
@LocalKinAI

LocalKinAI

Private AI agent swarm on your machine — 100+ specialized agents, self-evolving, zero cloud dependency. Open-source kernel family at localkin.dev.

LocalKin — Private AI Agent Swarm on Your Machine

100+ specialized AI agents running on a single Mac Mini. Self-evolving. Zero cloud dependency.

localkin.ai · apps · api.localkin.dev · chat · localkin.dev · for developers

papers · YouTube · Discord · 𝕏 @LocalKinAI


Surfaces

localkin.ai — knowledge-grounded apps

Direct chat with AI distilled from real human writings. Every answer grounded in actual source texts under enforced retrieval — zero hallucination. Three standalone properties:

  • 清晨甘露 · Morning Manna — daily devotional reading + audio. Offline + background playback, lock-screen control, resumable across episodes.
  • 细拉 · Selah — converse with 44 spiritual masters spanning 19 centuries (Augustine, John Chrysostom, Brother Lawrence, Spurgeon, 倪柝声, 钟马田 …). Every reply cites the source work + page.
  • 岐黄 · Yellow Emperor's Way — converse with 39 TCM masters spanning 4,500 years (黄帝 → 当代国医大师). 498 books, 159 MB corpus, enforced citation.

api.localkin.dev — multi-agent chat hub

Specialist conversational agents — language tutoring, citizenship prep, spiritual companion, TCM diagnosis. 7-day free trial, Pro upgrade for continued use.

  • 🕊️ Madame Guyon — spiritual companion & inner way guide
  • 🎓 English Tutor — 100 real-life scenarios + pronunciation coach
  • 🏥 TCM Master — classical diagnosis across 12 master physicians
  • 🇺🇸 Citizenship Coach — 128 USCIS questions, bilingual
  • 🇨🇳 Chinese Tutor — ABC kids edition · 25 scenarios + quiz
  • 🇪🇸 Spanish Tutor — fun bilingual Spanish for kids

localkin.dev — open source for developers

The agent runtime that powers everything above, plus the open-source ecosystem and research papers documenting how it all works. Single 23 MB binary, zero cloud dependency, soul-file configured. See Core Innovations + Ecosystem below for the technical surface.


What is LocalKin

LocalKin is a Go-based AI agent runtime that orchestrates 100+ domain-expert agents through structured multi-round debates, autonomous scheduling, and self-improvement cycles — all from a single 23MB binary.

Core Innovations

Thin Soul + Fat Skill — Agent identity in 30-line YAML soul files. Skill logic runs as subprocesses, never sent to the LLM. Token-efficient and injection-resistant.

Zero-Token Heartbeat — Tri-chamber autonomous scheduling (pulse / schedule / idle) coordinates 100+ agents via MQTT without burning tokens. Agents wake on schedule, catch up after restarts, and roll back silent wakeups with zero memory pollution.

Conductor-Driven Swarm Debates — Domain experts debate independently, then update positions after seeing others' arguments. Consensus inertia detection prevents false agreement. Cascade amplification guards catch herding behavior.

Self-Evolving Architecture — SAGE four-step improvement loop (Challenger → Planner → Solver → Critic) with experiential reflective learning. 29+ autonomous improvement cycles, 68+ auto-fixes.

Genesis Protocol — Bare binary + soul file self-bootstraps: hardware probing → skill forging → self-testing → checkpoint resume.

Ecosystem

The KinClaw family fits in 4 layers — apps on top, raw macOS bindings at the bottom, kernels in the middle, and a single Mac shell binding it all together. Same SSE event protocol + same .soul.md format across every kernel — one client codebase drives the whole stack, future Linux/Windows shells reuse it without changes.

┌───────────────────────────────────────────────────────────────┐
│  LAYER 4 — apps + chat hub                                     │
│  localkin.ai (Selah · Heal · Manna)   api.localkin.dev (100+) │
└──────────────────────────┬────────────────────────────────────┘
                           │
┌──────────────────────────▼────────────────────────────────────┐
│  LAYER 3 — desktop shell  (Apache-2.0)                         │
│                                                               │
│   ⌘⌥K  ┌──────────────────────────────────┐                  │
│        │  kinclaw-mac v0.4.1  (SwiftUI)   │                  │
│        │  Chat ┊ Cowork ┊ Code            │                  │
│        └────────┬───────────┬─────────────┘                  │
└─────────────────┼───────────┼────────────────────────────────┘
                  │           │
        ┌─────────▼──┐  ┌─────▼──────┐
        │ :5001      │  │ :5002      │
        │ kinclaw    │  │ kincode    │   LAYER 2 — kernels
        │ v1.17.0    │  │ v0.10.0    │   (Apache-2.0 / MIT)
        │ 5 claws    │  │ 10 tools   │
        └────────┬───┘  └────────────┘
                 │
┌────────────────▼──────────────────────────────────────────────┐
│  LAYER 1 — Pure-Go macOS bindings  (KinKit, MIT)               │
│  sckit-go · kinax-go · input-go · kinrec                      │
└───────────────────────────────────────────────────────────────┘

Repos:

Project Description
ollamadiffuser Local AI image generation, zero cloud dependency Downloads
localkin-service-audio High-performance local STT & TTS services Downloads
kinclaw macOS computer-use agent — 5 claws + floating chat UI + voice. Agent operates your real Mac. 67.3% on macbench v0.1 (first reference run, 2026-05-08). Apache-2.0 · v1.17.0
kinclaw-mac Native macOS Spotlight shell for the kernel family. ⌘⌥K → Chat (100+ cloud agents) · Cowork (kinclaw 5 claws) · Code (kincode + repo). Apache-2.0 · v0.4.1
kincode AI coding assistant, 10MB single binary. HTTP+SSE server mode for desktop shells. MIT · v0.10.0
macbench First publicly published macOS-native computer-use benchmark. 369 task slots, 15 categories, agent-agnostic Go runner. Inspired by OSWorld; adapted for the macOS app surface OSWorld can't reach. MIT · v0.2.0
notebooklm-go Unofficial pure-Go client + CLI for Google NotebookLM's internal batchexecute RPC. Reverse-engineered. List notebooks, add PDF/URL/text sources, generate Audio Overviews / Mind Maps / Reports / Deep Research. Powers LocalKin's content publishing pipeline. Apache-2.0 · v0.2.2

KinKit — pure-Go macOS bindings (zero cgo, embedded dylib, go install-able)

The libraries that power 4 of KinClaw's 5 claws. Each is independently usable in any Go project that needs to drive macOS at the framework level — no Xcode, no cgo, no Swift bridge.

Library Description
sckit-go ScreenCaptureKit bindings · sub-20ms streams · powers screen claw MIT
kinax-go Accessibility API bindings · navigate UI trees via AXUIElement · powers ui claw MIT
input-go CGEvent mouse + keyboard synthesis · target_pid background mode · powers input claw MIT
kinrec Screen + audio recorder · built on sckit-go · powers record claw MIT

See the Embedded Dylib paper for the distribution pattern these all share.

The 5th claw — web (3-tier, cost-aware)

The other 4 claws are macOS-bound. The web claw is cross-platform and arguably the most-used in real flows, because most modern productivity lives in browser tabs — Gmail, Linear, Notion, GitHub, Booking, Airbnb, Google Flights, the LocalKin family's own apps (localkin.ai, faith.localkin.ai, heal.localkin.ai, api.localkin.dev).

KinClaw's web claw is 3 tiers, picked by Pilot based on what the task actually needs — cheap-and-fast first, expensive-and-flexible only when nothing else works:

Tier 0 · URL-first (~50ms, no startup cost)

The most important operational doctrine in pilot.soul.md — go straight to result URLs, skip GUI clicking entirely. Covers ~80% of real flows.

Capability What it does
shell open <URL> macOS URL-handler routing — maps://, mailto:, music://, https:// — land at destination state without any clicking
web_fetch <URL> Server-side fetch + HTML strip → clean text, no Chromium, no JS render
web_search DuckDuckGo (default) or Tavily (when TAVILY_API_KEY set)
14 baked-in URL templates Google Flights · Kayak · Skyscanner · Booking · Airbnb · Zillow · Maps · Amazon · YouTube · GitHub search · ArXiv · 12306 · Reddit · etc. — canonical patterns in the soul prompt so Pilot knows the deep-link grammar of every popular site

Why this beats GUI puppeteering for an LLM agent: calendar pickers (clicking "previous month" 30 times to land on July is doomed; ?checkin=2025-07-08 lands instantly), faceted filters (URL params take 4 dimensions atomically), cookie banners + modals (skipped entirely), SPA accessibility gaps (React tree without AX labels — URL bypasses the DOM).

Tier 1 · web skill (~3s cold start, single-step Playwright)

When URL-first can't reach (page needs JS to render, content lives behind a click, you need a screenshot of the rendered DOM), Pilot drops to the web skill — a Playwright-driven single-shot.

web url=X
web url=X click=".btn" type_text="hello"
web url=X js="document.body.innerText"
web url=X screenshot=true

One call = one Playwright session, ~3s cold start, returns immediately. For 99% of one-off web tasks: scrape a page, run a snippet of JS, fill+submit a form, grab a screenshot.

Tier 2 · browser_session super-skill (~10-20s warmup, multi-step)

For genuinely complex flows — login + navigate + extract, fill + submit + verify, search + sort + dig three pages deep — KinClaw wraps browser-use (the 91K★ OSS that is the de-facto LLM-driven browser agent library) as a single-argument super-skill:

browser_session task="登录 GitHub 找我未读 PR"
browser_session task="去 weather.com 输 zip code 95014 拿一周预报表"
browser_session task="open my Linear, find tickets assigned to me this week, summarize"

What you get under the hood:

  • Persistent session — login state, cookies, localStorage all survive across steps
  • DOM element numbering — browser-use auto-numbers every interactive element so the LLM can refer to "click element 17" instead of fragile CSS selectors
  • Visual reasoning — screenshot + numbered elements fed back to the planning LLM each step
  • Cross-page flows — natural to chain login → navigate → extract; the inner LLM plans steps, you just give it the goal

Trigger rule (from pilot.soul.md): the task description has 2+ interaction verbs (login + navigate + extract / fill + submit + verify / search + sort + dig).

Task Tier picked
"查 hacker news 头条" web_fetch (Tier 0)
"抓 example.com 的 H1" web (Tier 1)
"登录 github 找我未读 PR" browser_session (Tier 2 — login is the multi-step signal)
"去 weather.com 输 zip code 拿一周预报" browser_session (Tier 2 — multi-step interaction)

Cost-aware: browser_session burns LLM tokens per planning step (~$0.05–0.15 Claude per 5-step task). Pilot's soul says explicitly: don't reach for browser_session when web would do. The tier system isn't just capability — it's economics.

Strategic angle

Most LocalKin's own products are web — Selah, Heal, Morning Manna, the 100+-agent chat hub. When Pilot drives a LocalKin user flow, the web claw is doing the lift. All three tiers are cross-platform (shell + Node.js, no macOS framework dependencies) — future kinclaw-pal (Linux/Windows shell) inherits the whole web stack with zero rewrite. Only the 4 macOS-bound claws need platform-specific rebinding. The web tier is what makes the Linux/Win story viable.

Public benchmarks

macbench — first publicly published macOS-native computer-use benchmark

First reference run (2026-05-08):
  kinclaw v1.15.0 + Kimi-K2.5(cloud) on macbench v0.1
    IMPLEMENTED:  101 / 150  =  67.3%
    STRICT:       101 / 369  =  27.4%   (stubs count as fail)

For context, Anthropic Computer Use scores ~38% on OSWorld (Linux desktop). macbench measures a different surface (macOS native), so the numbers aren't directly comparable, but the methodology + scoring discipline are the same.

xlang-ai/OSWorld (NeurIPS 2024) became the de-facto standard for desktop computer-use agents — but it benchmarks inside an Ubuntu/Windows VM. Nobody had published a comparable benchmark for macOS native apps. macbench fills that gap with 369 task slots across 15 macOS app categories (Finder · Safari · Mail · Notes · Calendar · Reminders · Settings · Terminal · Pages · Numbers · Keynote · Music · Photos · Maps · multi-app), an agent-agnostic Go runner (any binary that takes a prompt + drives macOS plugs in via -agent + -agent-args template), and per-task PID-snapshot isolation that preserves any pre-existing user app state.

v0.1 ships 150 implemented + 219 stubs (real prompts, no setup/eval scripts yet); fill rate over v0.2 → v1.0 is roughly 30-50 stubs/month. License MIT.

Domains

100+ specialized agents across expert traditions, each grounded in real source texts:

  • Spiritual · 44 masters spanning 19 centuries → faith.localkin.ai (Augustine, John Chrysostom, Brother Lawrence, Spurgeon, 倪柝声, 钟马田 …)
  • Traditional Chinese Medicine · 39 masters spanning 4,500 years → heal.localkin.ai (黄帝 → 当代国医大师, 498 books / 159 MB corpus)
  • More domains in active development — engineering, quant finance, education, language tutoring, design, game dev, spatial computing, and growing.

Research Papers

All 11 papers are on Zenodo with permanent concept DOIs (CC-BY-4.0). Each PDF bundles EN + 中文 in a single document, generated directly from the canonical Markdown source. Concept DOIs auto-resolve to the latest version, so citations stay correct across revisions.

# Title DOI
1 Self-Evolving Swarms 10.5281/zenodo.20094223
2 Zero-Token Heartbeat 10.5281/zenodo.20094225
3 Genesis Protocol 10.5281/zenodo.20094230
4 Grep Is All You Need · code 10.5281/zenodo.19777260
5 Thin Soul, Fat Skill 10.5281/zenodo.20094232
6 Domain Expert Debate 10.5281/zenodo.20094236
7 Knowledge Compile 10.5281/zenodo.20094238
8 Autonomous Swarm Genesis 10.5281/zenodo.20094240
9 Embedded Dylib · code 10.5281/zenodo.20094242
10 macbench · code 10.5281/zenodo.20094244
11 Grep-Routed Agents: Bypassing the LLM Tax on Computer-Use Tasks 10.5281/zenodo.20131046

Contact

📧 contact@localkin.ai

Popular repositories Loading

  1. ollamadiffuser ollamadiffuser Public

    Python 21 5

  2. kinclaw kinclaw Public

    A single-binary AI agent. 23 MB, zero dependencies, configured through Markdown.

    Go 8 2

  3. kincode kincode Public

    AI coding assistant in Go. 9MB single binary, zero dependencies. MCP support, Soul files for custom AI personas. 10 built-in tools + sub-agents + web search + memory. The only Claude Code alternati…

    Go 5 2

  4. localkin-service-audio localkin-service-audio Public

    Python 4

  5. kinbrowser kinbrowser Public

    Markdown-native browser for AI agents. Chrome renders pixels you don't read; kinbrowser extracts markdown you do. Three escalating backends (HTTP+readability → Lightpanda → chromedp), one uniform m…

    Go 2

  6. notebooklm-go notebooklm-go Public

    Unofficial pure-Go client (and CLI) for Google NotebookLM's internal batchexecute RPC. Reverse-engineered. List notebooks, add PDF/URL/text sources, generate Audio Overviews / Mind Maps / Reports /…

    Go 1

Repositories

Showing 10 of 14 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…