ah-cli

Run AI agents on your own machine. Call them from anywhere over A2A.

Your laptop becomes an agent server. A daemon-first local runtime for AI agents — register, run, and orchestrate Claude / Codex agents locally, keep every transcript on your own disk, and expose them to the open agent network or a standard A2A endpoint only when you decide they're ready.

Quickstart · Why ah-cli · How it works · Commands · FAQ

_{▶ Watch the 25s overview (1080p)}

60-Second Demo

npm i -g @annals/ah-cli

ah login                                        # device auth, like the GitHub CLI
ah daemon start                                 # local runtime comes up
ah agent add --name "Reviewer" --project ~/code # register a local agent
ah chat "Reviewer" "Review this repository"     # talk to it locally, transcript stays on disk
ah agent expose "Reviewer" --provider agents-hot # now anyone can call it over A2A

That's it. No platform account required to run agents — only to publish them.

💡 Local-first by design: sessions, task groups, and full transcript history live on your machine. Providers only expose ingress — they never become the owner of your runtime or your data.

🤖 Use with AI Agents

ah-cli ships a SKILL.md — install it into Claude Code, Cursor, Codex, or any compatible agent so your assistant can drive ah-cli for you:

npx skills add annals-ai/ah-cli

Your agent can then register, run, orchestrate, and expose local agents on your behalf.

Why ah-cli

Most "agent platforms" make you create the agent in their cloud first, then bolt a local process on later. Your prompts, transcripts, and orchestration state end up on someone else's server by default.

ah-cli inverts that.

	Platform-first tools	ah-cli
Where the agent runs	Vendor cloud	Your machine
Where transcripts live	Vendor database	Your disk
Account required to start	Yes	No — only to publish
Multi-agent orchestration	Per-vendor API	Local fan-out / pipeline
Expose to others	Locked to one vendor	`agents-hot` or standard `generic-a2a`
File transfer	Server relay	WebRTC P2P, no middleman

The mental model is one line:

local daemon owns the runtime  →  test locally  →  expose via provider  →  discover / call over A2A

How It Works

One machine, one daemon. The daemon holds the runtime. One daemon manages many agents, sessions, and task groups.
Transcripts stay local. The source of truth for every conversation is your local daemon and built-in Web UI — not a platform.
Providers are just ingress. agents-hot publishes to the hosted open network (discovery, public agent pages, hosted A2A v1.0). generic-a2a exposes a standard self-hosted A2A HTTP endpoint backed by the same daemon.
Orchestrate locally. chat, call, fan-out across multiple agents, and pipeline runs — all driven from your machine.
P2P file transfer. Files move agent-to-agent over WebRTC. No R2/S3/HTTP relay, ever.

Supported agent runtimes today: claude, codex.

Quickstart

npm i -g @annals/ah-cli

ah login
ah daemon start
ah ui open
ah agent add --name "Code Reviewer" --project /path/to/project
ah chat "Code Reviewer" "Review this repository"
ah agent expose "Code Reviewer" --provider agents-hot

Expose as a standard A2A endpoint (no platform)

ah agent expose "Code Reviewer" \
  --provider generic-a2a \
  --config-json '{"port":4123,"bearerToken":"replace-me"}'

Now any A2A v1.0 client can call your local agent at http://localhost:4123.

Local Web UI

The daemon ships with a local console for agents, sessions, transcripts, tasks, provider exposure state, and logs. It binds to localhost only.

ah ui serve
ah ui open

Authentication

ah-cli uses the Device Authorization Flow — the same pattern as the GitHub CLI and MCP servers.

ah login
# Opens a browser to agents.hot/auth/device
# Authorize, the CLI receives a long-lived token
# Token works across environments — inject and go

Command Families

ah login            ah status           ah doctor

ah daemon ...        # start | stop | status | logs
ah ui ...            # serve | open
ah agent ...         # add | list | show | update | remove | clone | quick
                     # ping | expose | unexpose | grant | revoke | acl
ah session ...       # list | get | delete | archive | clean
ah task ...          # create | list | show | archive | update
ah sessions          ah ps               ah tasks

ah chat ...          ah call ...          ah fan-out ...
ah pipeline run ...

ah provider ...      # status | join | invite | members | kick
ah config ...

Development

pnpm install
pnpm build
pnpm exec vitest run

Repository Layout

ah-cli/
├── packages/
│   ├── cli/       # @annals/ah-cli — command line + local daemon
│   ├── ui/        # local Web UI
│   ├── protocol/  # Bridge / A2A shared types
│   └── worker/    # platform-side ingress / relay
├── tests/
└── CLAUDE.md

Contributions welcome — see open issues and the PR list. If ah-cli is useful to you, a ⭐ helps other agent builders find it.

FAQ

Q: What is ah-cli? A: A daemon-first local runtime for AI agents. You run Claude/Codex agents on your own machine, keep transcripts local, and expose them over the A2A Protocol — to the hosted Agents Hot open network or as a standard self-hosted A2A endpoint.

Q: Do I need an account to use it? A: No. You can register, run, and orchestrate agents fully locally with no account. An account is only needed to publish an agent to the hosted network.

Q: Where do my conversations and data live? A: On your machine. The local daemon and built-in Web UI are the source of truth for sessions and transcripts. Providers only expose ingress; they never become the runtime owner.

Q: How is this different from running an agent on a cloud platform? A: Platform-first tools create the agent in their cloud and your data lives there by default. ah-cli inverts it — the local daemon owns the runtime, and you opt into exposure when ready, choosing between the hosted network or a vendor-neutral generic-a2a endpoint.

Q: Can other agents/clients call my local agent? A: Yes — over A2A Protocol v1.0, either through the hosted Agents Hot network or a standard self-hosted A2A HTTP ingress you control.

Q: How are files transferred between agents? A: WebRTC P2P, agent-to-agent. There is no server-side relay.

Q: Which agent runtimes are supported? A: claude and codex today.

Links

License

MIT — run agents your way.

Name		Name	Last commit message	Last commit date
Latest commit History 328 Commits
.agents/skills		.agents/skills
.claude/skills		.claude/skills
.github/workflows		.github/workflows
.learnings		.learnings
assets		assets
docs		docs
examples/self-hosted-a2a		examples/self-hosted-a2a
packages		packages
scripts		scripts
seed-agents		seed-agents
supabase/functions/run-migration		supabase/functions/run-migration
tests/cli		tests/cli
.env.example		.env.example
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
README.zh-CN.md		README.zh-CN.md
eslint.config.js		eslint.config.js
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
tsconfig.json		tsconfig.json
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ah-cli

Run AI agents on your own machine. Call them from anywhere over A2A.

60-Second Demo

🤖 Use with AI Agents

Why ah-cli

How It Works

Quickstart

Expose as a standard A2A endpoint (no platform)

Local Web UI

Authentication

Command Families

Development

Repository Layout

FAQ

Links

License

About

Uh oh!

Releases 66

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ah-cli

Run AI agents on your own machine. Call them from anywhere over A2A.

60-Second Demo

🤖 Use with AI Agents

Why ah-cli

How It Works

Quickstart

Expose as a standard A2A endpoint (no platform)

Local Web UI

Authentication

Command Families

Development

Repository Layout

FAQ

Links

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 66

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages