Your AI coding agent is only as good as its harness.
One command to scan, equip, and continuously improve any AI coding agent in your repository.
🚀 Get Started • 🔄 Living Loop • 📊 Dashboard • 🧭 Decisions • ⌨️ Commands • 💡 Scenarios • 🎯 Targets • ❓ FAQ
|
Your AI agent gets your repo's languages, frameworks, and patterns from the first prompt |
A closed feedback loop learns what works, tunes itself, and gets smarter every session |
Real-time dashboard shows every decision, token spend, and compaction — no black boxes |
| Without Harness Forge | With Harness Forge | |
|---|---|---|
| 🧠 Context | Agent guesses at project structure | Agent knows your languages, frameworks, boundaries |
| ⚡ Performance | Starts fresh every session | Self-improves over time via the Living Loop |
| 📊 Visibility | Black box — no idea what the agent decided | Real-time dashboard with 20 live panels |
| 🧭 Decisions | ADRs get buried or forgotten | Chronological decision timeline with stale-decision checks |
| 💰 Cost | Wasted tokens on retries and wrong paths | Compaction + auto-tuning saves 20-40% |
| 📤 Portability | Stuck on one machine, one setup | Export & import learned patterns as .hfb bundles |
Most tools configure once and forget. Harness Forge keeps learning.
┌──────────┐ ┌──────────┐ ┌──────────┐ ┌──────────┐ ┌──────────┐
│ 🔍 │ │ 🧠 │ │ ⚡ │ │ 📤 │ │ 📥 │
│ OBSERVE │───▶│ LEARN │───▶│ ADAPT │───▶│ SHARE │───▶│ IMPORT │
│ │ │ │ │ │ │ │ │ │
│ Tracks │ │ Finds │ │ Auto- │ │ Export │ │ Bootstrap│
│ sessions │ │ patterns │ │ tunes │ │ bundles │ │ anywhere │
└──────────┘ └──────────┘ └──────────┘ └──────────┘ └──────────┘
▲ │
└──────────────────────────────────────────────────────────────┘
npx @harness-forge/cliScans your repo. Installs skills, rules, knowledge packs. Default settings. Everything works out of the box. |
|
hforge export --bundle my-team.hfb
# Send to a teammate →
hforge import my-team.hfb
# They get your learned patterns instantly |
hforge dashboardLoop health ring, effectiveness scores, pattern list, tuning log — live in your browser. |
The more you use it, the better it gets. After ~10 sessions, Harness Forge has learned your repo's patterns and tuned itself for optimal performance. No manual configuration needed.
🛡️ Guardrails — auto-tuning is safe
- Every tunable parameter has hard min/max bounds — the tuner can't go wild
- Every change is logged with before/after values and the pattern that triggered it
- If the next 3 sessions score worse, the tuning is automatically reverted
- Your manual config overrides are sacred — the tuner won't touch them
- The dashboard shows every tuning with a one-click revert button
hforge dashboard— see everything, live in your browser.
|
Live status of each loop stage with health score |
Session score trend — are things getting better? |
Discovered patterns with confidence bars |
Policy changes with one-click revert |
📋 All 20 dashboard panels
| Panel | What it shows |
|---|---|
| 🔢 KPI Cards | Total events, tokens, enforcement level, budget gauge |
| 📈 Event Timeline | Scatter plot of all events over time, color-coded by category |
| 💾 Memory Pressure | Token usage line chart with threshold marklines |
| 📊 Budget Breakdown | Donut chart of budget allocation (hot-path, output, tools, safety) |
| 📋 Live Event Feed | Searchable, expandable table of every harness decision |
| 🤖 Subagent Briefs | Delegated tasks, their context, and outcomes |
| 📊 Brief Metrics | Subagent activity summary and success rates |
| 🔇 Suppression Gauge | How many duplicate context items were removed |
| 🚪 Expansion Gate | History access requests — granted vs denied |
| ⚙️ Config Editor | Edit memory-policy, context-budget, load-order live |
| 🔄 Loop Health Ring | Self-improvement cycle status with stage counts |
| 📈 Effectiveness Trend | Session score sparkline (last 20 sessions) |
| 🧠 Insights Panel | Discovered patterns with confidence and "NEW" badges |
| ⚡ Tuning Log | Policy changes with before/after and revert button |
| 📊 Event Distribution | Bar chart of top event types |
| ⏱️ Event Rate | Events per minute over time |
| 🗺️ Event Heatmap | Category × time heatmap |
| 💰 Tokens Saved | Running counter of tokens saved by compaction |
| 📊 Profile Distribution | Output profile selection breakdown |
| ℹ️ Session Info | Session ID, uptime, version, connection status |
🔔 Desktop notifications for critical events — budget exceeded, memory rotation, tuning applied, pattern discovered.
🏢 Multi-project support — switch between projects in one dashboard. Your project list is saved in the browser.
hforge review --root . --json— see what your team decided, when it changed, and what needs attention.
AI work moves fast. The hard part is remembering why the team chose a path three weeks later. Harness Forge now turns ASR and ADR records into a simple timeline:
|
Decisions are sorted by creation time, so the newest architecture choices are easy to find. |
Superseded ADRs point to the newer decision, so old notes do not fight new direction. |
Stale proposals, broken links, and missing decision coverage show up in review output. |
# Review decision health, lineage, and architecture coverage
hforge review --root . --json
# Generate a readable decision log for handoff or onboarding
hforge runtime decision-log --root . --jsonIn plain words: if a task is architecture-significant, Harness Forge helps the team answer:
- Do we have a decision for this?
- Is it still current?
- Did another ADR replace it?
- Are we shipping with an uncovered architecture change?
That makes ADRs useful day to day, not just documents people write once and forget.
npx @harness-forge/cliThe CLI walks you through:
Then make npx @harness-forge/cli shell setup --yes |
One-liner for CI / scripts: hforge init \
--root . \
--agent codex \
--agent claude-code \
--setup-profile recommended \
--yesVerify everything is healthy: hforge doctor --root . --json |
Commands organized by when you use them — not alphabetically.
| Command | What it does | |
|---|---|---|
| 💡 | hforge next |
Recommends the single most useful action right now |
| 🏥 | hforge doctor |
Full health check with evidence |
| 🔄 | hforge refresh |
Regenerate runtime after code changes |
| 📋 | hforge status |
Review what's installed |
| Command | What it does | |
|---|---|---|
| 📊 | hforge dashboard |
Open the real-time browser dashboard |
| 🧭 | hforge review --root . --json |
Check decision health, lineage, and coverage |
| 📝 | hforge runtime decision-log --root . --json |
Generate a readable decision timeline |
| 📈 | hforge score |
Show recent session effectiveness scores |
| 🧠 | hforge insights |
Browse learned patterns with confidence |
| ⚡ | hforge adapt |
View/manage auto-tunings |
| 🔍 | hforge trace |
View recent session traces |
| 🔄 | hforge loop |
Living Loop health summary |
| Command | What it does | |
|---|---|---|
| 📦 | hforge export --bundle team.hfb |
Export tuned harness as portable bundle |
| 📥 | hforge import team.hfb |
Bootstrap from a shared bundle |
| 🔧 | hforge update |
Update harness to latest version in place |
| 🔬 | hforge audit |
Verify install integrity |
| 🔎 | hforge diff-install |
Check what drifted since last install |
| 🧹 | hforge prune |
Clean up unused artifacts |
| Command | What it does | |
|---|---|---|
| 🗺️ | hforge cartograph |
Map repo structure and boundaries |
| 🔍 | hforge recommend |
Evidence-backed setup recommendations |
| 🧬 | hforge recursive plan "..." |
Structured recursive analysis for hard problems |
| 🎯 | hforge target compare codex claude-code |
Side-by-side target comparison |
cd my-project
npx @harness-forge/cli
# Done — AI assistant understands this project |
hforge init --agent codex --agent claude-code --yes
hforge target compare codex claude-codeBoth agents share |
hforge next
# Tells you: refresh runtime, review stale artifacts |
hforge export --bundle our-team.hfb
# Teammate runs:
hforge import our-team.hfb
# Same learned patterns, instant bootstrap |
|
Use both together — they share the same hforge target compare codex claude-code |
🌐 14 Language packs
TypeScript, Python, Java, Go, Kotlin, Rust, C++, .NET, PHP, Perl, Swift, Shell, Lua, PowerShell
🏗️ 12 Framework packs
React, Next.js, Vite, Express, FastAPI, Django, ASP.NET Core, Spring Boot, Laravel, Symfony, Gin, Ktor
🛠️ 45+ Skills
Language engineering, workflow orchestration, operational helpers, and specialized skills like incident triage, dependency upgrades, API contract review, database migration review, release readiness, and token-budget-optimizer for context-aware compaction.
🗂️ What gets created in your repo
Your Repo
│
├── AGENTS.md ← AI agents read this first
├── .agents/skills/ ← Discoverable skills
├── .codex/ or .claude/ ← Target-specific config
└── .hforge/ ← Hidden canonical runtime
├── library/ ← Skills, rules, knowledge packs
├── runtime/ ← State, indexes, traces, insights
├── generated/ ← Command catalog, launchers
└── templates/ ← Workflow templates
Visible bridges where AI agents need discovery. Hidden canonical layer where runtime content stays authoritative.
Do I need to install anything globally?
No. npx @harness-forge/cli runs directly. For the shorter hforge command, run hforge shell setup --yes once.
Does it change my source code?
Never. Harness Forge only creates its own files (AGENTS.md, .agents/, .hforge/, .codex/, .claude/). Your application code is untouched.
Can I use it in CI/CD?
Yes. Add --yes for non-interactive and --json for machine-readable output:
hforge init --root . --agent codex --setup-profile recommended --yes
hforge doctor --root . --jsonHow do I remove it?
Delete: .hforge/, .agents/, .codex/, .claude/, AGENTS.md. Your project is back to normal.
Does it send data anywhere?
No. Everything stays local under .hforge/. Nothing is ever sent to the internet. Inspect, delete, or back up anytime.
What Node.js version?
Node.js 22 or newer. Check with node --version.
See CONTRIBUTING.md for development setup and guidelines.
Harness Forge was inspired by github/spec-kit. Credit to the GitHub team for shaping cleaner workflow models.
GPL-3.0 — see LICENSE.
Your AI agent deserves a better harness.
npx @harness-forge/cli

