🔨 Harness Forge

Your AI coding agent is only as good as its harness.
One command to scan, equip, and continuously improve any AI coding agent in your repository.

🚀 Get Started • 🔄 Living Loop • 📊 Dashboard • 🧭 Decisions • ⌨️ Commands • 💡 Scenarios • 🎯 Targets • ❓ FAQ

🔍 Scans & Equips

Your AI agent gets your repo's languages, frameworks, and patterns from the first prompt

🔄 Self-Improves

A closed feedback loop learns what works, tunes itself, and gets smarter every session

📊 Full Visibility

Real-time dashboard shows every decision, token spend, and compaction — no black boxes

	Without Harness Forge	With Harness Forge
🧠 Context	Agent guesses at project structure	Agent knows your languages, frameworks, boundaries
⚡ Performance	Starts fresh every session	Self-improves over time via the Living Loop
📊 Visibility	Black box — no idea what the agent decided	Real-time dashboard with 20 live panels
🧭 Decisions	ADRs get buried or forgotten	Chronological decision timeline with stale-decision checks
💰 Cost	Wasted tokens on retries and wrong paths	Compaction + auto-tuning saves 20-40%
📤 Portability	Stuck on one machine, one setup	Export & import learned patterns as `.hfb` bundles

🔄 The Living Loop — Your Harness Gets Smarter

Most tools configure once and forget. Harness Forge keeps learning.

  ┌──────────┐    ┌──────────┐    ┌──────────┐    ┌──────────┐    ┌──────────┐
  │ 🔍       │    │ 🧠       │    │ ⚡       │    │ 📤       │    │ 📥       │
  │ OBSERVE  │───▶│  LEARN   │───▶│  ADAPT   │───▶│  SHARE   │───▶│  IMPORT  │
  │          │    │          │    │          │    │          │    │          │
  │ Tracks   │    │ Finds    │    │ Auto-    │    │ Export   │    │ Bootstrap│
  │ sessions │    │ patterns │    │ tunes    │    │ bundles  │    │ anywhere │
  └──────────┘    └──────────┘    └──────────┘    └──────────┘    └──────────┘
        ▲                                                              │
        └──────────────────────────────────────────────────────────────┘

📅 Day 1 — You install

npx @harness-forge/cli

Scans your repo. Installs skills, rules, knowledge packs. Default settings. Everything works out of the box.

📅 Day 3 — After ~10 sessions

🧠 Pattern found: "Summarize" saves 40% more tokens
   than "Trim" in this repo (confidence: 82%)

⚡ Auto-tuned: compaction threshold 75% → 65%
   Result: 20% fewer budget warnings

📅 Day 5 — Share with your team

hforge export --bundle my-team.hfb
# Send to a teammate →
hforge import my-team.hfb
# They get your learned patterns instantly

📅 Ongoing — Dashboard shows it all

hforge dashboard

Loop health ring, effectiveness scores, pattern list, tuning log — live in your browser.

The more you use it, the better it gets. After ~10 sessions, Harness Forge has learned your repo's patterns and tuned itself for optimal performance. No manual configuration needed.

🛡️ Guardrails — auto-tuning is safe

Every tunable parameter has hard min/max bounds — the tuner can't go wild
Every change is logged with before/after values and the pattern that triggered it
If the next 3 sessions score worse, the tuning is automatically reverted
Your manual config overrides are sacred — the tuner won't touch them
The dashboard shows every tuning with a one-click revert button

📊 Real-Time Dashboard

hforge dashboard — see everything, live in your browser.

🔄 Loop Ring

Live status of each loop stage with health score

📈 Effectiveness

Session score trend — are things getting better?

🧠 Insights

Discovered patterns with confidence bars

⚡ Tuning Log

Policy changes with one-click revert

📋 All 20 dashboard panels

Panel	What it shows
🔢 KPI Cards	Total events, tokens, enforcement level, budget gauge
📈 Event Timeline	Scatter plot of all events over time, color-coded by category
💾 Memory Pressure	Token usage line chart with threshold marklines
📊 Budget Breakdown	Donut chart of budget allocation (hot-path, output, tools, safety)
📋 Live Event Feed	Searchable, expandable table of every harness decision
🤖 Subagent Briefs	Delegated tasks, their context, and outcomes
📊 Brief Metrics	Subagent activity summary and success rates
🔇 Suppression Gauge	How many duplicate context items were removed
🚪 Expansion Gate	History access requests — granted vs denied
⚙️ Config Editor	Edit memory-policy, context-budget, load-order live
🔄 Loop Health Ring	Self-improvement cycle status with stage counts
📈 Effectiveness Trend	Session score sparkline (last 20 sessions)
🧠 Insights Panel	Discovered patterns with confidence and "NEW" badges
⚡ Tuning Log	Policy changes with before/after and revert button
📊 Event Distribution	Bar chart of top event types
⏱️ Event Rate	Events per minute over time
🗺️ Event Heatmap	Category × time heatmap
💰 Tokens Saved	Running counter of tokens saved by compaction
📊 Profile Distribution	Output profile selection breakdown
ℹ️ Session Info	Session ID, uptime, version, connection status

🔔 Desktop notifications for critical events — budget exceeded, memory rotation, tuning applied, pattern discovered.

🏢 Multi-project support — switch between projects in one dashboard. Your project list is saved in the browser.

🧭 Decision Timeline

hforge review --root . --json — see what your team decided, when it changed, and what needs attention.

AI work moves fast. The hard part is remembering why the team chose a path three weeks later. Harness Forge now turns ASR and ADR records into a simple timeline:

🕒 What happened?

Decisions are sorted by creation time, so the newest architecture choices are easy to find.

🔎 What changed?

Superseded ADRs point to the newer decision, so old notes do not fight new direction.

🚦 What needs review?

Stale proposals, broken links, and missing decision coverage show up in review output.

# Review decision health, lineage, and architecture coverage
hforge review --root . --json

# Generate a readable decision log for handoff or onboarding
hforge runtime decision-log --root . --json

In plain words: if a task is architecture-significant, Harness Forge helps the team answer:

Do we have a decision for this?
Is it still current?
Did another ADR replace it?
Are we shipping with an uncovered architecture change?

That makes ADRs useful day to day, not just documents people write once and forget.

🚀 Get Started in 60 Seconds

npx @harness-forge/cli

The CLI walks you through:

🎯 Which AI targets (Codex, Claude Code, or both)
📊 How deep (quick / recommended / advanced)
👀 Preview of exactly what gets created
✅ One confirmation and you're done

Then make hforge available on your PATH:

npx @harness-forge/cli shell setup --yes

One-liner for CI / scripts:

hforge init \
  --root . \
  --agent codex \
  --agent claude-code \
  --setup-profile recommended \
  --yes

Verify everything is healthy:

hforge doctor --root . --json

⌨️ Your Daily Workflow

Commands organized by when you use them — not alphabetically.

🌅 Starting a session

	Command	What it does
💡	`hforge next`	Recommends the single most useful action right now
🏥	`hforge doctor`	Full health check with evidence
🔄	`hforge refresh`	Regenerate runtime after code changes
📋	`hforge status`	Review what's installed

🔄 While working

	Command	What it does
📊	`hforge dashboard`	Open the real-time browser dashboard
🧭	`hforge review --root . --json`	Check decision health, lineage, and coverage
📝	`hforge runtime decision-log --root . --json`	Generate a readable decision timeline
📈	`hforge score`	Show recent session effectiveness scores
🧠	`hforge insights`	Browse learned patterns with confidence
⚡	`hforge adapt`	View/manage auto-tunings
🔍	`hforge trace`	View recent session traces
🔄	`hforge loop`	Living Loop health summary

📤 Sharing & maintenance

	Command	What it does
📦	`hforge export --bundle team.hfb`	Export tuned harness as portable bundle
📥	`hforge import team.hfb`	Bootstrap from a shared bundle
🔧	`hforge update`	Update harness to latest version in place
🔬	`hforge audit`	Verify install integrity
🔎	`hforge diff-install`	Check what drifted since last install
🧹	`hforge prune`	Clean up unused artifacts

🧬 Advanced

	Command	What it does
🗺️	`hforge cartograph`	Map repo structure and boundaries
🔍	`hforge recommend`	Evidence-backed setup recommendations
🧬	`hforge recursive plan "..."`	Structured recursive analysis for hard problems
🎯	`hforge target compare codex claude-code`	Side-by-side target comparison

💡 Real-World Scenarios

📂 "Just cloned a repo, want AI help"

cd my-project
npx @harness-forge/cli
# Done — AI assistant understands this project

🤝 "I use both Codex and Claude Code"

hforge init --agent codex --agent claude-code --yes
hforge target compare codex claude-code

Both agents share .hforge/ but get their own config bridges.

🔙 "Coming back to a project after a break"

hforge next
# Tells you: refresh runtime, review stale artifacts

👥 "Standardize AI setup across my team"

hforge export --bundle our-team.hfb
# Teammate runs:
hforge import our-team.hfb
# Same learned patterns, instant bootstrap

🎯 Supported Targets

	Codex	Claude Code
Runtime	✅ Full	✅ Full
Maintenance	✅ Full	✅ Full
Hooks	📄 Docs-driven	✅ Native
Plugins	📄 Manual	✅ Native
Shared `.hforge/`	✅ Yes	✅ Yes

Use both together — they share the same .hforge/ runtime.

hforge target compare codex claude-code

📦 What's Included

🌐 14 Language packs

TypeScript, Python, Java, Go, Kotlin, Rust, C++, .NET, PHP, Perl, Swift, Shell, Lua, PowerShell

🏗️ 12 Framework packs

React, Next.js, Vite, Express, FastAPI, Django, ASP.NET Core, Spring Boot, Laravel, Symfony, Gin, Ktor

🛠️ 45+ Skills

Language engineering, workflow orchestration, operational helpers, and specialized skills like incident triage, dependency upgrades, API contract review, database migration review, release readiness, and token-budget-optimizer for context-aware compaction.

⚙️ How It Works Under the Hood

🗂️ What gets created in your repo

Your Repo
  │
  ├── AGENTS.md              ← AI agents read this first
  ├── .agents/skills/        ← Discoverable skills
  ├── .codex/ or .claude/    ← Target-specific config
  └── .hforge/               ← Hidden canonical runtime
         ├── library/        ← Skills, rules, knowledge packs
         ├── runtime/        ← State, indexes, traces, insights
         ├── generated/      ← Command catalog, launchers
         └── templates/      ← Workflow templates

Visible bridges where AI agents need discovery. Hidden canonical layer where runtime content stays authoritative.

❓ FAQ

Do I need to install anything globally?

No. npx @harness-forge/cli runs directly. For the shorter hforge command, run hforge shell setup --yes once.

Does it change my source code?

Never. Harness Forge only creates its own files (AGENTS.md, .agents/, .hforge/, .codex/, .claude/). Your application code is untouched.

Can I use it in CI/CD?

Yes. Add --yes for non-interactive and --json for machine-readable output:

hforge init --root . --agent codex --setup-profile recommended --yes
hforge doctor --root . --json

How do I remove it?

Delete: .hforge/, .agents/, .codex/, .claude/, AGENTS.md. Your project is back to normal.

Does it send data anywhere?

No. Everything stays local under .hforge/. Nothing is ever sent to the internet. Inspect, delete, or back up anytime.

What Node.js version?

Node.js 22 or newer. Check with node --version.

📈 Project Activity

🤝 Contributing

See CONTRIBUTING.md for development setup and guidelines.

🙌 Acknowledgements

Harness Forge was inspired by github/spec-kit. Credit to the GitHub team for shaping cleaner workflow models.

📄 License

GPL-3.0 — see LICENSE.

Your AI agent deserves a better harness.
npx @harness-forge/cli

Name		Name	Last commit message	Last commit date
Latest commit History 140 Commits
.agents/skills		.agents/skills
.github/workflows		.github/workflows
agents		agents
assets		assets
commands		commands
contexts		contexts
docs		docs
examples		examples
hooks		hooks
knowledge-bases		knowledge-bases
manifests		manifests
mcp		mcp
profiles		profiles
rules		rules
schemas		schemas
scripts		scripts
skills		skills
src		src
targets		targets
templates		templates
tests		tests
.gitignore		.gitignore
.npmignore		.npmignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
RESEARCH-SOURCES.md		RESEARCH-SOURCES.md
VALIDATION.md		VALIDATION.md
eslint.config.js		eslint.config.js
install.ps1		install.ps1
install.sh		install.sh
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json
tsconfig.tsbuildinfo		tsconfig.tsbuildinfo
vitest.config.ts		vitest.config.ts

Folders and files

Latest commit

History

Repository files navigation

🔨 Harness Forge

🔍 Scans & Equips

🔄 Self-Improves

📊 Full Visibility

🔄 The Living Loop — Your Harness Gets Smarter

📅 Day 1 — You install

📅 Day 3 — After ~10 sessions

📅 Day 5 — Share with your team

📅 Ongoing — Dashboard shows it all

📊 Real-Time Dashboard

🔄 Loop Ring

📈 Effectiveness

🧠 Insights

⚡ Tuning Log

🧭 Decision Timeline

🕒 What happened?

🔎 What changed?

🚦 What needs review?

🚀 Get Started in 60 Seconds

⌨️ Your Daily Workflow

🌅 Starting a session

🔄 While working

📤 Sharing & maintenance

🧬 Advanced

💡 Real-World Scenarios

📂 "Just cloned a repo, want AI help"

🤝 "I use both Codex and Claude Code"

🔙 "Coming back to a project after a break"

👥 "Standardize AI setup across my team"

🎯 Supported Targets

📦 What's Included

⚙️ How It Works Under the Hood

❓ FAQ

📈 Project Activity

🤝 Contributing

🙌 Acknowledgements

📄 License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages