ShipGuard

Ship with confidence. ShipGuard finds bugs before your users do.

Three AI-powered modules. Use one, two, or all three. No test files to write.

📸 Visual E2E Debugger

Auto-discover routes, generate tests, mark bugs on screenshots — AI traces to source code and fixes automatically.

/sg-visual-run

🎬 Macro Recorder

Record your browser interactions and turn them into replayable tests. Like Excel's macro recorder, but for visual testing.

/sg-record http://localhost:3000

🔍 Code Audit

Parallel AI agents scan your entire codebase, find bugs, and fix them automatically. Race conditions, auth gaps, silent exceptions, resource leaks.

/sg-code-audit

Install

claude plugin add bacoco/shipguard
npm install -g agent-browser && agent-browser install --with-deps

⚠️ Token Usage — Code audits are token-intensive. standard (10 agents) ≈ 2M tokens. deep (15 agents, 2 rounds) ≈ 5M+. paranoid (20 agents, 3 rounds) can exceed 10M.

Visual E2E Debugger

Mark bugs directly on screenshots. The AI traces each annotation to source code and fixes it.

/sg-visual-run I changed the sidebar

Commands

Command	What it does
`/sg-visual-discover`	Scan codebase, generate YAML test manifests per route
`/sg-record <url>`	Record browser interactions as replayable test manifests
`/sg-visual-run [what]`	Execute manifests — natural language or flags
`/sg-visual-review`	Launch interactive screenshot review dashboard
`/sg-visual-fix`	Auto-fix bugs annotated in the review dashboard
`/sg-visual-review-stop`	Stop the review server

Smart Annotations (Gemini-style)

The review dashboard uses draggable annotation cards to mark visual bugs on screenshots. Click anywhere on a screenshot to place a pin, then describe the problem.

How it works:

Open a screenshot in the lightbox
Double-click anywhere on the image — a pin appears instantly (or click + Add Note first)
Click = point pin. Drag = rectangle zone selection (highlights the problem area)
Choose severity + type your note → a card appears connected to the pin
Drag the pin to reposition — zone, card, and leader line all move together
Drag the card separately to reposition just the label
Double-click a card to edit text/severity, click X to delete
Click Validate & Generate Report when done → produces fix-manifest.json with zone coordinates
Run /sg-visual-fix → AI reads your annotations + zone coords, traces to source code, fixes automatically

Severity colors:

Color	Level	When to use
🔴 Red	Critical	Broken layout, missing content, crash
🟠 Orange	High	Wrong alignment, color mismatch, bad spacing
🔵 Blue	Medium	Minor visual inconsistency, polish needed
⚪ Gray	Info	Suggestion, not a bug

sg-visual-run options

/sg-visual-run                                  # Interactive — choose scope
/sg-visual-run I changed the sidebar, check it  # Natural language
/sg-visual-run --from-audit                     # Test audit-impacted routes
/sg-visual-run --regressions                    # Re-run previously failed tests
/sg-visual-run --all                            # Full suite

--from-audit reads impacted_routes from audit-results.json — a natural bridge between the two features.

Discover options

/sg-visual-discover                    # Current project
/sg-visual-discover --all              # Full discovery
/sg-visual-discover --refresh-existing # Regenerate existing manifests

Supports Next.js (App Router & Pages Router), React Router, Vue, Angular.

Macro Recorder

Record what you do in the browser and turn it into a replayable test. Like Excel's macro recorder, but for visual testing.

/sg-record http://localhost:3000/dashboard --name my-test

How it works

Launch — Opens a Playwright browser with a floating toolbar
Navigate — Browse your app normally. Clicks, inputs, uploads are captured automatically
Check — Click the Check button, then click an element to mark it as an assertion
Undo / Delete / Pause — Fix mistakes without restarting
Stop — Saves a YAML manifest ready for /sg-visual-run

Test Library

Recorded tests appear as cards in the review dashboard under the Recorded Tests tab.

Select the tests you want to run, click Run — the command is ready to copy.

Two ways to create tests

	`/sg-visual-discover`	`/sg-record`
Source	AI scans your code	Human records interactions
When	After code changes	After manual QA, bug reproduction, new feature walkthrough
Output	Same YAML format	Same YAML format

Both feed into the same pipeline: sg-visual-run executes them, sg-visual-review shows results, sg-visual-fix fixes failures.

Options

/sg-record http://localhost:3000                              # Interactive — asks for name on stop
/sg-record http://localhost:3000 --name login-flow            # Preset name
/sg-record http://localhost:3000 --storage auth.json          # Skip login (reuse saved auth)
/sg-record http://localhost:3000 --save-storage auth.json     # Save auth for future recordings

Code Audit

Dispatch parallel AI agents to audit your entire codebase. Each agent reviews a non-overlapping zone, finds bugs, fixes them, and produces structured JSON. Watch progress in real-time on the Mission Control dashboard.

/sg-code-audit deep

Modes

Mode	Agents	Rounds	Coverage
quick	5	1	Surface scan
standard	10	1	Full codebase (default)
deep	15	2	Surface + runtime behavior
paranoid	20	3	Surface + runtime + edge cases & security

Multi-round depth

R1 — Null refs, missing guards, type mismatches
R2 — Race conditions, async pitfalls, state management
R3 — Edge cases, injection, auth bypass, data leaks

Smart Scope

By default, ShipGuard detects what changed and asks whether to limit the audit:

/sg-code-audit       # "12 files changed since main. Audit only what changed?"

Override with flags:

Flag	Effect
`--all`	Force full scope, skip the question
`--diff=<ref>`	Use a specific base reference
`--focus=path/`	Restrict to a directory
`--report-only`	Find bugs but do not fix them

Flags combine freely: /sg-code-audit deep --focus=src/ --report-only

Live Dashboard

At startup, the audit offers to open the Mission Control dashboard. The Code Audit tab shows real-time agent pods (running/done/pending), severity heatmap, bug table filterable by severity and free-text search. Polls every 3s during active audit.

Output

Results are written to audit-results.json:

summary — totals by severity and category
bugs[] — file, line, severity, description, fix status
impacted_routes[] — UI routes affected (consumed by /sg-visual-run --from-audit)

Supported languages

Python, TypeScript/React, Next.js, Infrastructure (Docker/YAML/CI), Go, Rust, JVM.

Compatibility

Built for Claude Code. Partial support for other AI CLIs:

Feature	Claude Code	Codex CLI / Gemini CLI
Code Audit (parallel)	✅ Full	❌ Requires Agent tool
Visual E2E Debugger	✅ Full	✅ agent-browser is CLI-independent
Macro Recorder	✅ Full	✅ Playwright is CLI-independent
Review Dashboard	✅ Full	✅ Pure Node.js
Visual Discover/Fix	✅ Full	✅ Bash + LLM prompts

The visual testing pipeline works with any AI CLI that can run shell commands and read/write files. Code audit parallelization requires Claude Code's Agent tool with worktree isolation.

Community adapters welcome.

Quick Start

# Install
claude plugin add bacoco/shipguard
npm install -g agent-browser && agent-browser install --with-deps

# Audit your code
/sg-code-audit

# Record a test manually
/sg-record http://localhost:3000

# Run all tests
/sg-visual-run

Configuration

Create visual-tests/_config.yaml:

base_url: "http://localhost:3000"
credentials:
  username: "testuser"
  password: "testpass"
build_command: "docker compose up -d --build frontend"  # optional

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 110 Commits
.claude-plugin		.claude-plugin
.github		.github
docs		docs
examples		examples
plugins/shipguard		plugins/shipguard
visual-tests		visual-tests
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ShipGuard

📸 Visual E2E Debugger

🎬 Macro Recorder

🔍 Code Audit

Install

Visual E2E Debugger

Commands

Smart Annotations (Gemini-style)

sg-visual-run options

Discover options

Macro Recorder

How it works

Test Library

Two ways to create tests

Options

Code Audit

Modes

Multi-round depth

Smart Scope

Live Dashboard

Output

Supported languages

Compatibility

Quick Start

Configuration

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ShipGuard

📸 Visual E2E Debugger

🎬 Macro Recorder

🔍 Code Audit

Install

Visual E2E Debugger

Commands

Smart Annotations (Gemini-style)

sg-visual-run options

Discover options

Macro Recorder

How it works

Test Library

Two ways to create tests

Options

Code Audit

Modes

Multi-round depth

Smart Scope

Live Dashboard

Output

Supported languages

Compatibility

Quick Start

Configuration

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages