Portable agent workflows I actually use.
This repo is my working layer for Codex, Claude Code, and any future agent host that can load the same underlying methods. The point is not to collect prompts. The point is to package durable ways of working: study the system before changing it, verify claims against live evidence, preserve the user's intent, converge messy work into clean artifacts, and keep each workflow in a namespace that can be installed or removed cleanly.
Plugins are the organizing unit. A plugin should feel like a coherent vertical, not a folder of unrelated tricks.
From the repo root:
codex plugin marketplace add .Restart Codex, open /plugins, install the plugin you want, then start a new thread.
Common invocations:
$workbench:devour
$workbench:research-report
$workbench:max-prompt
$agents:birdwatch
$agents:autoreview
$videos:storyboard
$extra:trail-scriber
After publishing, the marketplace can be added by repo instead:
codex plugin marketplace add kennykankush/skillpackAdd the marketplace once:
/plugin marketplace add github.com/kennykankush/skillpack
Install the bundles you want:
/plugin install workbench@kennykankush-skillpack
/plugin install agents@kennykankush-skillpack
/plugin install videos@kennykankush-skillpack
/plugin install extra@kennykankush-skillpack
/plugin install agent-announcer-when-agent-finishes@kennykankush-skillpack
/reload-plugins
I built this pack around the failure modes I keep seeing when agents become part of real work instead of one-off demos.
The failure mode is shallow context. The agent reads the nearest files, makes a plausible edit, and misses the route, lifecycle, schema, auth state, test seam, or live behavior that actually controls the bug.
The fix is workbench:devour: a codebase mastery mode that builds a grounded atlas before implementation. It maps entrypoints, runtime flows, temporal behavior, blast radius, safe extension points, verification commands, and unknowns. The goal is judgment, not file-count tourism.
The failure mode is research sprawl. A topic starts with a useful question and ends with fragments spread across chat, browser history, Markdown scraps, and half-rendered drafts.
The fix is workbench:research-report: one research engine with two outputs. Scan mode answers inline when the question is lightweight. Official mode converges to exactly notes.md and report.html under research/<umbrella>/<title>/. The shape matters because the end state stays inspectable.
The failure mode is vague feedback. "This feels off" becomes random CSS churn, generic polish, or a prompt that describes the desired mood without telling an engineer what to change.
The fix is workbench:max-prompt for UI/product feedback and the videos pack for motion work. max-prompt translates instinct into mechanism, ownership, constraints, implementation direction, and verification. videos separates narrative, visual contract, storyboard generation, frame QA, batching, and motion handoff so taste does not collapse into a single generation prompt.
The failure mode is guessing from stale memory. A user tests an app, the agent summarizes vibes, and nobody checks the logs, API, DB rows, browser state, branch state, PR state, or issue state that would prove what happened.
The fix is agents:birdwatch: a live watcher posture for product/dev sweeps. It observes first, records discrepancies, separates user-visible friction from backend/data bugs, and avoids destructive actions unless explicitly asked. For code closeout, agents:autoreview runs a structured review helper, verifies accepted findings against the real code path, applies narrow fixes only when appropriate, and reruns until no actionable findings remain.
The failure mode is flat skill sprawl. Useful workflows get installed through different mechanisms, names collide, related tools drift apart, and uninstalling leaves leftovers.
The fix is plugin-first packaging. Each vertical has its own namespace, metadata, README, skills, commands, scripts, and host adapters. workbench:skill-advisor helps choose from the installed toolkit. workbench:skill-distiller turns a successful workflow into a reusable skill. The Claude-side skill-manager agent handles lifecycle across plugins, npx skills, skillfish, and manual installs.
The failure mode is invisible completion. Multiple terminal tabs are running, one agent finishes or requests input, and the result sits unnoticed.
The fix is agent-announcer-when-agent-finishes: a hook plugin that speaks a one-line contextual handover with the Ghostty tab number. It can summarize through OpenAI-compatible chat endpoints and speak through local Qwen, ElevenLabs, OpenAI TTS, or macOS say.
The shared posture across this repo:
- Portable first. The core workflow should outlive Codex, Claude Code, or any single model. Host-specific commands and manifests are adapters.
- Evidence before narration. If live state is cheap to inspect, inspect it before explaining it.
- Vertical bundles over grab bags. A plugin should represent one workflow domain with clear boundaries.
- Human-gated mutation. Watching, reviewing, researching, and reconstructing can be active without being destructive.
- Converged artifacts. Research, memory, trail reconstruction, storyboard batches, and review closeout should have clear final states.
- User voice matters. Some workflows preserve exact phrasing because wording carries intent that summaries often erase.
- Install only what earns its place. Experiments can live elsewhere. A plugin lands here after it has proven useful in day-to-day work.
Everyday meta-work for coding agents.
workbench:devour- Codebase mastery mode before risky implementation, refactors, migrations, debugging, or onboarding. Persists its atlas toMAP.mdat the repo root.workbench:bedrock- Foundation audit mode: adversarial inspection of load-bearing logic to bank-grade, limit-tested feature flows, runnable repros, and a persistentAUDIT.mdledger. Report or report-then-fix.workbench:potential- Generative counterpart to bedrock: reads the structure and surfaces what it wants to become, every vision cited to real beams and graded by honest distance. Open mode or wish mode; never implements, never writes files.workbench:vision- Keeps a project'sVISION.mdwritten and alive: birth (handoff from exploration), backfill (repos with no vision doc), refresh (drifted intent). The vision comes from the visionary - evidence drafts, the user's voice decides.workbench:research-report- Deep research with scan mode for inline answers and official mode fornotes.mdplus renderedreport.html.workbench:memory-scriber- Captures the residue of a meaningful session into the active host's memory directory.workbench:max-prompt- Converts fuzzy feedback, screenshots, taste, or frustration into an implementation-ready coding-agent prompt.workbench:skill-advisor- Recommends from already-installed skills without installing anything.workbench:skill-distiller- Generalizes a successful workflow into a reusableSKILL.md.workbench:isomorph- Systemic-thinking mode that maps a system onto its mature twin in another domain to surface required invariants and blindspots.skill-manager- Claude Code agent for discovering, installing, updating, and cleaning up skills across multiple mechanisms.
Full plugin docs: plugins/workbench/README.md
Reusable operating modes for live work and closeout.
agents:birdwatch- Evidence-backed watcher mode while a user tests a product flow, API, DB state, logs, issues, or PR behavior.agents:autoreview- Structured code-review closeout for local changes, branch diffs, or commits, with verified findings and focused fixes.
Full plugin docs: plugins/agents/README.md
Product-launch and SaaS motion preproduction.
videos:launch-narrative- Shapes the product film spine before visual generation.videos:style-contract- Captures visual rules from product screenshots, brand assets, and references.videos:storyboard- Generates cohesive storyboard frames with Codex imagegen and quality gates.videos:frame-qa- Reviews frames or contact sheets and gives keep/regenerate decisions.videos:batch-prep- Copies accepted frames into upload-ready AI-video batches with manifests and prompt placeholders.videos:motion-handoff- Writes keyframe prompts and node workflows for tools like Higgsfield Canvas, Gemini/Omni, Runway, and Krea.
Full plugin docs: plugins/videos/README.md
Personal workflows that are durable enough to package but do not belong in Workbench.
extra:trail-scriber- Reconstructs day-by-day work trails from Codex and Claude conversation evidence into Obsidian-friendly day, agenda, domain, and conversation notes.
Full plugin docs: plugins/extra/README.md
Small runtime utilities that make agent work easier to notice.
agent-announcer-when-agent-finishes- Speaks a contextual one-line handover when Codex or Claude finishes a turn or needs input.
Full plugin docs: plugins/agent-announcer-when-agent-finishes/README.md
plugins/
workbench/ meta-work: devour, research, memory, prompting, skill tooling
agents/ reusable operating modes: birdwatch, autoreview
videos/ launch-film and AI-video preproduction
extra/ personal workflows outside Workbench
agent-announcer-when-agent-finishes/ hook-based spoken completion handoffs
Each plugin carries its own host metadata:
.codex-plugin/plugin.json
.claude-plugin/plugin.json
The root marketplace files expose the bundle:
.agents/plugins/marketplace.json
.claude-plugin/marketplace.json
Flat skills are useful for quick experiments. This repo is for workflows I depend on.
Plugins give the pack a few properties that matter in daily use:
- Namespacing.
workbench,agents,videos, andextrastay distinct instead of competing in one global list. - Bundling. Related skills, commands, scripts, templates, hooks, and metadata travel together.
- Clean lifecycle. Install, upgrade, pin, or remove a vertical as a unit.
- Host adapters. Codex and Claude Code can expose different commands while sharing the same method.
- A discoverable home. A plugin reads as a maintained workflow package, not a scattered set of files.
The cost is a bit of manifest scaffolding. The payoff is a toolkit with boundaries.
MIT.