Package Exports
This package does not declare an exports field, so the exports above have been automatically detected and optimized by JSPM instead. If any package subpath is missing, it is recommended to post an issue to the original package (agent-workflow-studio) to support the "exports" field. If that is not possible, create a JSPM override to customize the exports field for this package.
Readme
Agent Workflow Studio
Built entirely by Codex in a single session (default 258k context), with Claude Code providing evaluation, suggestions, and code review. This project is both the tool and the proof that structured agent workflows work.
Make every Codex and Claude Code run leave an auditable evidence trail.
Zero dependencies - local-first - Git-native - npm-installable
Agent Workflow Studio turns:
- tasks into strong prompts
- runs into evidence
- evidence into refreshed docs and checkpoints
- long jobs into resumable handoffs instead of lost context
Try it in 1 minute
npm install agent-workflow-studio
npx agent-workflow init --root .
npx agent-workflow quick "My first task" --task-id T-001 --agent codex --root .
npx agent-workflow dashboard --root . --port 4173Then open http://localhost:4173.
For the cleaner helper-directory install flow, see docs/GETTING_STARTED.md.
Core capabilities
quick- create a durable task bundle in one step: profile refresh, task docs, prompt, run request, launch pack, and checkpointmemory:bootstrap- generate a local-only handoff prompt that helps Codex or Claude Code fill grounded project memoryrun:execute- launch a local adapter when you explicitly opt intocommandMode: exec, with shared preflight, logs, and evidence captureverification gate- compare repo-relative task scope against the current repository snapshot and show which scoped files still need explicit proofproof anchors- keep passed evidence and refreshed manual proof tied to content fingerprints, not fragilemtimealoneskills:generate- writeAGENTS.md,CLAUDE.md, and Claude slash commands so the workflow becomes part of the agent's default contextdashboard- inspect tasks, evidence, freshness, risks, execution state, and quick-create flows from a local control plane atlocalhost:4173
Built for agents too
Teach Codex and Claude Code the workflow automatically:
npx agent-workflow skills:generate --root .This writes AGENTS.md, CLAUDE.md, and Claude slash commands so the agent can follow the same task/evidence/checkpoint flow without manual setup.
See AGENT_GUIDE.md for the full workflow guide.
Architecture at a glance
Task creation Agent execution Evidence + resume
| | |
v v v
+-------------------------------------------------------------------+
| .agent-workflow/ |
| |
| memory/ tasks/T-001/ adapters/ |
| - product.md - task.md - codex.json |
| - architecture.md - context.md - claude-code.json|
| - rules.md - verification.md - custom *.json |
| - prompt.codex.md |
| - run-request.codex.json |
| - launch.codex.md |
| - checkpoint.md |
| - runs/ evidence + proof anchors |
+-------------------------------------------------------------------+
| |
v v
Git-trackable repo Dashboard / CLIDaily workflow
- Create a task with
quickortask:new. - Hand the compiled prompt to Codex or Claude Code, or use
run:executewhen a local adapter is ready. - Review proof in
verification.mdand recorded runs under.agent-workflow/tasks/<taskId>/runs/. - Refresh
checkpoint.md, keep moving, and resume later without losing context.
Verification model
Two ideas sit at the center of the project:
- Verification gate: compare repo-relative task scope against the current repository snapshot (Git-backed when available, filesystem fallback otherwise) and explain which scoped files still need explicit proof.
- Proof anchor: persist content fingerprints with passed run evidence and refreshed manual proof, so freshness survives misleading
mtimechurn, branch switches, and agent handoff noise. Strong proof requirespaths + checks or artifacts; path-only proof stays weak.
Why this exists
Most teams using coding agents still lack:
- stable project memory
- structured task context
- trustworthy verification state
- resumable checkpoints
- a shared control plane across Codex and Claude Code
Agent Workflow Studio is designed to become that missing layer.
Commands
- Onboarding:
init,scan,memory:bootstrap,memory:validate - Tasking:
recipe:list,quick,task:new,task:list - Adapters:
adapter:list,adapter:create - Execution:
prompt:compile,run:prepare,run:execute,run:add,checkpoint - Inspection:
dashboard,validate - Skills:
skills:generate
Adapter layer
Adapters bridge the workflow layer and real agent CLIs.
- Built-in Codex and Claude Code adapters ship as
manualby default - Switch to
commandMode: execwhen you are ready to automate local runs adapter:createscaffolds a custom adapter for any CLI agentstdinMode: promptFilelets non-interactive CLIs receive prompts over stdin- Execution captures stdout/stderr, timeout, interruption, and cancellation metadata
- Shared preflight checks verify runner availability, env vars, and stdio compatibility before spawn
Both Codex and Claude Code have been dogfooded on this repository with real local runs. See docs/ADAPTERS.md for the full adapter contract and docs/RUN_EXECUTE_DESIGN.md for the executor design.
Recipes and schema validation
- Recipes (
audit,feature,review) are indexed in.agent-workflow/recipes/index.jsonand attached to tasks viarecipeId validatechecks project config, adapters, tasks, and run records for missing or malformed fields- The dashboard surfaces schema issues, memory freshness, and verification gate status in one view
See docs/RECIPES_AND_SCHEMA.md.
Relocatable by design
No absolute paths are written into workflow files. The CLI and dashboard resolve the target repository from --root or the current working directory. See docs/RELOCATABLE_DESIGN.md.
Layout
agent-workflow-studio/
src/ CLI + core modules
dashboard/ static frontend (zero build step)
docs/ design docs and guides
scripts/ smoke test + unit test runner
test/ unit testsInitialized target repository:
.agent-workflow/
project.json
project-profile.json / .md
memory/ product, architecture, domain-rules, runbook
recipes/ audit, feature, review + index.json
adapters/ codex.json, claude-code.json, custom *.json
tasks/ T-001/, T-002/, ...
handoffs/ memory-bootstrap.md
decisions/Contributor workflow
From this project root:
npm run init -- --root ../some-repo
npm run scan -- --root ../some-repo
npm run memory:bootstrap -- --root ../some-repo
npm run quick -- "Build the scanner" --task-id T-001 --priority P1 --agent codex --root ../some-repo
npm run dashboard -- --root ../some-repo --port 4173
npm run run:execute -- T-001 --agent codex --root ../some-repo
npm run run:add -- T-001 "Scanner pass completed." --status passed --root ../some-repo
npm run checkpoint -- T-001 --root ../some-repo
npm run validate -- --root ../some-repo
npm testLearn more
- Getting Started - the full npm-first onboarding flow
- Documentation Index - the map for all design and reference docs
- Architecture - how the scaffold, dashboard, adapters, and evidence model fit together
- Verification Design - verification gates, proof anchors, and freshness rules
- Executor Design - local executor planning, preflight, and evidence capture
- Adapters - built-in adapters and custom adapter scaffolding
- Roadmap - the likely next build steps
- Publishing - npm release checklist
Contributing
Read CONTRIBUTING.md. Keep changes local-first, relocatable, and schema-aware. Run npm test and npm run smoke before opening a PR.
Community
CODE_OF_CONDUCT.md defines how we collaborate. Issues and PRs should stay focused on strong prompts, evidence quality, checkpoints, and agent handoff durability.
License
Released under the MIT License. See LICENSE.