Package Exports
- agent-workflow-studio
- agent-workflow-studio/dist/cli.js
- agent-workflow-studio/dist/index.js
- agent-workflow-studio/dist/lib/adapters.js
- agent-workflow-studio/dist/lib/checkpoint.js
- agent-workflow-studio/dist/lib/dashboard-execution.js
- agent-workflow-studio/dist/lib/done.js
- agent-workflow-studio/dist/lib/evidence-collectors.js
- agent-workflow-studio/dist/lib/evidence-utils.js
- agent-workflow-studio/dist/lib/freshness.js
- agent-workflow-studio/dist/lib/fs-utils.js
- agent-workflow-studio/dist/lib/http-errors.js
- agent-workflow-studio/dist/lib/mcp-install.js
- agent-workflow-studio/dist/lib/mcp-prompts.js
- agent-workflow-studio/dist/lib/mcp-resources.js
- agent-workflow-studio/dist/lib/mcp-tools.js
- agent-workflow-studio/dist/lib/memory-bootstrap.js
- agent-workflow-studio/dist/lib/memory-placeholders.js
- agent-workflow-studio/dist/lib/memory-validator.js
- agent-workflow-studio/dist/lib/orchestrator.js
- agent-workflow-studio/dist/lib/overview.js
- agent-workflow-studio/dist/lib/prompt-compiler.js
- agent-workflow-studio/dist/lib/quick-task.js
- agent-workflow-studio/dist/lib/recipes.js
- agent-workflow-studio/dist/lib/repository-snapshot.js
- agent-workflow-studio/dist/lib/run-executor.js
- agent-workflow-studio/dist/lib/run-plan.js
- agent-workflow-studio/dist/lib/run-preflight.js
- agent-workflow-studio/dist/lib/run-preparer.js
- agent-workflow-studio/dist/lib/scanner.js
- agent-workflow-studio/dist/lib/schema-validator.js
- agent-workflow-studio/dist/lib/skill-generator.js
- agent-workflow-studio/dist/lib/smart-defaults.js
- agent-workflow-studio/dist/lib/task-documents.js
- agent-workflow-studio/dist/lib/task-service.js
- agent-workflow-studio/dist/lib/trust-summary.js
- agent-workflow-studio/dist/lib/undo-log.js
- agent-workflow-studio/dist/lib/undo.js
- agent-workflow-studio/dist/lib/verification-gates.js
- agent-workflow-studio/dist/lib/verification-proof.js
- agent-workflow-studio/dist/lib/webhook-evidence.js
- agent-workflow-studio/dist/lib/workspace.js
- agent-workflow-studio/dist/mcp-server.js
- agent-workflow-studio/dist/server.js
- agent-workflow-studio/package.json
- agent-workflow-studio/src/cli.js
- agent-workflow-studio/src/index.ts
- agent-workflow-studio/src/lib/adapters.js
- agent-workflow-studio/src/lib/checkpoint.js
- agent-workflow-studio/src/lib/dashboard-execution.js
- agent-workflow-studio/src/lib/done.js
- agent-workflow-studio/src/lib/evidence-collectors.js
- agent-workflow-studio/src/lib/evidence-utils.js
- agent-workflow-studio/src/lib/freshness.js
- agent-workflow-studio/src/lib/fs-utils.js
- agent-workflow-studio/src/lib/fs-utils.ts
- agent-workflow-studio/src/lib/http-errors.js
- agent-workflow-studio/src/lib/mcp-install.js
- agent-workflow-studio/src/lib/mcp-prompts.js
- agent-workflow-studio/src/lib/mcp-resources.js
- agent-workflow-studio/src/lib/mcp-tools.js
- agent-workflow-studio/src/lib/memory-bootstrap.js
- agent-workflow-studio/src/lib/memory-placeholders.js
- agent-workflow-studio/src/lib/memory-validator.js
- agent-workflow-studio/src/lib/orchestrator.js
- agent-workflow-studio/src/lib/overview.js
- agent-workflow-studio/src/lib/prompt-compiler.js
- agent-workflow-studio/src/lib/quick-task.js
- agent-workflow-studio/src/lib/recipes.js
- agent-workflow-studio/src/lib/repository-snapshot.js
- agent-workflow-studio/src/lib/run-executor.js
- agent-workflow-studio/src/lib/run-plan.js
- agent-workflow-studio/src/lib/run-preflight.js
- agent-workflow-studio/src/lib/run-preparer.js
- agent-workflow-studio/src/lib/scanner.js
- agent-workflow-studio/src/lib/schema-validator.js
- agent-workflow-studio/src/lib/skill-generator.js
- agent-workflow-studio/src/lib/smart-defaults.js
- agent-workflow-studio/src/lib/task-documents.js
- agent-workflow-studio/src/lib/task-service.js
- agent-workflow-studio/src/lib/trust-summary.js
- agent-workflow-studio/src/lib/undo-log.js
- agent-workflow-studio/src/lib/undo.js
- agent-workflow-studio/src/lib/verification-gates.js
- agent-workflow-studio/src/lib/verification-proof.js
- agent-workflow-studio/src/lib/webhook-evidence.js
- agent-workflow-studio/src/lib/workspace.js
- agent-workflow-studio/src/mcp-server.js
- agent-workflow-studio/src/server.js
Readme
Agent Workflow Studio
A workbench for human-agent collaboration on projects.
Not an AI coding assistant. A project management system where humans and agents work together to ship products.
What is this?
Agent Workflow Studio is a collaboration workbench that lets humans and AI agents work together on projects — not just code, but any project with tasks, deliverables, and quality requirements.
Think of it as:
- Jira for human-agent teams
- GitHub Actions for agent orchestration
- A shared workspace where work is visible, traceable, and handoff-ready
Not:
- A better autocomplete
- A chat interface for code
- A code review tool
Why does this exist?
Current AI tools (Cursor, Claude Code, Codex) are code assistants — they help you write code in the moment. But when you want to:
- Manage a project with multiple tasks
- Collaborate with multiple agents over days or weeks
- Hand off work between agents or sessions
- Track progress and quality across the project
- Run agents autonomously overnight
...you need a workbench, not an assistant.
Agent Workflow Studio is that workbench.
Core capabilities
1. Project-level management
Tasks, not files. Deliverables, not functions. Progress tracking, not just code completion.
# Create a task
npx agent-workflow quick "Add user authentication" --root .
# Agent works on it, records evidence
# (via MCP tools or CLI)
# View progress in dashboard
npx agent-workflow dashboard --root .2. Work leaves a trail
Every agent action creates evidence: what changed, what was tested, what was verified. This evidence serves two purposes:
- For humans: Trust, auditability, decision-making
- For agents: Context, handoff documents, knowledge base
3. Multi-agent collaboration
Agents can hand off work to each other with full context. No information loss.
// Agent A finishes part of the work
workflow_handoff({
taskId: "T-001",
summary: "Completed database schema",
remaining: "Need to update ORM models"
})
// Agent B picks up where A left off
workflow_pickup({
taskId: "T-001",
agent: "codex"
})
// Returns: full context, checkpoint, evidence so far4. Human-agent collaboration loop
Humans review agent work, approve or reject with feedback. Rejections create correction tasks automatically.
Dashboard → Review task → Approve (trust +10) or Reject with feedback → Agent continues
5. Autonomous execution
Run agents overnight. The orchestrator watches the task queue and spawns agent sessions automatically.
# Start the orchestrator
npx agent-workflow orchestrate --agent claude --root .
# Go to sleep
# Wake up, open dashboard, see what agents completedQuick start
Install
npm install agent-workflow-studio
cd your-project
npx agent-workflow init --root .
npx agent-workflow scan --root .Option 1: Use with MCP (Claude Code, Codex, Cursor)
npx agent-workflow mcp:install --client claude --root .Then in your editor:
"Create a task called 'Add authentication'"
"Work on T-001 and record evidence when done"Option 2: Use from CLI
# Create task
npx agent-workflow quick "Add authentication" --lite --root .
# Work on it (manually or with agent)
# ...
# Record evidence
npx agent-workflow done T-001 "Completed auth flow" --complete --root .Option 3: Run autonomously
# Create tasks
npx agent-workflow quick "Task 1" --root .
npx agent-workflow quick "Task 2" --root .
# Start orchestrator (spawns agents to work on tasks)
npx agent-workflow orchestrate --agent claude --root .
# Open dashboard to monitor
npx agent-workflow dashboard --root .Architecture
┌─────────────────────────────────────────────────────────┐
│ Dashboard (Web UI) │
│ Review · Approve · Monitor Progress │
└─────────────────────────────────────────────────────────┘
▲
│
┌─────────────────────────────────────────────────────────┐
│ Orchestrator (Daemon) │
│ Watches queue · Spawns agents · Manages work │
└─────────────────────────────────────────────────────────┘
▲
│
┌─────────────────────────────────────────────────────────┐
│ Task Queue (MCP) │
│ Claimable tasks · Priority sorting │
└─────────────────────────────────────────────────────────┘
▲
│
┌─────────────────────────────────────────────────────────┐
│ Agents (Claude Code, Codex, etc) │
│ Claim tasks · Do work · Record evidence · Handoff │
└─────────────────────────────────────────────────────────┘
▲
│
┌─────────────────────────────────────────────────────────┐
│ Evidence Chain (Local) │
│ Git diffs · Test results · CI status · Proofs │
└─────────────────────────────────────────────────────────┘Key concepts
Tasks
The unit of work. Has a goal, scope, verification criteria, and evidence.
Evidence
What happened during the work. Git diffs, test results, manual proofs, CI status. Stored locally, append-only.
Trust score
Quality signal derived from evidence coverage, verification status, human review, and CI results. Helps humans decide what to approve.
Checkpoint
Snapshot of task state for resuming work. Includes context, progress, and next steps.
Handoff
Structured way for agents to pass work to each other. Includes summary, remaining work, and full context.
Orchestrator
Daemon that watches the task queue and spawns agent sessions automatically. Enables overnight autonomous work.
Use cases
Today: Software projects
- Multi-task feature development
- Refactoring with verification
- Bug fixes with evidence
- Overnight agent work
Tomorrow: Beyond code
The workbench model works for any project:
- Content creation: Tasks = articles, agents = writers/editors
- Data analysis: Tasks = analyses, agents = data scientists
- Business operations: Tasks = processes, agents = operators
The architecture is project-agnostic. Code is just the first domain.
Status: Experimental
This is a new product category. We're figuring it out with early users.
What works:
- Task management and tracking
- Evidence collection and display
- Multi-agent handoff
- Human review and approval
- Autonomous orchestration
- CI integration
What's experimental:
- Agent reliability in autonomous mode
- Trust score calibration
- Orchestrator stability over long runs
- Non-coding use cases
We need your feedback. Try it, break it, tell us what works and what doesn't.
Documentation
- Product Vision - Why this exists
- MCP Setup - Connect your agents
- Orchestrator Guide - Run agents autonomously
- CI Integration - Connect GitHub Actions
- Agent Guide - How agents use the workbench
- Roadmap - What's next
Requirements
- Node.js 18+
- Git repository
- MCP-compatible agent (Claude Code, Codex) or CLI usage
License
MIT
Contributing
This is experimental. We're learning what works. If you have ideas or feedback, open an issue or PR.
Agent Workflow Studio — Where humans and agents build together.