Package Exports

agent-workflow-studio
agent-workflow-studio/dist/cli.js
agent-workflow-studio/dist/index.js
agent-workflow-studio/dist/lib/adapters.js
agent-workflow-studio/dist/lib/checkpoint.js
agent-workflow-studio/dist/lib/dashboard-execution.js
agent-workflow-studio/dist/lib/done.js
agent-workflow-studio/dist/lib/evidence-collectors.js
agent-workflow-studio/dist/lib/evidence-utils.js
agent-workflow-studio/dist/lib/freshness.js
agent-workflow-studio/dist/lib/fs-utils.js
agent-workflow-studio/dist/lib/http-errors.js
agent-workflow-studio/dist/lib/mcp-install.js
agent-workflow-studio/dist/lib/mcp-prompts.js
agent-workflow-studio/dist/lib/mcp-resources.js
agent-workflow-studio/dist/lib/mcp-tools.js
agent-workflow-studio/dist/lib/memory-bootstrap.js
agent-workflow-studio/dist/lib/memory-placeholders.js
agent-workflow-studio/dist/lib/memory-validator.js
agent-workflow-studio/dist/lib/orchestrator.js
agent-workflow-studio/dist/lib/overview.js
agent-workflow-studio/dist/lib/prompt-compiler.js
agent-workflow-studio/dist/lib/quick-task.js
agent-workflow-studio/dist/lib/recipes.js
agent-workflow-studio/dist/lib/repository-snapshot.js
agent-workflow-studio/dist/lib/run-executor.js
agent-workflow-studio/dist/lib/run-plan.js
agent-workflow-studio/dist/lib/run-preflight.js
agent-workflow-studio/dist/lib/run-preparer.js
agent-workflow-studio/dist/lib/scanner.js
agent-workflow-studio/dist/lib/schema-validator.js
agent-workflow-studio/dist/lib/skill-generator.js
agent-workflow-studio/dist/lib/smart-defaults.js
agent-workflow-studio/dist/lib/task-documents.js
agent-workflow-studio/dist/lib/task-service.js
agent-workflow-studio/dist/lib/trust-summary.js
agent-workflow-studio/dist/lib/undo-log.js
agent-workflow-studio/dist/lib/undo.js
agent-workflow-studio/dist/lib/verification-gates.js
agent-workflow-studio/dist/lib/verification-proof.js
agent-workflow-studio/dist/lib/webhook-evidence.js
agent-workflow-studio/dist/lib/workspace.js
agent-workflow-studio/dist/mcp-server.js
agent-workflow-studio/dist/server.js
agent-workflow-studio/package.json
agent-workflow-studio/src/cli.js
agent-workflow-studio/src/index.ts
agent-workflow-studio/src/lib/adapters.js
agent-workflow-studio/src/lib/checkpoint.js
agent-workflow-studio/src/lib/dashboard-execution.js
agent-workflow-studio/src/lib/done.js
agent-workflow-studio/src/lib/evidence-collectors.js
agent-workflow-studio/src/lib/evidence-utils.js
agent-workflow-studio/src/lib/freshness.js
agent-workflow-studio/src/lib/fs-utils.js
agent-workflow-studio/src/lib/fs-utils.ts
agent-workflow-studio/src/lib/http-errors.js
agent-workflow-studio/src/lib/mcp-install.js
agent-workflow-studio/src/lib/mcp-prompts.js
agent-workflow-studio/src/lib/mcp-resources.js
agent-workflow-studio/src/lib/mcp-tools.js
agent-workflow-studio/src/lib/memory-bootstrap.js
agent-workflow-studio/src/lib/memory-placeholders.js
agent-workflow-studio/src/lib/memory-validator.js
agent-workflow-studio/src/lib/orchestrator.js
agent-workflow-studio/src/lib/overview.js
agent-workflow-studio/src/lib/prompt-compiler.js
agent-workflow-studio/src/lib/quick-task.js
agent-workflow-studio/src/lib/recipes.js
agent-workflow-studio/src/lib/repository-snapshot.js
agent-workflow-studio/src/lib/run-executor.js
agent-workflow-studio/src/lib/run-plan.js
agent-workflow-studio/src/lib/run-preflight.js
agent-workflow-studio/src/lib/run-preparer.js
agent-workflow-studio/src/lib/scanner.js
agent-workflow-studio/src/lib/schema-validator.js
agent-workflow-studio/src/lib/skill-generator.js
agent-workflow-studio/src/lib/smart-defaults.js
agent-workflow-studio/src/lib/task-documents.js
agent-workflow-studio/src/lib/task-service.js
agent-workflow-studio/src/lib/trust-summary.js
agent-workflow-studio/src/lib/undo-log.js
agent-workflow-studio/src/lib/undo.js
agent-workflow-studio/src/lib/verification-gates.js
agent-workflow-studio/src/lib/verification-proof.js
agent-workflow-studio/src/lib/webhook-evidence.js
agent-workflow-studio/src/lib/workspace.js
agent-workflow-studio/src/mcp-server.js
agent-workflow-studio/src/server.js

Readme

Agent Workflow Studio

A workbench for human-agent collaboration on projects.

Not an AI coding assistant. A project management system where humans and agents work together to ship products.

What is this?

Agent Workflow Studio is a collaboration workbench that lets humans and AI agents work together on projects — not just code, but any project with tasks, deliverables, and quality requirements.

Think of it as:

Jira for human-agent teams
GitHub Actions for agent orchestration
A shared workspace where work is visible, traceable, and handoff-ready

Not:

A better autocomplete
A chat interface for code
A code review tool

Why does this exist?

Current AI tools (Cursor, Claude Code, Codex) are code assistants — they help you write code in the moment. But when you want to:

Manage a project with multiple tasks
Collaborate with multiple agents over days or weeks
Hand off work between agents or sessions
Track progress and quality across the project
Run agents autonomously overnight

...you need a workbench, not an assistant.

Agent Workflow Studio is that workbench.

Core capabilities

1. Project-level management

Tasks, not files. Deliverables, not functions. Progress tracking, not just code completion.

# Create a task
npx agent-workflow quick "Add user authentication" --root .

# Agent works on it, records evidence
# (via MCP tools or CLI)

# View progress in dashboard
npx agent-workflow dashboard --root .

2. Work leaves a trail

Every agent action creates evidence: what changed, what was tested, what was verified. This evidence serves two purposes:

For humans: Trust, auditability, decision-making
For agents: Context, handoff documents, knowledge base

3. Multi-agent collaboration

Agents can hand off work to each other with full context. No information loss.

// Agent A finishes part of the work
workflow_handoff({
  taskId: "T-001",
  summary: "Completed database schema",
  remaining: "Need to update ORM models"
})

// Agent B picks up where A left off
workflow_pickup({
  taskId: "T-001",
  agent: "codex"
})
// Returns: full context, checkpoint, evidence so far

4. Human-agent collaboration loop

Humans review agent work, approve or reject with feedback. Rejections create correction tasks automatically.

Dashboard → Review task → Approve (trust +10) or Reject with feedback → Agent continues

5. Autonomous execution

Run agents overnight. The orchestrator watches the task queue and spawns agent sessions automatically.

# Start the orchestrator
npx agent-workflow orchestrate --agent claude --root .

# Go to sleep
# Wake up, open dashboard, see what agents completed

Quick start

Install

npm install agent-workflow-studio
cd your-project
npx agent-workflow init --root .
npx agent-workflow scan --root .

Option 1: Use with MCP (Claude Code, Codex, Cursor)

npx agent-workflow mcp:install --client claude --root .

Then in your editor:

"Create a task called 'Add authentication'"
"Work on T-001 and record evidence when done"

Option 2: Use from CLI

# Create task
npx agent-workflow quick "Add authentication" --lite --root .

# Work on it (manually or with agent)
# ...

# Record evidence
npx agent-workflow done T-001 "Completed auth flow" --complete --root .

Option 3: Run autonomously

# Create tasks
npx agent-workflow quick "Task 1" --root .
npx agent-workflow quick "Task 2" --root .

# Start orchestrator (spawns agents to work on tasks)
npx agent-workflow orchestrate --agent claude --root .

# Open dashboard to monitor
npx agent-workflow dashboard --root .

Architecture

┌─────────────────────────────────────────────────────────┐
│                     Dashboard (Web UI)                   │
│              Review · Approve · Monitor Progress         │
└─────────────────────────────────────────────────────────┘
                            ▲
                            │
┌─────────────────────────────────────────────────────────┐
│                  Orchestrator (Daemon)                   │
│         Watches queue · Spawns agents · Manages work     │
└─────────────────────────────────────────────────────────┘
                            ▲
                            │
┌─────────────────────────────────────────────────────────┐
│                    Task Queue (MCP)                      │
│           Claimable tasks · Priority sorting             │
└─────────────────────────────────────────────────────────┘
                            ▲
                            │
┌─────────────────────────────────────────────────────────┐
│              Agents (Claude Code, Codex, etc)            │
│      Claim tasks · Do work · Record evidence · Handoff   │
└─────────────────────────────────────────────────────────┘
                            ▲
                            │
┌─────────────────────────────────────────────────────────┐
│                  Evidence Chain (Local)                  │
│        Git diffs · Test results · CI status · Proofs     │
└─────────────────────────────────────────────────────────┘

Key concepts

Tasks

The unit of work. Has a goal, scope, verification criteria, and evidence.

Evidence

What happened during the work. Git diffs, test results, manual proofs, CI status. Stored locally, append-only.

Trust score

Quality signal derived from evidence coverage, verification status, human review, and CI results. Helps humans decide what to approve.

Checkpoint

Snapshot of task state for resuming work. Includes context, progress, and next steps.

Handoff

Structured way for agents to pass work to each other. Includes summary, remaining work, and full context.

Orchestrator

Daemon that watches the task queue and spawns agent sessions automatically. Enables overnight autonomous work.

Use cases

Today: Software projects

Multi-task feature development
Refactoring with verification
Bug fixes with evidence
Overnight agent work

Tomorrow: Beyond code

The workbench model works for any project:

Content creation: Tasks = articles, agents = writers/editors
Data analysis: Tasks = analyses, agents = data scientists
Business operations: Tasks = processes, agents = operators

The architecture is project-agnostic. Code is just the first domain.

Status: Experimental

This is a new product category. We're figuring it out with early users.

What works:

Task management and tracking
Evidence collection and display
Multi-agent handoff
Human review and approval
Autonomous orchestration
CI integration

What's experimental:

Agent reliability in autonomous mode
Trust score calibration
Orchestrator stability over long runs
Non-coding use cases

We need your feedback. Try it, break it, tell us what works and what doesn't.

Documentation

Product Vision - Why this exists
MCP Setup - Connect your agents
Orchestrator Guide - Run agents autonomously
CI Integration - Connect GitHub Actions
Agent Guide - How agents use the workbench
Roadmap - What's next

Requirements

Node.js 18+
Git repository
MCP-compatible agent (Claude Code, Codex) or CLI usage

License

MIT

Contributing

This is experimental. We're learning what works. If you have ideas or feedback, open an issue or PR.

Agent Workflow Studio — Where humans and agents build together.