JSPM – conductor-agents@0.7.0

Package Exports

This package does not declare an exports field, so the exports above have been automatically detected and optimized by JSPM instead. If any package subpath is missing, it is recommended to post an issue to the original package (conductor-agents) to support the "exports" field. If that is not possible, create a JSPM override to customize the exports field for this package.

Readme

conductor

Multi-agent orchestrator with eval-gated quality gates. Spawn parallel agent workers per track, verify output before merging.

Local multi-agent tools like Octogent are compelling — parallel workers, visible progress, git isolation. But they share the same fatal flaw: workers run until they say they're done, with no way to know if the output is actually correct. Sessions die when you close the terminal. There's no retry, no quality gate, no remote control.

conductor is what that category of tool looks like when built for real work. It uses evalgate as its quality gate layer — workers can't merge until their verifier passes.

How it works

You organize your project into tracks — scoped areas of ownership, each with a CONTEXT.md (what this area owns) and a todo.md (eval-gated tasks).

.conductor/
  config.json
  tracks/
    auth/
      CONTEXT.md      ← what this track owns + constraints
      todo.md         ← eval-gated tasks for this area
    payments/
      CONTEXT.md
      todo.md

Each task in todo.md follows the evalgate contract format:

- [ ] Add JWT validation middleware
  - eval: `npm test -- --testPathPattern=auth`
  - retries: 2

- [ ] Write integration tests for /login
  - eval: `npm run test:integration`

When you run conductor run auth, it spawns one agent worker per pending task — each in its own git worktree. A worker's changes only merge back to main when the eval: verifier exits 0.

Install

npm install -g conductor-agents

Or run directly:

npx conductor-agents help

Quick start

# 1. Initialize in your repo
cd your-project
git init  # must be a git repo
conductor init

# Option A — generate a plan from a goal (v0.5+)
conductor plan "add JWT authentication with middleware and login endpoint"
# Review .conductor/plan-draft.md, then:
conductor plan apply

# Option B — add tracks manually
conductor add auth --desc="Authentication layer" --files="src/auth/**"
# Edit .conductor/tracks/auth/todo.md with your tasks

# 2. Run
conductor run auth         # single track
conductor run --all        # all tracks in parallel

# 3. Check results
conductor status auth

CLI reference

Command	Description
`conductor init [--yes]`	Create `.conductor/` — `--yes` skips the first-run wizard
`conductor add <name> [opts]`	Add a new track
`conductor rm <name>`	Remove a track
`conductor list`	List all tracks with color-coded progress bars
`conductor run <name> [opts]`	Run a track's worker swarm
`conductor run --all`	Run all tracks sequentially
`conductor retry <worker-id> <track>`	Retry a failed worker
`conductor logs <worker-id> <track>`	Print a worker's session log
`conductor status [name]`	Show worker states with duration and eval result
`conductor plan "<goal>"`	Generate tracks + tasks from a natural-language goal
`conductor plan apply [--dry-run]`	Apply the generated plan draft to create tracks
`conductor plan show`	Print the current plan draft
`conductor report [track]`	Show cost report — tokens + estimated USD per track
`conductor telegram setup`	Configure Telegram bot token + chat ID
`conductor telegram [start]`	Start Telegram bot (foreground)
`conductor ui [--port=8080]`	Start web dashboard
`conductor help`	Show usage

`conductor add` options

--desc="description"                    Track description
--files="src/auth/**,src/users/**"      Owned file globs (comma-separated)

`conductor run` options

--concurrency=N        Worker concurrency (default: 3)
--agent=cmd            Agent command (default: claude)
--resume               Resume from existing state, skip done workers

`conductor logs` options

--follow, -f           Tail the log live (polls every 500ms until Ctrl+C)

Worker IDs can be abbreviated — any unique prefix works:

conductor logs eb2eadd4 auth
conductor logs eb2eadd4 auth --follow

Task format

Tasks live in .conductor/tracks/<name>/todo.md and follow the evalgate contract format:

- [ ] Task title
  - eval: `shell verifier command`
  - retries: 2

The verifier command runs inside the worker's git worktree after the agent finishes. Exit 0 = merge. Anything else = fail (and retry if retries remain).

Composite verifiers are supported:

- [ ] Build, lint, and test all pass
  - eval.all: `npm run build` | `npm run lint` | `npm test`

- [ ] README is clear
  - eval.llm: Does README.md explain the auth flow in plain English?

AI planning (`conductor plan`)

conductor plan uses an LLM agent to turn a natural-language goal into a ready-to-run multi-track plan. The agent explores your codebase, designs track ownership, and writes eval:-gated tasks for each track.

# Generate a plan (agent explores your codebase, writes .conductor/plan-draft.md)
conductor plan "add JWT auth with middleware, login endpoint, and rate limiting"

# Preview what would be created without applying
conductor plan apply --dry-run

# Apply — creates tracks with CONTEXT.md and todo.md
conductor plan apply

# Run everything
conductor run --all

The generated todo.md files are ready for conductor run — each task has a verifier that evalgate will run after the agent completes its work.

Web dashboard

conductor ui
# → http://localhost:8080

The dashboard shows all tracks in a sidebar with progress bars, and workers in the main panel with status badges (pending / running / verifying / done / failed). Failed workers have a Retry button. All updates stream live via SSE — no page refresh needed.

Relationship to evalgate

conductor uses evalgate as a library for its worker execution engine. evalgate handles:

Spawning agents in git worktrees
Running verifiers after each worker completes
Persisting swarm state to .evalgate/swarm-state.json
Emitting events that conductor's SSE server re-broadcasts to the dashboard

You don't need to install or run evalgate separately. It's a bundled dependency.

If you want evalgate's standalone features (pre-commit hooks, MCP server, trigger daemon, ANSI dashboard), install it separately — the two tools are independent and coexist fine in the same repo.

Track CONTEXT.md

Each track gets a CONTEXT.md that describes what the track owns. The conductor prompt builder injects this into each worker agent's system prompt, giving workers scoped context without polluting each other.

# auth

Authentication and session management layer.

## Owned files
- `src/auth/**`
- `src/middleware/auth.ts`

## Constraints
- Never store plaintext credentials
- All tokens must be signed with the app secret
- Session expiry must be <= 24h

Edit this file freely. The next conductor run will pick it up.

Roadmap

Version	Feature	Status
v0.1.0	Core track model, CLI, eval-gated workers, SSE dashboard	Shipped
v0.1.1	Fix: initial UI state loading, progress bar derived from workers	Shipped
v0.2	Interactive init wizard, live log streaming in UI, color CLI output, `conductor logs` command	Shipped
v0.3	Rename: tentacle → track (breaking change, filesystem path updated)	Shipped
v0.4	Telegram bot gateway — run/retry/status from phone	Shipped
v0.5	`conductor plan "<goal>"` — LLM generates tracks + tasks automatically	Shipped
v0.6	Per-track cost tracking, `conductor report`, `/api/tracks/:id/cost`, SSE cost events	Shipped
v0.7	UI v2 — 4-tab layout: Tracks deck (kanban with eval badges), Workers, History, Settings	Shipped
v0.8	MCP server — `conductor mcp` over stdio; tools: list/run/retry/status/cost from any Claude conversation	Planned
v0.9	Scheduling + webhooks — `conductor schedule add <track> "<cron>"`, `conductor webhook start`	Planned
v1.0	Stable API, full docs, Docker image, `conductor doctor`	Planned

Contributing

git clone https://github.com/jorgejac1/conductor-agents
cd conductor
npm install
npm run build
npm test

PRs welcome. Run npm run lint:fix before committing.

License

MIT

conductor-agents