Package Exports
- forgehive
- forgehive/cli
Readme
ForgeHiveAI
Makes every Claude Code session smarter than the last.
What ForgeHive does
ForgeHive is a memory and learning layer for Claude Code. Every autonomous run writes successes, failures, and anti-patterns into a local vector DB. The next run injects semantically relevant memories and stack-specific warnings directly into the planning prompt.
In one sentence: Claude remembers what you taught it last time, even across sessions and machines.
What ForgeHive is not
- Not a new LLM
- Not a cloud service
- Not a Cursor replacement
- Not a multi-tenant SaaS
ForgeHive is the persistence layer Claude Code is missing — local-first, deterministic, provider-agnostic for embeddings (OpenAI, Voyage) and chat (Anthropic SDK, Claude CLI).
Quick Start
npm install -g forgehive
cd your-project
fh init
fh auto "Add OAuth login to the API"fh init scans your stack, infers conventions from sibling files, and
sets up a .forgehive/ directory inside your project. Claude reads it
automatically. fh auto runs the full plan → implement → test → PR
loop with persisted memory.
Five core commands cover the everyday flow:
| Command | What it does |
|---|---|
fh init |
One-shot setup — scans the stack, runs fh doctor, writes .forgehive/. |
fh confirm |
Confirms the inferred capabilities (draft → confirmed). |
fh status |
Project state, run history, dashboard. |
fh auto "<task>" |
Autonomous run: plan → implement → test → PR. |
fh ship |
Pre-ship checklist; --gate blocks the merge on hard conditions. |
fh help --all lists 39 additional advanced commands (memory, security,
party, MCP, etc.). See docs/reference.md for the
full surface.
How the memory loop works
- Scan —
fh initreads yourpackage.json/Cargo.toml/pyproject.tomland computes a deterministic stack signature. - Infer —
conventions.tsexamines sibling files for naming and structural patterns, so prompts can say "match the existing style" concretely. - Embed — Each successful or failed run writes a
MemoryEntrytosqlite-vec(192-dim Voyage Matryoshka by default, OpenAI fallback). - Recall — Before each plan/implement prompt, ForgeHive runs a semantic search on the task description and pulls the top k hits plus any stack-matching anti-patterns into the prompt.
- Learn — After three consecutive failures of the same task type with the same error signature, an anti-pattern is recorded. The next prompt warns Claude explicitly and tells it to try a different approach.
This is the loop that makes the second session smarter than the first.
Security model
- Credentials live in the OS keychain (
keytar— macOS Keychain, libsecret on Linux, Credential Manager on Windows). The legacy encrypted-file store remains as a graceful fallback for headless environments and migrates transparently on first use. - Tool outputs are wrapped in an
__fh_untrustedJSON boundary before re-injection; the system prompt instructs Claude to treat them as data, not as instructions. - A configurable spend hard-cap (
fh cost --limit <usd>) is enforced before every Anthropic call. No silent runaway runs. - MCP skill pulls require GPG-signed commits by default;
FH_ALLOW_UNSIGNED=1is the only override and is intentionally an environment variable, not a CLI flag.
VS Code Extension
ForgeHive ships a VS Code extension that surfaces your project's memory directly in the editor.
Features:
- Memory Sidebar — shows your top anti-patterns and recent learnings in a dedicated panel
- Stack Indicator — status bar shows detected stack + hash (
⬡ typescript, react #a3f2b1) - Cost Pill — accumulated session cost visible at a glance
Install (manual until Marketplace listing):
cd extensions/vscode
npm install
npm run package # builds forgehive-0.1.0.vsix
code --install-extension forgehive-0.1.0.vsixThe extension activates automatically when a .forgehive/capabilities.yaml is found in the workspace. Run fh init first if it hasn't activated.
Marketplace listing coming — tracked in ROADMAP.md.
What's new in v3.0.0
- 200 expert skills — 49 new Wave 3 skills across 8 categories: deployment & release (Deployment Strategies, SLO Patterns, Release Management, Feature Lifecycle), compliance & audit (SOC2, HIPAA, ISO27001, PCI-DSS), AI/ML engineering (Agent Architecture, Fine-Tuning, Context Engineering, LLM Caching), platform & DX (Developer Portal, Golden Path Templates, Platform Metrics, Documentation), data governance (Data Mesh, Data Governance, Data Quality, CDC, Data Lakehouse), performance & edge (Edge Computing, API Gateway, Web Performance, WebRTC, Real-Time Sync), security deep (Secure Coding, Pen-Test Methodology, Secrets Rotation), full-stack patterns (CQRS, Multi-Cloud, Saga Orchestration, Form Patterns, Analytics Instrumentation, A/B Testing, Idempotency, Bulk Operations, File Handling, Email Patterns, Visual Regression, Test Data Management, A11y Deep, Build Systems, Dependency Management)
- 56 agents — 5 new agents: Chip (Edge Computing), Mona (Developer Experience), Prue (Data Governance), Cal (Compliance Deep), Webb (WebRTC & Real-Time Sync)
- 3 new party sets —
/edge-party(Chip + Theo + Hugo),/platform-dx-party(Mona + Ivan + Dot),/compliance-deep-party(Cal + Pia + Vera) — 33 total party sets - Schema fix —
architecturefamily added toAgentFamilySchema;initForgehiveRuntimeskipsagents/memory/subdirectory during copy - 1148 tests passing — all wiring, agent, skill-selector, init-confirm, and commands-registry tests green
What's new in v2.5.0
- 151 expert skills — 84 new skills added across 10 categories: full language coverage (Ruby, PHP, Elixir, Node, Scala, R, Dart, Groovy, Clojure, Lua, WASM, Solidity), framework-specific (Next.js, NestJS, Django, FastAPI, React, Vue, Svelte, Nuxt, gRPC, Spring Security, HTMX, Tauri), ML/AI deep (MLOps, RAG, Prompt Engineering, Feature Engineering, Vector DB, LLM Observability, AI Safety, Model Deployment, Data Contracts, Streaming Analytics, LLM Evaluation, Synthetic Data), cloud extended (Azure, GCP, Cloudflare, FinOps), security deep (Cryptography, Threat Modeling, Container Security, Incident Response, SAST/DAST, Network Security, API Security, DevSecOps), testing extended (Mutation Testing, Property-Based Testing, API Testing, Fuzz Testing), architecture extended (Hexagonal, Vertical Slice, Modular Monolith, API Versioning, ADR, Technical Debt Strategy, Strangler Fig, Event Storming, Service Mesh, Distributed Systems, Protocol Design, Scheduling), frontend/mobile deep (iOS, Android, PWA, Micro-frontend, CSS Architecture, React Native), backend extended (Rate Limiting, Webhooks, Search, Background Jobs, Concurrency, Multi-tenancy), data/domain (Data Science, Feature Store, Streaming Processing, Data Visualization, Payment, Geospatial, Notification Systems, Audit Logging)
- 51 agents — 5 new agents: Gabi (Azure/GCP Cloud), Ori (iOS/Android Native), Rox (Data Science), Pax (Advanced Testing), Mod (Architecture Evolution)
- 4 new party sets —
/native-mobile-party,/data-science-party,/testing-advanced-party,/architecture-party— 30 total party sets - Skill-selector extended —
fh initnow maps 200+ capability IDs to the right skill (azure → azure-patterns, mlops → mlops-patterns, stryker → mutation-testing, etc.)
What's new in v2.4.0
- 67 expert skills — 28 new skills added: full language coverage (Python, Go, Rust, Java, Kotlin, Swift), backend architecture (Microservices, Event-Driven, SQL Optimization, NoSQL, Event Sourcing, Serverless, WebSocket), cloud & infrastructure (AWS, Kubernetes, GitOps, Platform Engineering, Secrets Management, Zero-Trust, Supply Chain Security), quality & testing (Contract Testing, Load Testing, Chaos Engineering, E2E Testing), and architecture & design (DDD, Feature Flags, Design System, i18n)
- 46 agents — 21 new agents with full Skill-Matrix wiring: language/framework specialists (Nova, Ember, Quinn, Jax, Ash, Cruz, Blake), systems engineers (Rio, Marco, Dev), domain/ops (Dot, Finn, Lex, Bex, Zoe, Lena, Ivan, Pia, Sage, Cy, Rex)
- 10 new party sets —
/systems-party,/cloud-party,/security-deep-party,/realtime-party,/frontend-deep-party,/framework-party,/compliance-party,/domain-party,/ops-party— each with full command files and optimized agent coalitions - YAML-driven orchestrator —
party/orchestrator.tsreads all sets fromparty/defaults.yaml; no hardcoded party logic;buildAgentContext()loads skills dynamically from agent YAMLskills:block - Skill-selector extended —
fh initnow maps 50+ capability IDs to the right skills (kafka → event-driven-patterns, playwright → testing-e2e, dynamodb → nosql-patterns, etc.)
What's new in v2.3.4
- 34 expert skills — 8 new:
docker-patterns,caching-strategies,graphql-patterns,refactoring-patterns,mobile-patterns,data-pipeline-patterns,iac-patterns,sre-patterns; plusauth-security,gdpr-compliance,owasp-top10indexed - Stack-aware
fh init— detects your stack and writesactive_skillstocapabilities.yaml. Claude loads the right skills automatically at session start. No manual selection. - Milo (Mobile Engineer), Dani (Data Engineer), Ray (GraphQL Architect) — 3 new agents with full skill files and party sets (
/mobile-party,/data-party,/graphql-party) - Registry populated —
registry/skills.yamllists all 34 core skills;fh skills searchnow returns remote results
What's new in v2.3.3
- 26 expert skills — 5 new:
frontend-patterns(Luna, Core Web Vitals),accessibility(WCAG 2.1 AA, ARIA),ci-cd-patterns(GitHub Actions, deployment gates),state-management(React Query, Zustand, Pinia),embedded-patterns(Otto, memory/ISR safety) fh skills search <query>— search local skills + remote registry by keyword, tag, or description. Community skills viafh skills pull <git-url>.fh doctor --fix— auto-repairs: activatesdraftcapabilities, injects missing CLAUDE.md block. Prints manual instructions for everything else.- Multi-provider LLM (fully documented) —
FH_PROVIDER=openai|bedrock|ollamafor non-Anthropic backends. Auto-selected via env vars. - Luna + Otto agents — Frontend/UX engineer and Embedded/IoT engineer now fully operational with their skill files.
What's new in v2.3.0
- SQLite outcome store —
outcomes.sqlite(WAL mode) replacesoutcomes.jsonl; O(1) spend queries, no rotation logic, cost-summary.json sidecar for the VS Code extension. - 4-stage routing cascade —
routeTaskWithHistoryAsync()now runs: Party dispatch → History-boosted → Semantic persona match → Keyword fallback; every result carries acomplexityfield (simple | moderate | complex). - Agent memory in party — each party agent gets the last 50 lines of its
.forgehive/agents/memory/<name>.mdinjected into its system prompt. - Kai–Sam dialog in review-party —
runKaiSamDialog()runs automatically after the parallel review phase for a consensus synthesis. - ForgeHive SDK (
packages/memory-layer) —ForgeHiveMemoryfacade class for embedding ForgeHive memory in Cursor, Continue, and Copilot; includes YAML-based anti-pattern reader and portable JSONL outcome store.
See CHANGELOG.md for the full list.
Project layout
.forgehive/
├── capabilities.yaml # inferred stack + tools
├── memory/ # static MD + vectorised entries
│ ├── feedback.md
│ ├── project.md
│ ├── stories/
│ └── epics/
├── outcomes.sqlite # per-run ledger (cost, success, errors)
├── patterns/ # stack-specific anti-patterns
├── runs/ # per-run state
└── scan-result.yamlDocumentation
- Tutorial — step-by-step walkthrough of a first run
- Command reference — every command, every flag
- Architecture — memory layer, pattern store, agents
- User guide — workflows and recipes
Contributing
ForgeHive is a solo-founder project. Issues and PRs are welcome but the roadmap stays focused on the memory-layer thesis. If your contribution adds breadth (new agents, new wires, new docs surface) please open an issue first.
License: Elastic License 2.0.