JSPM

  • ESM via JSPM
  • ES Module Entrypoint
  • Export Map
  • Keywords
  • License
  • Repository URL
  • TypeScript Types
  • README
  • Created
  • Published
  • Downloads 70
  • Score
    100M100P100Q98465F
  • License MIT

Seracade is a drop-in OpenAI-compatible routing proxy for AI agent teams. Six named capabilities: Call (every request, addressable and replayable), Step (sub-Call routing context inside agent trajectories), Quality Score (calibrated, version-stamped quality fingerprint), Routing Decision (the unit of the routing table), Counterfactual Replay (re-run any historical Call against alternate models), Quality Gate (customer-set policy constraining the routing table). 15% of the price difference per routed Call, free until $500/month.

Package Exports

  • seracade
  • seracade/dist/index.js

This package does not declare an exports field, so the exports above have been automatically detected and optimized by JSPM instead. If any package subpath is missing, it is recommended to post an issue to the original package (seracade) to support the "exports" field. If that is not possible, create a JSPM override to customize the exports field for this package.

Readme

seracade

Route your AI agent's LLM calls to the cheapest model that meets your quality bar — automatically. Seracade audits your real traffic, proves savings with empirical scoring, then routes every call to the Pareto-optimal model for its task type.

Free audit. 15% of per-call savings after $500/month threshold. Your API keys, never stored.

Setup

Recommended: Set your API key as an environment variable so it is never passed inline through tool calls.

export SERACADE_API_KEY=your-provider-api-key

Claude Desktop

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "seracade": {
      "command": "npx",
      "args": ["-y", "seracade"],
      "env": {
        "SERACADE_API_KEY": "your-provider-api-key"
      }
    }
  }
}

Claude Code

claude mcp add seracade -- npx -y seracade

CrewAI

from crewai import Agent
from crewai_tools import MCPServerAdapter
import os

mcp_adapter = MCPServerAdapter(
    {"command": "npx", "args": ["-y", "seracade"]},
    env={"SERACADE_API_KEY": os.environ["SERACADE_API_KEY"]}
)

agent = Agent(
    role="LLM Cost Optimizer",
    tools=mcp_adapter.tools
)

LangGraph

from langchain_mcp_adapters.client import MultiServerMCPClient
import os

client = MultiServerMCPClient({
    "seracade": {
        "command": "npx",
        "args": ["-y", "seracade"],
        "env": {"SERACADE_API_KEY": os.environ["SERACADE_API_KEY"]},
        "transport": "stdio",
    }
})

tools = await client.get_tools()

Vercel AI SDK

import { experimental_createMCPClient } from "ai";
import { Experimental_StdioMCPTransport } from "ai/mcp-stdio";

const mcp = await experimental_createMCPClient({
  transport: new Experimental_StdioMCPTransport({
    command: "npx",
    args: ["-y", "seracade"],
    env: { SERACADE_API_KEY: process.env.SERACADE_API_KEY! },
  }),
});

const tools = await mcp.tools();

How it works

Step 1 — Audit. Call seracade with action=start_audit. Set OPENAI_BASE_URL=https://seracade.com/v1. Your API calls flow through unchanged — zero latency impact, your keys, your data. At 50 calls, the audit triggers automatically.

Step 2 — Review. Call seracade with action=get_status to check progress. When the report is ready, the tool returns a URL. Share it with a human decision maker before activating routing.

Step 3 — Activate. Call seracade with action=activate_routing. Every subsequent call routes to the optimal model on the efficient frontier for its task type.

Step 4 — Route. Use seracade_route in place of your direct LLM calls. Seracade selects the cheapest model that meets your quality bar and returns the full completion with per-call savings.

Step 5 — Track. Call seracade with action=check_savings to see your monthly savings. Free until the savings Seracade generates for you exceed $500/month — then 15% of the per-call price difference. You keep 85%.

Tools

seracade_route

Route an LLM completion to the cheapest model that meets your quality bar. Requires routing to be active — call seracade with action=activate_routing first.

Parameter Type Required Description
model string Yes The model you would normally use (e.g. gpt-4o, claude-sonnet-4-6)
messages array Yes OpenAI-format messages array
temperature number No Sampling temperature
max_tokens integer No Max output tokens
api_key string No Your provider API key. Prefer SERACADE_API_KEY env var.

Returns: full completion, routed model, task type, and estimated savings per call.

seracade

Manage your Seracade account. Action-dispatched.

Parameter Type Required Description
action string Yes One of: start_audit, get_status, activate_routing, check_savings, set_budget
api_key string No Your provider API key. Prefer SERACADE_API_KEY env var.
email string No Email for report delivery (start_audit only)
strategy string No Routing strategy: balanced, max_savings, conservative (activate_routing only, default: balanced)
agent_name string No Agent identifier (set_budget only)
monthly_budget_usd number No Monthly spend cap in USD (set_budget only)
action_on_exceed string No block, downgrade, or alert when budget exceeded (set_budget only, default: alert)

Remote HTTP (no local install)

Connect directly without npx — for hosts that support streamable HTTP transport:

{
  "mcpServers": {
    "seracade": { "url": "https://seracade.com/mcp" }
  }
}

Discovery endpoint: https://seracade.com/.well-known/mcp.json

Pricing

  • Free audit, always
  • Smart routing: 15% of the price difference per routed call
  • Free until the savings Seracade generates for you exceed $500/month
  • You keep 85% of every dollar saved

Security

  • BYOK: your API keys, your data — Seracade never bills for LLM usage
  • Keys hashed for identification, never stored in plaintext
  • All traffic over TLS
  • 30-day response body retention, 90-day metadata retention
  • Terms of Service | Privacy Policy | Security