OpenRouter Kit
🇷🇺 Русский | 🇬🇧 English
OpenRouter Kit is a powerful, flexible, and user-friendly TypeScript/JavaScript library for interacting with the OpenRouter API. It simplifies working with LLMs by providing a unified API for chats, history management with metadata, history analysis, tool handling (function calling), request routing, web search, reasoning tokens, and much more.
Installation
npm install openrouter-kit
# or
yarn add openrouter-kit
# or
pnpm add openrouter-kit

Quick Start: Usage Examples
Here are a few examples to get you started quickly:
1. Simple Response Generation
The most basic example for sending a request and getting a response from the model.
// simple-chat.ts
import { OpenRouterClient } from 'openrouter-kit';
// Initialize the client with your API key
const client = new OpenRouterClient({
apiKey: process.env.OPENROUTER_API_KEY || 'sk-or-v1-...',
model: "x-ai/grok-4-fast" // Default model for all calls
});
async function main() {
console.log('Sending a simple request...');
try {
const result = await client.chat({
prompt: 'Write a short greeting for a README.',
model: 'openai/gpt-4o-mini', // Override the model for this call
temperature: 0.7,
});
console.log('--- Result ---');
console.log('Model Response:', result.content);
console.log('Model Used:', result.model);
console.log('Tokens Used:', result.usage);
} catch (error: any) {
console.error("Error:", error.message);
} finally {
await client.destroy();
}
}
main();

2. Dialog Example (with History Management)
To maintain conversation context, use historyAdapter and pass a user ID. The library will automatically load and save the history along with API call metadata.
// dialog-chat.ts
import { OpenRouterClient, MemoryHistoryStorage } from 'openrouter-kit';
const client = new OpenRouterClient({
apiKey: process.env.OPENROUTER_API_KEY || 'sk-or-v1-...',
// Use MemoryHistoryStorage to store history in memory
historyAdapter: new MemoryHistoryStorage(),
enableCostTracking: true, // Enable cost calculation (saved in metadata)
model: "x-ai/grok-4-fast",
debug: false,
});
const userId = 'dialog-user-123'; // Unique ID for the user
async function runDialog() {
try {
// First message
console.log(`[${userId}] You: Hi! Remember, my favorite color is blue.`);
const result1 = await client.chat({
user: userId, // <-- Pass the user ID for automatic history management
prompt: 'Hi! Remember, my favorite color is blue.',
});
console.log(`[${userId}] Bot: ${result1.content}`);
console.log(`(Cost: $${result1.cost?.toFixed(8) || 'N/A'})`);
// Second message (model should remember the context)
console.log(`\n[${userId}] You: What is my favorite color?`);
const result2 = await client.chat({
user: userId, // <-- Same user ID
prompt: 'What is my favorite color?',
});
console.log(`[${userId}] Bot: ${result2.content}`);
console.log(`(Cost: $${result2.cost?.toFixed(8) || 'N/A'})`);
} catch (error: any) {
console.error(`\n--- Error ---`);
console.error(`Message: ${error.message}`);
if (error.code) console.error(`Error Code: ${error.code}`);
} finally {
console.log('\nEnding dialog...');
await client.destroy();
}
}
runDialog();

3. Tools Example (Function Calling with Details)
This example shows how the model can use functions (tools) you provide to get external information, and how to retrieve detailed information about each tool call.
// tools-example.js (CommonJS)
const { OpenRouterClient } = require("openrouter-kit");
// --- Example Data (replace with your real sources) ---
const users = [ { id: "user_1001", nick: "alice" }, { id: "user_1002", nick: "bob" } ];
const messages = [ { id: "msg_101", userId: "user_1001", content: "Hi from alice!" }, { id: "msg_102", userId: "user_1002", content: "Hi from bob!" } ];
// ---
// --- Tool Definitions ---
const userTools = [
{
type: "function",
function: {
name: "getUserIdByNick",
description: "Gets the user ID by their nickname",
parameters: { type: "object", properties: { nick: { type: "string" } }, required: ["nick"] },
},
execute: async (args) => {
console.log(`[Tool Execute: getUserIdByNick] Args: ${JSON.stringify(args)}`);
const user = users.find(u => u.nick.toLowerCase() === args.nick.toLowerCase());
// Simulate slight delay
await new Promise(res => setTimeout(res, 50));
return user ? { userId: user.id, found: true } : { userId: null, found: false };
}
},
{
type: "function",
function: {
name: "getUserMessages",
description: "Gets all messages for a user by their ID",
parameters: { type: "object", properties: { userId: { type: "string" } }, required: ["userId"] },
},
execute: async (args) => {
console.log(`[Tool Execute: getUserMessages] Args: ${JSON.stringify(args)}`);
const userMessages = messages.filter(m => m.userId === args.userId);
// Simulate slight delay
await new Promise(res => setTimeout(res, 100));
return { messages: userMessages, count: userMessages.length, found: userMessages.length > 0 };
}
}
];
// ---
const client = new OpenRouterClient({
apiKey: process.env.OPENROUTER_API_KEY || "sk-or-v1-...",
model: "x-ai/grok-4-fast", // Model that supports tools
});
async function main() {
try {
const promptAlice = "Find all messages from user alice.";
console.log(`\nPrompt: "${promptAlice}"`);
const resultAlice = await client.chat({
prompt: promptAlice,
tools: userTools,
temperature: 0.5,
includeToolResultInReport: true // <-- Request full call details
});
console.log(`Response:\n${resultAlice.content}`);
console.log(`(Total Tool Calls: ${resultAlice.toolCallsCount})`);
// --- Display Tool Call Details ---
if (resultAlice.toolCalls && resultAlice.toolCalls.length > 0) {
console.log("\n--- Tool Call Details ---");
resultAlice.toolCalls.forEach((call, index) => {
console.log(`Call ${index + 1}:`);
console.log(` Tool Name: ${call.toolName}`);
console.log(` Status: ${call.status}`);
console.log(` Duration: ${call.durationMs}ms`);
if (call.status === 'success') {
// Display result because includeToolResultInReport: true
console.log(` Result:`, call.result);
} else if (call.error) {
console.log(` Error: ${call.error.message} (Type: ${call.error.type})`);
}
console.log(` Arguments (Parsed):`, call.parsedArgs);
console.log("-------------------------");
});
}
// ---
} catch (error) {
console.error("\n--- Error ---");
console.error(error);
} finally {
await client.destroy();
}
}
main();

4. Requesting JSON Object Response (json_object)
This example shows how to request a response from the model as any valid JSON object.
// json-object-example.ts
import { OpenRouterClient } from 'openrouter-kit';
const client = new OpenRouterClient({
apiKey: process.env.OPENROUTER_API_KEY || 'sk-or-v1-...',
model: 'openai/gpt-4o-mini', // Model that works well with JSON
});
async function main() {
try {
const prompt = "Provide information about user John Doe: age 30, city New York, in JSON format.";
console.log(`Prompt: "${prompt}" (expecting JSON object)`);
const result = await client.chat({
prompt: prompt,
temperature: 0.2,
responseFormat: {
type: 'json_object', // <-- Request JSON object
},
});
console.log('--- Result ---');
console.log('Model Response (type):', typeof result.content);
console.log('Model Response (content):', result.content);
console.log('Model Used:', result.model);
if (result.content && typeof result.content === 'object') {
console.log('Username from response:', result.content.name || result.content.userName);
}
} catch (error: any) {
console.error(`\n--- Error ---`);
console.error(`Message: ${error.message}`);
if (error.code) console.error(`Error Code: ${error.code}`);
if (error.details) console.error(`Details:`, error.details);
} finally {
console.log('\nShutting down...');
await client.destroy();
}
}
main();

5. Requesting Response by JSON Schema (json_schema)
This example shows how to request a response that strictly adheres to a provided JSON Schema.
// json-schema-example.ts
import { OpenRouterClient } from 'openrouter-kit';
const client = new OpenRouterClient({
apiKey: process.env.OPENROUTER_API_KEY || 'sk-or-v1-...',
model: 'openai/gpt-4o-mini', // Model that works well with schemas
});
// Define our JSON Schema
const answerSchema = {
type: "object",
properties: {
summary: {
type: "string",
description: "A brief summary of the answer"
},
confidence: {
type: "number",
description: "Confidence in the answer from 0.0 to 1.0",
minimum: 0,
maximum: 1
},
tags: {
type: "array",
description: "A list of relevant keywords (tags)",
items: {
type: "string"
}
}
},
required: ["summary", "confidence", "tags"]
};
async function main() {
try {
const prompt = "Briefly explain quantum entanglement, estimate your confidence, and add tags.";
console.log(`Prompt: "${prompt}" (expecting JSON matching 'answer' schema)`);
const result = await client.chat({
prompt: prompt,
temperature: 0.3,
responseFormat: {
type: 'json_schema', // <-- Request JSON by schema
json_schema: {
name: 'answer',
schema: answerSchema,
strict: true // <-- Ask the model to strictly follow the schema (if supported)
}
},
});
console.log('--- Result ---');
console.log('Model Response (type):', typeof result.content);
console.log('Model Response (content):', result.content);
console.log('Model Used:', result.model);
if (result.content && typeof result.content === 'object') {
console.log('Summary:', result.content.summary);
console.log('Tags:', result.content.tags?.join(', '));
}
} catch (error: any) {
console.error(`\n--- Error ---`);
console.error(`Message: ${error.message}`);
if (error.code) console.error(`Error Code: ${error.code}`);
if (error.details) console.error(`Details:`, error.details);
} finally {
console.log('\nShutting down...');
await client.destroy();
}
}
main();

6. Streaming Response (Real-time Output)
Stream responses in real-time for a better user experience with long-form content generation.
// streaming-example.js
const { OpenRouterClient } = require('openrouter-kit');
const client = new OpenRouterClient({
apiKey: process.env.OPENROUTER_API_KEY || 'sk-or-v1-...',
model: 'x-ai/grok-4-fast',
enableCostTracking: true, // Track costs even in streaming mode
});
async function main() {
try {
console.log('Streaming response:\n');
const result = await client.chatStream({
prompt: 'Write a haiku about coding',
streamCallbacks: {
onContent: (content) => {
// Print each chunk as it arrives
process.stdout.write(content);
},
onComplete: (fullContent, usage) => {
console.log('\n\n--- Stream Complete ---');
console.log('Total tokens:', usage?.total_tokens);
},
onError: (error) => {
console.error('Stream error:', error.message);
}
}
});
// Result includes full content, usage, cost, and duration
console.log('Cost:', result.cost ? `$${result.cost.toFixed(6)}` : 'N/A');
console.log('Duration:', `${result.durationMs}ms`);
} catch (error) {
console.error('Error:', error.message);
} finally {
await client.destroy();
}
}
main();

Tool Execution in Streaming:
// Tools execute automatically when provided (just like client.chat())
const result = await client.chatStream({
prompt: "What's the weather in Paris?",
tools: myTools, // If tools are provided, they execute automatically!
streamCallbacks: {
onContent: (content) => console.log(content),
onToolCallExecuting: (name, args) => console.log(`Calling ${name}...`),
onToolCallResult: (name, result) => console.log(`Got: ${result}`)
}
});

Streaming Features:
- Real-time content delivery via callbacks
- Automatic cost tracking and metrics
- Automatic tool execution (when tools are provided)
- AbortSignal support for cancellation
- Full authentication and security checks
- History management with streaming responses
Examples:
- examples/streaming-demo.js - Comprehensive streaming examples
- examples/streaming-test.js - Test suite for streaming features
Detailed Guide
Now that you've seen the basic examples, you can delve deeper into the library's capabilities.
Contents
- Why Use OpenRouter Kit?
- Key Features
- Example: Taxi Bot
- API and Concepts
- OpenRouterClient
- Plugins and Middleware
- History Management (Adapters and Analysis)
- Tool Handling (Function Calling)
- Security Module (SecurityManager)
- Cost Tracking
- Routing (Models and Providers)
- Web Search
- Reasoning Tokens
- Response Format (responseFormat and Structured Outputs)
- Error Handling
- Logging
- Proxy
- License
Why Use OpenRouter Kit?
- Simplicity: Complex API interactions, history management, tool handling, and routing are hidden behind a simple client.chat() method.
- Flexibility: Configure models, generation parameters, history storage (requires an adapter), security, provider/model routing, and more, both globally and per request.
- Analytics: Built-in tools to analyze stored chat history (cost, tokens, model usage).
- Security: An integrated security module helps protect your applications and users when using tools.
- Extensibility: Use plugins and middleware to add custom logic without modifying the library core.
- Reliability: Fully typed with TypeScript, predictable error handling (including structured tool errors), and resource management.
- Modern Features: Support for web search, reasoning tokens, structured outputs, and other OpenRouter API capabilities.
Key Features
- Universal Chat: Simple and powerful API (client.chat) for interacting with any model available via OpenRouter.
- Streaming Support: Real-time response streaming with client.chatStream() for better UX. Includes callbacks, cost tracking, and full security checks.
- History Management with Metadata: Requires historyAdapter configuration. Automatic loading and saving of dialog history for each user (user), including API call metadata (model, tokens, cost, etc.).
  - Flexible history system based on adapters (IHistoryStorage).
  - Includes MemoryHistoryStorage and DiskHistoryStorage.
  - Easily plug in your own adapters or use the provided plugin (createRedisHistoryPlugin).
  - Configurable history cache TTL (historyTtl) and cleanup intervals (historyCleanupInterval).
- History Analysis: Get aggregated statistics from stored history via client.getHistoryAnalyzer():
  - getStats(): Total cost, tokens, API calls, stats by model and finish reasons.
  - getCostOverTime(): Cost trends by day/hour/minute.
  - getTokenUsageByModel(): Token usage distribution by model.
- Tool Handling (Function Calling): Seamless integration of model-driven function calls.
  - Define tools (Tool) with JSON Schema for argument validation.
  - Automatic argument parsing, validation, and security checks.
  - Execution of your execute functions with context (ToolContext).
  - Automatic sending of results back to the model.
  - Structured tool error handling for better model understanding.
  - Detailed tool call reporting available in ChatCompletionResult.toolCalls (optionally includes the full result via includeToolResultInReport).
  - Configurable limit on recursive calls (maxToolCalls).
- Security Module: Comprehensive and configurable protection.
  - Authentication: JWT (built-in), api-key, custom.
  - Access Control (ACL): By roles, scopes, API keys, explicit rules.
  - Rate Limiting: Configurable limits for users/roles. (The default implementation is not suitable for distributed systems.)
  - Argument Sanitization: Protection against dangerous patterns (SQLi, XSS, etc.). Audit mode available.
  - Event system for monitoring.
- Cost Tracking: (Optional) Automatic calculation of approximate API call costs, with background price updates and a getCreditBalance() method. Cost is also saved in history metadata.
- Routing (Models and Providers):
  - Models: Specify fallback models (modelFallbacks in config or models in the request).
  - Providers: Fine-tune provider selection per request (provider in the request) or globally (defaultProviderRouting in config): sorting (price, throughput), order, ignore, parameter requirements, etc.
- Web Search: (Optional) Integrate web search results into the model's response via the plugins: [{ id: 'web', ... }] option or the :online suffix on the model name. Returns annotations with sources.
- Reasoning Tokens: (Optional) Request and receive the model's reasoning steps via the reasoning option.
- Structured Outputs: Request responses in JSON format (responseFormat: { type: 'json_object' }) or according to a strict JSON Schema (responseFormat: { type: 'json_schema', json_schema: {...} }), including support for the strict flag.
- Flexible Configuration: Set the API key, model, endpoint, timeouts, proxy, headers, history adapter, and much more via OpenRouterConfig.
- Typing: Fully implemented in TypeScript.
- Error Handling: Clear hierarchy of custom errors (OpenRouterError and subclasses) with codes (ErrorCode) and details.
- Logging: Built-in logger (Logger) with prefixes and debug mode (debug).
- Ease of Use: High-level API.
- Resource Management: client.destroy() method for proper resource cleanup.
- Plugin System and Middleware: Extend functionality without modifying the core.
Example: Taxi Bot
This example demonstrates using dialog history and tool calling. Note the mandatory inclusion of historyAdapter and the corresponding require.
// taxi-bot.js (CommonJS)
const { OpenRouterClient, MemoryHistoryStorage } = require("openrouter-kit");
const readline = require('readline').createInterface({
input: process.stdin,
output: process.stdout
});
const client = new OpenRouterClient({
apiKey: process.env.OPENROUTER_API_KEY || "sk-or-v1-...",
model: "x-ai/grok-4-fast",
historyAdapter: new MemoryHistoryStorage(), // Required for history
enableCostTracking: true,
debug: false,
});
let orderAccepted = false;
// --- Tool Definitions ---
const taxiTools = [
{
type: "function",
function: {
name: "estimateRideCost",
description: "Estimates the cost of a taxi ride between two addresses.",
parameters: {
type: "object",
properties: {
from: { type: "string", description: "Pickup address (e.g., '1 Lenin St, Moscow')" },
to: { type: "string", description: "Destination address (e.g., '10 Tverskaya St, Moscow')" }
},
required: ["from", "to"]
},
},
execute: async (args) => {
console.log(`[Tool estimateRideCost] Calculating cost from ${args.from} to ${args.to}...`);
const cost = Math.floor(Math.random() * 900) + 100;
console.log(`[Tool estimateRideCost] Calculated cost: ${cost} RUB`);
return { estimatedCost: cost, currency: "RUB" };
}
},
{
type: "function",
function: {
name: "acceptOrder",
description: "Accepts and confirms a taxi order, assigns a driver.",
parameters: {
type: "object",
properties: {
from: { type: "string", description: "Confirmed pickup address" },
to: { type: "string", description: "Confirmed destination address" },
estimatedCost: { type: "number", description: "Approximate ride cost (if known)"}
},
required: ["from", "to"]
},
},
execute: async (args, context) => {
console.log(`[Tool acceptOrder] Accepting order from ${args.from} to ${args.to}...`);
console.log(`[Tool acceptOrder] Order initiated by user: ${context?.userInfo?.userId || 'anonymous'}`);
const driverNumber = Math.floor(Math.random() * 100) + 1;
orderAccepted = true;
return `Order accepted successfully! Driver #${driverNumber} is assigned and will arrive shortly at ${args.from}. Destination: ${args.to}.`;
}
}
];
function askQuestion(query) {
return new Promise((resolve) => {
readline.question(query, (answer) => {
resolve(answer);
});
});
}
const systemPrompt = `You are a friendly and efficient taxi service operator named "Kit". Your task is to help the customer order a taxi.
1. Clarify the pickup address ('from') and destination address ('to') if the customer hasn't provided them. Be polite.
2. Once the addresses are known, you MUST use the 'estimateRideCost' tool to inform the customer of the approximate cost.
3. Wait for the customer to confirm they accept the cost and are ready to order (e.g., with words like "order", "okay", "yes", "sounds good").
4. After customer confirmation, use the 'acceptOrder' tool, passing it the 'from' and 'to' addresses.
5. After calling 'acceptOrder', inform the customer of the result returned by the tool.
6. Do not invent driver numbers or order statuses yourself; rely on the response from the 'acceptOrder' tool.
7. If the user asks something unrelated to ordering a taxi, politely steer them back to the topic.`;
async function chatWithTaxiBot() {
const userId = `taxi-user-${Date.now()}`;
console.log(`\nBot Kit: Hello! I'm your virtual assistant... (Session ID: ${userId})`);
try {
while (!orderAccepted) {
const userMessage = await askQuestion("You: ");
if (userMessage.toLowerCase() === 'exit' || userMessage.toLowerCase() === 'quit') {
console.log("Bot Kit: Thank you for contacting us! Goodbye.");
break;
}
console.log("Bot Kit: One moment, processing your request...");
const result = await client.chat({
user: userId,
prompt: userMessage,
systemPrompt: systemPrompt,
tools: taxiTools,
temperature: 0.5,
maxToolCalls: 5
// includeToolResultInReport: true // Can add for debugging
});
console.log(`\nBot Kit: ${result.content}\n`);
// Show details if debug is on or if tools were called
if (client.isDebugMode() || (result.toolCalls && result.toolCalls.length > 0)) {
console.log(`[Debug] Model: ${result.model}, Tool Calls: ${result.toolCallsCount}, Cost: ${result.cost !== null ? '$' + result.cost.toFixed(8) : 'N/A'}, Reason: ${result.finishReason}`);
// Display tool call details if available
if (result.toolCalls && result.toolCalls.length > 0) {
console.log("[Debug] Tool Call Details:");
result.toolCalls.forEach((call, i) => console.log(` ${i+1}. ${call.toolName} (${call.status}, ${call.durationMs}ms)`));
}
if (result.reasoning) console.log(`[Debug] Reasoning: ${result.reasoning}`);
if (result.annotations && result.annotations.length > 0) console.log(`[Debug] Annotations:`, result.annotations);
}
if (orderAccepted) {
console.log("Bot Kit: If you have any more questions, I'm here to help!");
}
}
} catch (error) {
console.error("\n--- An Error Occurred ---");
if (error instanceof Error) {
console.error(`Type: ${error.constructor.name}`);
console.error(`Message: ${error.message}`);
if (error.code) console.error(`Code: ${error.code}`);
if (error.statusCode) console.error(`Status: ${error.statusCode}`);
if (error.details) console.error(`Details:`, error.details);
} else {
console.error("Unknown error:", error);
}
} finally {
readline.close();
await client.destroy();
console.log("\nClient stopped. Session ended.");
}
}
chatWithTaxiBot();

API and Concepts
OpenRouterClient
The main class for interacting with the library.
Configuration (OpenRouterConfig)
When creating the client (new OpenRouterClient(config)), a configuration object is passed. Key fields:
- apiKey (string, required): Your OpenRouter API key.
- apiEndpoint? (string): URL endpoint for chat completions.
- apiBaseUrl? (string): Base URL for auxiliary endpoints (e.g., /models, /auth/key). Defaults to https://openrouter.ai/api/v1.
- model? (string): Default model for requests (e.g., "x-ai/grok-4-fast").
- debug? (boolean): Enable detailed logging (default: false).
- proxy? (string | object | null): HTTP/HTTPS proxy settings.
- referer? (string): Value for the HTTP-Referer header.
- title? (string): Value for the X-Title header.
- axiosConfig? (object): Additional configuration for Axios.
- historyAdapter? (IHistoryStorage): Required for history and analysis. An instance of a history storage adapter (e.g., new MemoryHistoryStorage()).
- historyTtl? (number): Time-to-live (TTL) for entries in the UnifiedHistoryManager cache (in milliseconds).
- historyCleanupInterval? (number): Interval for cleaning expired entries from the UnifiedHistoryManager cache (in milliseconds).
- defaultProviderRouting? (ProviderRoutingConfig): Default provider routing rules.
- modelFallbacks? (string[]): Default list of fallback models.
- responseFormat? (ResponseFormat | null): Default response format.
- maxToolCalls? (number): Maximum number of tool call cycles per chat() call (default: 10).
- strictJsonParsing? (boolean): Throw an error on invalid JSON in a response (when a JSON format is requested)? Default: false (returns null).
- security? (SecurityConfig): Configuration for the security module.
- enableCostTracking? (boolean): Enable cost tracking (default: false).
- priceRefreshIntervalMs? (number): Interval for refreshing model prices (default: 6 hours).
- initialModelPrices? (object): Initial model prices to avoid the first price request.
- Deprecated fields: historyStorage, chatsFolder, maxHistoryEntries, historyAutoSave, enableReasoning, webSearch.
Core Methods
- chat(options: OpenRouterRequestOptions): Promise<ChatCompletionResult>: The main method for sending chat requests.
- chatStream(options: OpenRouterRequestOptions, abortSignal?: AbortSignal): Promise<ChatStreamResult>: (New) Stream responses in real time with callbacks. Returns a result with cost, duration, and optional tool calls.
- getHistoryManager(): UnifiedHistoryManager: Returns the history manager.
- getHistoryAnalyzer(): HistoryAnalyzer: (New) Returns the history analyzer.
- getSecurityManager(): SecurityManager | null: Returns the security manager.
- getCostTracker(): CostTracker | null: Returns the cost tracker.
- getCreditBalance(): Promise<CreditBalance>: Fetches the credit balance.
- getModelPrices(): Record<string, ModelPricingInfo>: Returns the cached model prices.
- refreshModelPrices(): Promise<void>: Force-refreshes the price cache.
- createAccessToken(userInfo, expiresIn?): string: Generates a JWT (if configured).
- use(plugin): Promise<void>: Registers a plugin.
- useMiddleware(fn): void: Registers middleware.
- on(event, handler) / off(event, handler): Subscribe/unsubscribe from events ('error', 'security:*', 'tool:call', etc.).
- destroy(): Promise<void>: Releases resources.
client.chat Request Options (OpenRouterRequestOptions)
These options are passed to the client.chat() method to configure a specific request:
- prompt? (string): Simple user text prompt. Either prompt or customMessages is required.
- customMessages? (Message[] | null): Full array of messages to send (overrides prompt and history). Either prompt or customMessages is required.
- user? (string): User ID for automatic history management (requires historyAdapter).
- group? (string | null): Group ID for history (used with user).
- systemPrompt? (string | null): System prompt for the request.
- accessToken? (string | null): Access token for security checks.
- model? (string): Model ID for this request (overrides the default). Can use the :online suffix to enable web search.
- temperature?, maxTokens?, topP?, presencePenalty?, frequencyPenalty?, stop?, seed?, logitBias?: Standard LLM generation parameters.
- tools? (Tool[] | null): Array of available tools for this request.
- toolChoice?: Control the model's tool selection ('auto', 'none', { type: "function", function: { name: "..." } }).
- parallelToolCalls? (boolean): Allow the model to request multiple tools in parallel.
- maxToolCalls? (number): Override the recursive tool call limit for this request.
- includeToolResultInReport? (boolean): (New) Whether to include the full tool execution result in the result field of ToolCallDetail objects within the final ChatCompletionResult.toolCalls. Defaults to false.
- responseFormat? (ResponseFormat | null): Request a specific response format (JSON object or JSON Schema).
- strictJsonParsing? (boolean): Override the strict JSON parsing setting for this request.
- provider? (ProviderRoutingConfig): Provider routing rules for this request.
- models? (string[]): List of models (primary + fallbacks) for this request.
- plugins? (PluginConfig[]): List of plugins to activate (e.g., [{ id: 'web', max_results: 3 }]).
- reasoning? (ReasoningConfig): Settings for requesting reasoning tokens (effort, max_tokens, exclude).
- transforms? (string[]): OpenRouter transforms (e.g., ["middle-out"]).
- route?: Deprecated OpenRouter routing parameter.
- streamCallbacks? (StreamCallbacks): (For chatStream only) Callbacks for handling streaming events:
  - onChunk?: (chunk: StreamChunk) => void: Called for each SSE chunk received.
  - onContent?: (content: string) => void: Called when new content arrives.
  - onToolCallExecuting?: (toolName: string, args: any) => void: Called when a tool starts executing (tools auto-execute when provided).
  - onToolCallResult?: (toolName: string, result: any) => void: Called when a tool finishes executing.
  - onComplete?: (fullContent: string, usage?: UsageInfo, toolCalls?: ToolCall[]) => void: Called when streaming completes.
  - onError?: (error: Error) => void: Called on stream errors.
client.chat Result (ChatCompletionResult)
The client.chat() method returns a Promise that resolves to a ChatCompletionResult object with the following fields:
- content (any): The final response content from the model (string, JSON object, etc., depending on the request and response).
- usage (UsageInfo | null): Total tokens used (prompt + completion, including tool calls).
- model (string): ID of the model that generated the final response.
- toolCallsCount (number): Total number of tool calls made during this request (may include failed attempts).
- toolCalls? (ToolCallDetail[]): (New) An array containing detailed information about each tool call (name, args, status, duration, optionally result/error). Available if tools were called.
- finishReason (string | null): Reason the final response generation stopped ('stop', 'length', 'tool_calls', 'content_filter', null).
- durationMs (number): Total execution time of the chat() request in milliseconds.
- id? (string): ID of the last generation step from the OpenRouter API.
- cost? (number | null): Calculated approximate cost of the request (if enableCostTracking: true).
- reasoning? (string | null): String containing the model's reasoning steps (if requested and returned).
- annotations? (UrlCitationAnnotation[]): Array of annotations (e.g., web search citations) related to the final response.
ChatStreamResult (returned by client.chatStream()):
- content (string): The complete streamed content.
- usage? (UsageInfo | null): Token usage statistics.
- model? (string): Model that generated the response.
- finishReason? (string | null): Reason streaming stopped.
- id? (string): OpenRouter response ID.
- toolCalls? (ToolCall[]): Tool calls detected during streaming (not auto-executed).
- reasoning? (string): Reasoning tokens if requested.
- annotations? (UrlCitationAnnotation[]): Web search citations.
- cost? (number | null): Calculated cost (if cost tracking is enabled).
- durationMs? (number): Total streaming duration in milliseconds.
Plugins and Middleware
- Plugins: Modules that extend client functionality. Registered via client.use(plugin). Can initialize services, replace standard managers (setSecurityManager, setCostTracker), and add middleware.
- Middleware: Functions executed sequentially for each client.chat() call. Allow modifying the request (ctx.request), modifying the response (ctx.response), or performing side effects (logging, auditing). Registered via client.useMiddleware(fn).
π History Management (Adapters and Analysis)
To enable automatic dialog history management and subsequent analysis, you must configure historyAdapter in OpenRouterConfig.
- Storage Format (
HistoryEntry): The library savesHistoryEntryobjects, which include themessageitself and theapiCallMetadata(if applicable) from the API call that produced it. This allows precise tracking of models, tokens, and costs for each dialog step. - Adapter (
IHistoryStorage): Defines the interface for storage (load,save,delete,listKeys,destroy?), operating onHistoryEntry[]. UnifiedHistoryManager: Internal component using the adapter and managing an in-memory cache ofHistoryEntry. Provides methods likegetHistoryEntries,addHistoryEntries,getHistoryMessages.- Built-in Adapters:
MemoryHistoryStorage,DiskHistoryStorage. - Setup:
import { OpenRouterClient, MemoryHistoryStorage } from 'openrouter-kit'; const client = new OpenRouterClient({ /*...,*/ historyAdapter: new MemoryHistoryStorage() });
- **Redis Plugin:** Use `createRedisHistoryPlugin`.
- **History Analysis (`HistoryAnalyzer`):**
  - Access the analyzer via `client.getHistoryAnalyzer()`.
  - Use the `getStats()`, `getCostOverTime()`, and `getTokenUsageByModel()` methods to get aggregated data.
  - Methods accept an optional `HistoryQueryOptions` object for filtering entries before analysis (by date, models, etc.).
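As a sketch, a filter object for the analyzer might look like the following. The field names `from` and `models` are illustrative assumptions — the docs only state that filtering by date and models is supported, so verify them against the `HistoryQueryOptions` type:

```typescript
// Illustrative HistoryQueryOptions-style filter; the "from" and "models"
// field names are assumptions, not confirmed API.
const query = {
  from: new Date(Date.now() - 7 * 24 * 60 * 60 * 1000), // last 7 days
  models: ['openai/gpt-4o-mini'],                        // restrict to one model
};

// With a real client configured with a historyAdapter:
// const analyzer = client.getHistoryAnalyzer();
// const stats = await analyzer.getStats(query);
// const costOverTime = await analyzer.getCostOverTime(query);
```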
π οΈ Tool Handling (Function Calling)
Allows LLMs to call your custom functions.
- **Define a Tool (`Tool`):** Create an object with `type: 'function'`, `function: { name, description?, parameters? }` (JSON Schema for the arguments), and your `execute: (args, context?) => Promise<any> | any` function. Optionally add `security` rules.
- **Use in `client.chat()`:** Pass the array of tools in `options.tools`. The library automatically:
  1. Sends the definitions to the model.
  2. Intercepts the call request (`finish_reason: 'tool_calls'`).
  3. Parses and validates the arguments.
  4. Performs security checks (`SecurityManager`).
  5. Calls your `execute` function.
  6. Sends the result (or a structured error) back to the model.
  7. Returns the final model response to the user.
- **Result:** The final response is in `ChatCompletionResult.content`; the number of calls is in `ChatCompletionResult.toolCallsCount`. Details of each call (name, args, status, duration, and optionally result/error) are available in `ChatCompletionResult.toolCalls`. Use the `includeToolResultInReport: boolean` option to control whether the full result is included in the report.
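A tool following the shape described above might look like this. The `getWeather` tool itself is hypothetical — in real code its `execute` would call an actual weather API rather than return a stub:

```typescript
// Example tool matching the documented Tool shape. getWeather and its
// stubbed result are hypothetical, for illustration only.
const weatherTool = {
  type: 'function' as const,
  function: {
    name: 'getWeather',
    description: 'Get the current weather for a city',
    parameters: {
      type: 'object',
      properties: { city: { type: 'string', description: 'City name' } },
      required: ['city'],
    },
  },
  execute: (args: { city: string }) => {
    // Stubbed result; a real implementation would fetch live data.
    return { city: args.city, tempC: 21, conditions: 'clear' };
  },
};

// Usage with a real client:
// const result = await client.chat({
//   prompt: 'What is the weather in Paris?',
//   tools: [weatherTool],
// });
// console.log(result.content, result.toolCallsCount);
```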
π Security Module (SecurityManager)
Activated by passing a `security: SecurityConfig` object to the `OpenRouterClient` constructor. Provides authentication, access control, rate limiting, and argument sanitization for tool calls. Requires careful configuration, especially `userAuthentication.jwtSecret`. The default rate limiter is not suitable for distributed systems.
π Cost Tracking
Enabled via `enableCostTracking: true`. Calculates approximate API call costs based on `usage` data and cached model prices. Provides the `getCreditBalance()`, `getModelPrices()`, and `refreshModelPrices()` methods. The cost of each step is also saved in history metadata (`ApiCallMetadata.cost`).
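A minimal sketch of enabling cost tracking at construction time, using the option and method names listed above (`'sk-or-v1-...'` is a placeholder key):

```typescript
// Config sketch: enableCostTracking comes from the docs above;
// the API key value is a placeholder.
const config = {
  apiKey: 'sk-or-v1-...',   // your OpenRouter API key
  enableCostTracking: true, // turn on approximate per-call cost calculation
};

// With a real client:
// const client = new OpenRouterClient(config);
// console.log('Balance:', await client.getCreditBalance());
// await client.refreshModelPrices(); // force-refresh the cached model prices
// Per-step costs are then recorded in history metadata (ApiCallMetadata.cost).
```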
π Routing (Models and Providers)
- **Models:** Define fallback models in `OpenRouterConfig` (`modelFallbacks`) or per request in `OpenRouterRequestOptions` (`models`). The per-request `models` list takes precedence.
- **Providers:** Control provider selection via `defaultProviderRouting` in `OpenRouterConfig` or `provider` in `OpenRouterRequestOptions`. The per-request `provider` option overrides the default. You can set the order (`order`), enable/disable fallbacks (`allow_fallbacks`), ignore providers (`ignore`), require parameter support (`require_parameters`), filter by data policy (`data_collection`) or quantization (`quantizations`), and sort (`sort`).
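A per-request routing sketch using the field names listed above; the specific model and provider names are placeholders, and the `sort` value should be checked against the OpenRouter provider routing docs:

```typescript
// Request options sketch: field names come from the docs above;
// model/provider names are placeholders.
const options = {
  prompt: 'Summarize this repo.',
  models: ['openai/gpt-4o-mini', 'anthropic/claude-3.5-haiku'], // fallback order
  provider: {
    order: ['OpenAI', 'Anthropic'], // preferred provider order
    allow_fallbacks: true,          // allow other providers if these fail
    require_parameters: true,       // only providers supporting all params
    sort: 'price',                  // sorting criterion (assumed valid value)
  },
};

// const result = await client.chat(options);
```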
π Web Search
- **Activation:**
  - Add the `:online` suffix to the model name in `options.model` (e.g., `'openai/gpt-4o-mini:online'`).
  - Or pass the plugin in `options.plugins`: `plugins: [{ id: 'web' }]`. You can also configure `max_results` and `search_prompt`: `plugins: [{ id: 'web', max_results: 3 }]`.
- **Result:** The final model response may incorporate web search results. Source links are available in the `ChatCompletionResult.annotations` field.
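The two activation paths above can be sketched as request options (model names are placeholders):

```typescript
// Option A: ':online' model suffix activates web search.
const viaSuffix = {
  prompt: 'What changed in the latest Node.js LTS release?',
  model: 'openai/gpt-4o-mini:online',
};

// Option B: explicit web plugin, with an optional result limit.
const viaPlugin = {
  prompt: 'What changed in the latest Node.js LTS release?',
  model: 'openai/gpt-4o-mini',
  plugins: [{ id: 'web', max_results: 3 }],
};

// const result = await client.chat(viaPlugin);
// console.log(result.annotations); // UrlCitationAnnotation[] with source links
```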
π€ Reasoning Tokens
- **Request:** Pass a `reasoning` object in the `options` of `client.chat()`:
  - `effort`: `'low'`, `'medium'`, or `'high'`.
  - `max_tokens`: Number of tokens to allocate for reasoning.
  - `exclude`: `true` to have the model reason but omit the reasoning from the response.
- **Result:** Reasoning steps are available in the `ChatCompletionResult.reasoning` field (if `exclude: false`).
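A request sketch using the `reasoning` fields listed above:

```typescript
// Reasoning request sketch, using the documented option names.
const options = {
  prompt: 'Which is larger: 2^10 or 10^3? Explain briefly.',
  reasoning: {
    effort: 'medium' as const, // 'low' | 'medium' | 'high'
    // max_tokens: 512,        // alternatively, cap reasoning tokens directly
    exclude: false,            // keep reasoning visible in the result
  },
};

// const result = await client.chat(options);
// console.log(result.reasoning); // reasoning steps, if the model provides them
```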
βοΈ Response Format (responseFormat and Structured Outputs)
Request responses in JSON format to simplify parsing and data handling.
- **Configuration:** The `responseFormat` option in `OpenRouterConfig` (as the default) or `OpenRouterRequestOptions` (for a specific request).
- **Types:**
  - `{ type: 'json_object' }`: The model must return any valid JSON object.
  - `{ type: 'json_schema', json_schema: { name: string, schema: object, strict?: boolean, description?: string } }`: The model must return JSON matching your JSON Schema.
- **Parsing Error Handling:** If the model returns invalid JSON, the behavior depends on the `strictJsonParsing` setting (in `OpenRouterConfig` or `OpenRouterRequestOptions`).
- **⚠️ Compatibility Warning with `tools`:** Not all models support using `responseFormat` and `tools` simultaneously. Check the OpenRouter documentation or experiment.
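A `json_schema` request sketch following the shape documented above (the `greeting` schema is a made-up example):

```typescript
// Structured-output sketch using the documented responseFormat shape;
// the 'greeting' schema is illustrative.
const responseFormat = {
  type: 'json_schema' as const,
  json_schema: {
    name: 'greeting',
    strict: true,
    schema: {
      type: 'object',
      properties: {
        text: { type: 'string' },
        language: { type: 'string' },
      },
      required: ['text', 'language'],
      additionalProperties: false,
    },
  },
};

// const result = await client.chat({
//   prompt: 'Greet the reader in French.',
//   responseFormat,
//   strictJsonParsing: true, // controls behavior on invalid JSON (see above)
// });
```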
β οΈ Error Handling
Use `try...catch` and check errors via `instanceof` or `error.code` (`ErrorCode`). Subscribe to the client's `'error'` event for global logging.
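A defensive wrapper might look like the following. The `'API_ERROR'` code in the test is illustrative, and the EventEmitter-style `client.on('error', ...)` subscription is an assumption — check the exported `ErrorCode` enum and the client's event API for the real names:

```typescript
// Error-handling sketch: returns a normalized error object instead of throwing.
// error.code is assumed to carry an ErrorCode value, per the docs above.
async function safeChat(client: { chat: (o: any) => Promise<any> }, prompt: string) {
  try {
    return await client.chat({ prompt });
  } catch (error: any) {
    // instanceof checks against the library's error classes also work here.
    return { error: true as const, code: error.code ?? 'unknown', message: error.message };
  }
}

// Global logging (EventEmitter-style subscription assumed):
// client.on('error', (err) => console.error('[openrouter-kit]', err));
```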
π Logging
Enabled via `debug: true`. Uses `console` with component prefixes.
π Proxy
Configured via the `proxy` option (a URL string or an object `{ host, port, user?, pass? }`) in `OpenRouterConfig`.