JSPM

  • ESM via JSPM
  • ES Module Entrypoint
  • Export Map
  • Keywords
  • License
  • Repository URL
  • TypeScript Types
  • README
  • Created
  • Published
  • Downloads 85
  • Score
    100M100P100Q80172F
  • License MIT

Local-first LLM inference engine with OpenAI-compatible API, MCP tools, and hardware-aware optimization

Package Exports

  • darksol
  • darksol/src/cli.js

This package does not declare an exports field, so the exports above have been automatically detected and optimized by JSPM instead. If any package subpath is missing, it is recommended to post an issue to the original package (darksol) to support the "exports" field. If that is not possible, create a JSPM override to customize the exports field for this package.

Readme

DARKSOL

Built by DARKSOL ๐ŸŒ‘

npm version license node >=20

Darksol Studio

Local-first LLM inference engine with hardware-aware optimization, OpenAI-compatible API, MCP tool integration, and Ollama model reuse. Your Ollama alternative โ€” built smarter.

Website ยท GitLab ยท npm

Features

  • โšก Hardware-aware inference โ€” auto-detects GPU, VRAM, CPU, RAM and optimizes settings
  • ๐Ÿ”Œ OpenAI-compatible API โ€” drop-in /v1/chat/completions, /v1/completions, /v1/models, /v1/embeddings
  • ๐Ÿฆ™ Ollama model reuse โ€” finds and runs your existing Ollama models directly, no daemon required
  • ๐Ÿ” HuggingFace directory โ€” browse, search, and pull GGUF models with "will it fit?" indicators
  • ๐Ÿ”ง MCP tool integration โ€” connect external tools via Model Context Protocol (CoinGecko, DexScreener, Etherscan, DefiLlama pre-configured)
  • ๐Ÿ’ฐ Cost tracking โ€” every local inference is $0.00, track usage and savings vs cloud
  • ๐ŸŒก๏ธ Thermal monitoring โ€” real-time GPU/CPU temperature tracking
  • ๐Ÿ“ก SSE streaming โ€” real-time token streaming with abort support

Install

npm i -g darksol

Quick Start

# Search for models (with hardware fit check)
darksol search llama --limit 5

# Pull a model from HuggingFace
darksol pull llama-3.2-3b-gguf

# Run a prompt
darksol run llama-3.2-3b "Write a haiku about local inference."

# Use an existing Ollama model directly
darksol run ollama/llama3.2:latest "hello world"

# Start the API server
darksol serve
# โ†’ http://127.0.0.1:11435

CLI Commands

Command Description
darksol serve Start the OpenAI-compatible API server
darksol run <model> <prompt> Run a one-shot inference
darksol pull <model> Download a GGUF model from HuggingFace
darksol list List installed models (local + Ollama)
darksol search <query> Search HuggingFace with hardware-aware fit
darksol ps Show loaded model processes
darksol status System and server status
darksol usage Show inference stats and cost tracking
darksol rm <model> Remove a downloaded model
darksol browse Interactive model browser
darksol doctor System diagnostics
darksol mcp list List MCP server registry
darksol mcp enable <name> Enable an MCP server
darksol mcp disable <name> Disable an MCP server

API Endpoints

Default: http://127.0.0.1:11435

Endpoint Method Description
/health GET Service liveness and metadata
/v1/models GET Installed models (OpenAI format)
/v1/chat/completions POST Chat completions with SSE streaming
/v1/completions POST Text completions
/v1/embeddings POST Text embeddings
/v1/ollama/models GET Ollama local model inventory
/v1/directory/models GET HuggingFace model search (q, limit, task, sort)
/v1/app/usage GET Inference stats and cost tracking
/v1/app/meta GET App metadata and route inventory
/v1/mcp/servers GET MCP server registry
/v1/mcp/servers/:name/enable POST Enable an MCP server
/v1/mcp/servers/:name/disable POST Disable an MCP server
/v1/models/pull POST Pull a model from HuggingFace
/v1/models/import-ollama POST Import an Ollama model into Darksol
/v1/runtime/ports GET Check port availability
/v1/runtime/ports/find POST Find a free port
/v1/runtime/config POST Update runtime host/port config
/v1/runtime/status GET Engine runtime status
/v1/runtime/start POST Start managed runtime
/v1/runtime/stop POST Stop managed runtime
/v1/runtime/restart POST Restart managed runtime
/v1/runtime/keepwarm GET/POST Keep-warm scheduler config
/v1/bankr/health GET Bankr gateway status
/v1/bankr/config GET/POST Bankr gateway config
/v1/bankr/models GET Bankr cloud model list
/v1/bankr/usage GET Bankr usage summary

Example: Chat Completion

curl -X POST http://127.0.0.1:11435/v1/chat/completions \
  -H "content-type: application/json" \
  -d '{
    "model": "llama-3.2-3b",
    "messages": [
      { "role": "user", "content": "Hello from Darksol." }
    ]
  }'

Example: Search Models

curl "http://127.0.0.1:11435/v1/directory/models?q=llama&limit=3&sort=popular"

MCP Integration

Darksol supports the Model Context Protocol for connecting external tools to your models. Pre-configured servers:

  • CoinGecko โ€” crypto prices and market data
  • DexScreener โ€” DEX trading pairs and analytics
  • Etherscan โ€” Ethereum blockchain data
  • DefiLlama โ€” DeFi protocol TVL and yields

All servers are disabled by default. Enable them with darksol mcp enable <name>.

Config: ~/.darksol/mcp-servers.json

Environment

Variable Default Description
HUGGINGFACE_TOKEN โ€” Auth token for private HuggingFace models
DARKSOL_OLLAMA_ENABLED true Enable Ollama interop
DARKSOL_OLLAMA_BASE_URL http://127.0.0.1:11434 Ollama endpoint
BANKR_BASE_URL โ€” Bankr LLM gateway URL
BANKR_API_KEY โ€” Bankr API key

Runtime config: ~/.darksol/config.json

Desktop + Web Notes

For architecture details (desktop shell + web portal implementation), see:

  • docs/PHASE8_DESKTOP_WEB_ARCHITECTURE.md

Desktop Dev + Installer (Windows)

From repo root:

# install desktop runtime deps
npm --prefix desktop install

# run Electron desktop shell (auto-checks/boots local darksol backend)
npm run desktop:dev

# build Windows NSIS installer
npm run desktop:build:win

Installer output path:

  • desktop/dist/darksol-inference-desktop-<version>-setup.exe

The npm README stays focused on install + usage.

License

MIT

Built with teeth. ๐ŸŒ‘