Package Exports

darksol
darksol/src/cli.js

This package does not declare an exports field, so the exports above have been automatically detected and optimized by JSPM instead. If any package subpath is missing, it is recommended to post an issue to the original package (darksol) to support the "exports" field. If that is not possible, create a JSPM override to customize the exports field for this package.

Readme

Built by DARKSOL 🌑

Darksol Studio

Local-first LLM inference engine with hardware-aware optimization, OpenAI-compatible API, MCP tool integration, and Ollama model reuse. Your Ollama alternative — built smarter.

Website · GitLab · npm

Features

⚡ Hardware-aware inference — auto-detects GPU, VRAM, CPU, RAM and optimizes settings
🔌 OpenAI-compatible API — drop-in /v1/chat/completions, /v1/completions, /v1/models, /v1/embeddings
🦙 Ollama model reuse — finds and runs your existing Ollama models directly, no daemon required
🔍 HuggingFace directory — browse, search, and pull GGUF models with "will it fit?" indicators
🔧 MCP tool integration — connect external tools via Model Context Protocol (CoinGecko, DexScreener, Etherscan, DefiLlama pre-configured)
💰 Cost tracking — every local inference is $0.00, track usage and savings vs cloud
🌡️ Thermal monitoring — real-time GPU/CPU temperature tracking
📡 SSE streaming — real-time token streaming with abort support

Install

npm i -g darksol

Quick Start

# Search for models (with hardware fit check)
darksol search llama --limit 5

# Pull a model from HuggingFace
darksol pull llama-3.2-3b-gguf

# Run a prompt
darksol run llama-3.2-3b "Write a haiku about local inference."

# Use an existing Ollama model directly
darksol run ollama/llama3.2:latest "hello world"

# Start the API server
darksol serve
# → http://127.0.0.1:11435

CLI Commands

Command	Description
`darksol serve`	Start the OpenAI-compatible API server
`darksol run <model> <prompt>`	Run a one-shot inference
`darksol pull <model>`	Download a GGUF model from HuggingFace
`darksol list`	List installed models (local + Ollama)
`darksol search <query>`	Search HuggingFace with hardware-aware fit
`darksol ps`	Show loaded model processes
`darksol status`	System and server status
`darksol usage`	Show inference stats and cost tracking
`darksol rm <model>`	Remove a downloaded model
`darksol browse`	Interactive model browser
`darksol doctor`	System diagnostics
`darksol mcp list`	List MCP server registry
`darksol mcp enable <name>`	Enable an MCP server
`darksol mcp disable <name>`	Disable an MCP server

API Endpoints

Default: http://127.0.0.1:11435

Endpoint	Method	Description
`/health`	GET	Service liveness and metadata
`/v1/models`	GET	Installed models (OpenAI format)
`/v1/chat/completions`	POST	Chat completions with SSE streaming
`/v1/completions`	POST	Text completions
`/v1/embeddings`	POST	Text embeddings
`/v1/ollama/models`	GET	Ollama local model inventory
`/v1/directory/models`	GET	HuggingFace model search (`q`, `limit`, `task`, `sort`)
`/v1/app/usage`	GET	Inference stats and cost tracking
`/v1/app/meta`	GET	App metadata and route inventory
`/v1/mcp/servers`	GET	MCP server registry
`/v1/mcp/servers/:name/enable`	POST	Enable an MCP server
`/v1/mcp/servers/:name/disable`	POST	Disable an MCP server
`/v1/models/pull`	POST	Pull a model from HuggingFace
`/v1/models/import-ollama`	POST	Import an Ollama model into Darksol
`/v1/runtime/ports`	GET	Check port availability
`/v1/runtime/ports/find`	POST	Find a free port
`/v1/runtime/config`	POST	Update runtime host/port config
`/v1/runtime/status`	GET	Engine runtime status
`/v1/runtime/start`	POST	Start managed runtime
`/v1/runtime/stop`	POST	Stop managed runtime
`/v1/runtime/restart`	POST	Restart managed runtime
`/v1/runtime/keepwarm`	GET/POST	Keep-warm scheduler config
`/v1/bankr/health`	GET	Bankr gateway status
`/v1/bankr/config`	GET/POST	Bankr gateway config
`/v1/bankr/models`	GET	Bankr cloud model list
`/v1/bankr/usage`	GET	Bankr usage summary

Example: Chat Completion

curl -X POST http://127.0.0.1:11435/v1/chat/completions \
  -H "content-type: application/json" \
  -d '{
    "model": "llama-3.2-3b",
    "messages": [
      { "role": "user", "content": "Hello from Darksol." }
    ]
  }'

Example: Search Models

curl "http://127.0.0.1:11435/v1/directory/models?q=llama&limit=3&sort=popular"

MCP Integration

Darksol supports the Model Context Protocol for connecting external tools to your models. Pre-configured servers:

CoinGecko — crypto prices and market data
DexScreener — DEX trading pairs and analytics
Etherscan — Ethereum blockchain data
DefiLlama — DeFi protocol TVL and yields

All servers are disabled by default. Enable them with darksol mcp enable <name>.

Config: ~/.darksol/mcp-servers.json

Environment

Variable	Default	Description
`HUGGINGFACE_TOKEN`	—	Auth token for private HuggingFace models
`DARKSOL_OLLAMA_ENABLED`	`true`	Enable Ollama interop
`DARKSOL_OLLAMA_BASE_URL`	`http://127.0.0.1:11434`	Ollama endpoint
`BANKR_BASE_URL`	—	Bankr LLM gateway URL
`BANKR_API_KEY`	—	Bankr API key

Runtime config: ~/.darksol/config.json

Desktop + Web Notes

For architecture details (desktop shell + web portal implementation), see:

docs/PHASE8_DESKTOP_WEB_ARCHITECTURE.md

Desktop Dev + Installer (Windows)

From repo root:

# install desktop runtime deps
npm --prefix desktop install

# run Electron desktop shell (auto-checks/boots local darksol backend)
npm run desktop:dev

# build Windows NSIS installer
npm run desktop:build:win

Installer output path:

desktop/dist/darksol-inference-desktop-<version>-setup.exe

The npm README stays focused on install + usage.

License

MIT

Built with teeth. 🌑