JSPM

Found 3 results for quantized

agentary-js

JS SDK for running quantized small language models in the browser

  • v1.4.6
  • 45.31
  • Published

llama.native.js

use `npm i --save llama.native.js` to run lama.cpp models on your local machine. features a socket.io server and client that can do inference with the host of the model.

    • v1.1.0
    • 33.94
    • Published

    llama-ggml.js

    serve websocket GGML 4/5bit Quantized LLM's based on Meta's LLaMa model with llama.ccp

      • v0.1.0
      • 15.77
      • Published