RAGLite

RAGLite is a minimal, TypeScript-first Retrieval-Augmented Generation (RAG) pipeline designed for simplicity, extensibility, and performance. It ingests PDF and DOCX documents, embeds them with OpenAI embedding models, and runs vector search on SQLite/LibSQL.

Features

Simple API: One main class (RAGLite) for most use cases.
Document Ingestion: Load PDFs and DOCX files out of the box.
OpenAI Embeddings: Uses OpenAI's embedding models.
Fast Vector Search: Backed by SQLite/LibSQL for efficient similarity search.
Extensible: Advanced users can use or extend individual components.

Installation

npm install raglite

Quick Start

Basic Usage

import { RAGLite } from "raglite";

const { OPENAI_API_KEY, DATABASE_URL } = process.env;

// Initialize RAGLite
const rag = new RAGLite({
  apiKey: OPENAI_API_KEY,
  url: DATABASE_URL,
});

// Load a document from a file path
await rag.load("path/to/document.pdf");

// Load a document from a URL
await rag.load("https://example.com/path/to/document.docx");

// Load a document from text
await rag.load("Hello, world!");

// Include metadata with the document
await rag.load("Hello, world!", {
  metadata: {
    source: "https://example.com/path/to/document.docx",
  },
});

// Search for relevant chunks
const results = await rag.search("What is retrieval-augmented generation?", 5);

console.log(results); // [{ content: "...", metadata: { source: "..." } }, ...]
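
The API documented here covers retrieval only, so the generation step is left to the caller. As a minimal sketch, the chunks returned by `rag.search` could be folded into a prompt for whichever LLM you use. `buildPrompt` and the `SearchResult` interface are hypothetical names introduced for illustration; the result shape is assumed from the `console.log` comment above.

```typescript
// Result shape assumed from the console.log comment in Basic Usage.
interface SearchResult {
  content: string;
  metadata?: { source?: string };
}

// Hypothetical helper (not part of RAGLite): formats retrieved chunks into
// a numbered context block followed by the user's question.
function buildPrompt(question: string, results: SearchResult[]): string {
  const context = results
    .map((result, index) => {
      const source = result.metadata?.source;
      const label = source ? ` (source: ${source})` : "";
      return `[${index + 1}]${label}\n${result.content}`;
    })
    .join("\n\n");

  return [
    "Answer the question using only the context below.",
    "",
    "Context:",
    context || "(no matching chunks)",
    "",
    `Question: ${question}`,
  ].join("\n");
}

const examplePrompt = buildPrompt("What is retrieval-augmented generation?", [
  { content: "RAG combines retrieval with generation.", metadata: { source: "notes.txt" } },
]);
console.log(examplePrompt);
```

Numbering the chunks lets the model cite which piece of context it used; the instruction wording is only a placeholder.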

API

`new RAGLite(options)`

| Option | Type | Default | Description |
| --- | --- | --- | --- |
| apiKey | string | | OpenAI API key |
| url | string | `file:data/raglite.db` | SQLite/LibSQL database URL |
| tableName | string | `embeddings` | Table name for embeddings |
| model | string | `text-embedding-3-small` | OpenAI embedding model |
| dimensions | number | `1536` | Embedding vector size |
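
The defaults above can be made explicit in application code. The sketch below is a hypothetical helper, not part of RAGLite: it reads settings from an environment-like object and fills in the documented defaults. It assumes `apiKey` must be supplied explicitly, since the table lists no default for it.

```typescript
// Option names and defaults are copied from the options table.
interface RAGLiteOptions {
  apiKey: string;
  url: string;
  tableName: string;
  model: string;
  dimensions: number;
}

// Hypothetical helper (not part of RAGLite): builds a complete options
// object and fails early when the API key is missing.
function resolveOptions(env: Record<string, string | undefined>): RAGLiteOptions {
  const apiKey = env["OPENAI_API_KEY"];
  if (!apiKey) {
    throw new Error("OPENAI_API_KEY is not set");
  }
  return {
    apiKey,
    url: env["DATABASE_URL"] ?? "file:data/raglite.db",
    tableName: "embeddings",
    model: "text-embedding-3-small",
    dimensions: 1536,
  };
}
```

With a helper like this, the constructor call becomes `new RAGLite(resolveOptions(process.env))`, and a missing key surfaces at startup rather than on the first embedding request.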

`rag.load(filePath|url|text|buffer, options)`

The first argument is one of the following:
filePath (string): Local path of the document to load.
url (string): URL of the document to load.
text (string): Plain text to load directly.
buffer (Buffer): Raw document bytes to load.
options (object): Optional settings for the document, such as metadata (see Basic Usage).
Loads the document, embeds it, and stores the result in the vector store.
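
Because a file path, a URL, and plain text are all strings, `load` has to decide which one it received. This README does not document how RAGLite makes that decision, so the following is only a sketch of one plausible rule set. `classifyInput`, `InputKind`, and the injected `fileExists` predicate are all hypothetical.

```typescript
type LoadInput = string | Uint8Array; // Node's Buffer is a Uint8Array subclass
type InputKind = "buffer" | "url" | "file" | "text";

// Hypothetical classifier (not RAGLite's actual logic): binary data first,
// then http(s) URLs, then existing files, and plain text as the fallback.
function classifyInput(
  input: LoadInput,
  fileExists: (path: string) => boolean,
): InputKind {
  if (typeof input !== "string") return "buffer";
  if (/^https?:\/\//i.test(input)) return "url";
  if (fileExists(input)) return "file";
  return "text";
}
```

In Node.js the predicate could be `fs.existsSync`. Under rules like these, a mistyped file path would be embedded as literal text rather than raising an error, which is worth checking against the real behavior before relying on it.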

`rag.search(query, results = 3)`

query (string): The search query.
results (number): Number of top results to return (default: 3).
Returns: Array of the most relevant document chunks, each with its content and metadata.
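
Conceptually, a top-N search ranks stored chunks by how close their embeddings are to the query embedding. RAGLite delegates this to SQLite/LibSQL, so the in-memory version below is only an illustration of the idea. It assumes cosine similarity as the metric; `StoredChunk`, `cosineSimilarity`, and `topK` are hypothetical names.

```typescript
interface StoredChunk {
  content: string;
  embedding: number[];
}

// Cosine similarity: dot product divided by the product of vector lengths.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("dimension mismatch");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0; // treat zero vectors as unrelated
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Scores every chunk against the query and keeps the k best, highest first.
function topK(query: number[], chunks: StoredChunk[], k = 3): StoredChunk[] {
  return chunks
    .map((chunk) => ({ chunk, score: cosineSimilarity(query, chunk.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
    .map((entry) => entry.chunk);
}
```

This brute-force scan is linear in the number of chunks, which is the main reason to let the database handle the search instead.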

Advanced Usage

RAGLite is modular. You can use or extend its components directly:

Vector Store: LibSQLStore (raglite/stores/LibSQLStore)
Embedding Model: OpenAIModel (raglite/models/OpenAIModel)
Document Loaders: PDF, DOCX, and text loaders in raglite/loaders/
Base Classes: For custom stores, models, or loaders.

Example (using components directly):

import { LibSQLStore } from "raglite/stores/LibSQLStore";
import { OpenAIModel } from "raglite/models/OpenAIModel";

const model = new OpenAIModel({ apiKey: "..." });
const store = new LibSQLStore({ url: "...", model });

await store.addDocument({ content: "Hello world", metadata: {} });
const results = await store.search("Hello");
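
The base classes are not documented in this README, so the sketch below only illustrates what a custom embedding model might look like, for example a deterministic one for offline tests that avoids OpenAI calls. The `EmbeddingModel` interface, its `embed` method, and `HashingModel` are assumptions; RAGLite's real base class may expose a different shape.

```typescript
// Hypothetical interface: RAGLite's actual model base class may differ.
interface EmbeddingModel {
  dimensions: number;
  embed(text: string): Promise<number[]>;
}

// Deterministic bag-of-words hashing: each token increments one bucket,
// then the vector is scaled to unit length.
function hashEmbed(text: string, dimensions: number): number[] {
  const vector = new Array<number>(dimensions).fill(0);
  const tokens = text.toLowerCase().match(/[a-z0-9]+/g) ?? [];
  for (const token of tokens) {
    let hash = 0;
    for (let i = 0; i < token.length; i++) {
      hash = (hash * 31 + token.charCodeAt(i)) >>> 0; // keep as unsigned 32-bit
    }
    vector[hash % dimensions] += 1;
  }
  const norm = Math.sqrt(vector.reduce((sum, v) => sum + v * v, 0));
  return norm === 0 ? vector : vector.map((v) => v / norm);
}

// Wraps hashEmbed behind the assumed async model interface.
class HashingModel implements EmbeddingModel {
  constructor(public readonly dimensions: number = 64) {}

  async embed(text: string): Promise<number[]> {
    return hashEmbed(text, this.dimensions);
  }
}
```

Hashed vectors capture word overlap rather than meaning, so a model like this is useful for exercising a pipeline in tests, not for real retrieval quality.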

Requirements

Node.js >= 18 (recommended: >= 20)
OpenAI API key
SQLite/LibSQL database (local file or remote)

Development & Testing

Run tests: pnpm test or npm test
Build: pnpm build or npm run build

License

MIT

Contributing

Contributions, issues, and feature requests are welcome! Please open an issue or PR.