get-east-asian-width
Determine the East Asian Width of a Unicode character
Found 184 results for text-processing
Determine the East Asian Width of a Unicode character
Promptbook: Run AI apps in plain human language across multiple models and platforms
MDAST to DOCX plugin for resolving and embedding images. Supports base64, URLs, and custom resolvers for seamless DOCX image integration.
Extend MDAST by parsing embedded HTML in Markdown. Converts HTML into structured MDAST nodes compatible with @m2d/core for DOCX generation.
Core engine to convert extended MDAST to DOCX. Supports plugins for footnotes, images, lists, tables, and more. Designed for seamless Markdown-to-DOCX conversion.
A highly efficient, isomorphic, full-featured, multilingual text search engine library, providing full-text search, fuzzy matching, phonetic scoring, document indexing and more, with micro JSON state hydration/dehydration in-browser and server-side.
Parsing Library for Typescript and Javascript.
Plugin to convert Markdown tables (MDAST) to DOCX with support for rich formatting and seamless integration into mdast2docx.
Plugin to convert ordered and unordered lists from Markdown (MDAST) to DOCX. Supports nesting, custom bullets, and numbering styles.
Plugin to convert mathematical expressions in Markdown (MDAST) to DOCX using LaTeX-style syntax. Integrates seamlessly with mdast2docx.
Convert Markdown Abstract Syntax Tree (MDAST) to DOCX seamlessly. Supports footnotes, images, links, and customizable document properties.
A plugin for @m2d/core that parses emoji shortcodes like :smile: and replaces them with their corresponding Unicode emoji characters for DOCX output.
Extended MDAST types and custom node data for mdast2docx with support for DOCX formatting.
🔪 chunk/split a string by length without cutting/truncating words.
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
A unified plugin to prepare MDAST trees for DOCX conversion using mdast2docx.
A NPM package containing reusable utility functions for use in N8N code nodes, providing common functionality for text processing, data validation, batch operations, and more.
Promptbook: Run AI apps in plain human language across multiple models and platforms
Semantically create chunks from large texts. Useful for workflows involving large language models (LLMs).
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Full MCP 2025-06-18 compliant server with 121+ IT tools, logging, ping, progress tracking, cancellation, and sampling utilities
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Promptbook: Run AI apps in plain human language across multiple models and platforms
Get the Excerpt from a markdown file (like in jekyll or *smith)
Promptbook: Run AI apps in plain human language across multiple models and platforms
TypeScript library for extracting legal citations from text strings. A complete port of the Python eyecite library.
自动提取React项目中的中文字符串并进行国际化的CLI工具
Split Markdown documents into logical chunks while preserving code blocks, tables, lists, and other nested structures. Safely segment large MD files for processing/pagination without breaking syntax integrity, powered by AST-based parsing for accurate blo
SmartEdit: A Powerful and Extensible CLI Editor
Convert Markdown Abstract Syntax Tree (MDAST) to DOCX seamlessly. Supports footnotes, images, links, and customizable document properties.
A library to convert markdown to HTML.
A fast, zero-config CLI tool to clean and format Markdown files.
Convert between plain English and regex patterns with ease
A Node.js library for sentiment analysis using TextBlob
n8n community node for GitLab Code Splitter API
A lightweight TypeScript library designed to fix typos in OCR post-processing.
Modular SDK for structured text extraction from documents using LLMs
A Node.js module to remove personally identifiable information (PII) from text.
Functions for manipulating strings.
Extend MDAST by parsing embedded HTML in Markdown. Converts HTML into structured MDAST nodes compatible with @m2d/core for DOCX generation.
A CLI tool to extract text from a static Next.js export and generate llm.txt for LLM ingestion.
Extend MDAST by parsing embedded HTML in Markdown. Converts HTML into structured MDAST nodes compatible with @m2d/core for DOCX generation.
A lightweight TypeScript library designed to reconstruct paragraphs from OCRed inputs.
Encode and decode Unicode escapes in a string
A fully typed, general-purpose utility for unidirectional string transliteration (non-Latin script => Latin script).
Simple CLI wrapper for the CmpStr package to normalize and compare strings directly via terminal
🧠 powerful JavaScript library that leverages advanced AI embeddings to perform zero-shot text classification. Whether you're dealing with unlabelled data or seeking to classify text against dynamic and user-defined labels, this library provides a seamles
LanguageTool filter for Catalan text processing
A utility library for full-text search in TypeScript
A simple, zero-config package for common AI tasks like sentiment analysis in JavaScript and TypeScript.
A type-safe string templating library for TypeScript
A lightweight TypeScript library designed to reconstruct paragraphs from AI transcriptions.
Universal document-to-markdown and section splitter for HTML, URLs, and PDFs.
A performant zero-dependency utility to clean UTF-8 text, fix mojibake from latin1, verify string length, and sanitize input
Japanese text transliteration library for JavaScript/TypeScript
Official TypeScript/JavaScript SDK for Hashub Vector API - High-quality multilingual text embeddings with Turkish excellence
A powerful CLI tool for converting custom syntax to Markdown with AI assistance, statistics analysis, and interactive menus
TypeScript library for truncating HTML strings while preserving HTML tags and structure
Semantically create chunks from large texts. Useful for workflows involving large language models (LLMs).
A powerful wrapper around the OpenAI API, providing additional features and making it easier to interact with AI models. Seamlessly chat with your AI assistant, include context strings, and manage conversation history.
Server-side DOM text manipulator (Node.js).
Famous leftpad library for open source reasons
Use LLMs to run map-reduce summarization tasks on large documents until a target token size is met.
A TypeScript/JavaScript module for implementing Retrieval-Augmented Generation (RAG) using Qdrant vector database, Google's Generative AI embeddings, and Groq LLM.
Core engine to convert extended MDAST to DOCX. Supports plugins for footnotes, images, lists, tables, and more. Designed for seamless Markdown-to-DOCX conversion.
JSON front-matter parser and combiner. Minimal and perfect
A unified plugin to prepare MDAST trees for DOCX conversion using mdast2docx.
NodeJS library that semantically chunks text and matches it against a user query using cosine similarity for precise and relevant text retrieval
A powerful AI-powered CLI tool
Core engine to convert extended MDAST to DOCX. Supports plugins for footnotes, images, lists, tables, and more. Designed for seamless Markdown-to-DOCX conversion.
A versatile string manipulation library providing a range of text utilities for JavaScript and Node.js applications.
一个基于 Cheerio 的 HTML 解析和数据提取工具库
Kapsamlı Türkçe veri işleme, doğrulama, formatlama ve sahte Türkçe veri üretme araçları kütüphanesi
Text file bundling tool that preserves file structure. Nice for sending multiple files in one shot to large language models.
Talk to Sim with Teach Feature
A unified plugin to prepare MDAST trees for DOCX conversion using mdast2docx.
Split long Thai address strings into structured components (name, phone, address, subdistrict, district, province, zipcode). Handles names without title prefixes, location name conflicts, and province abbreviations.
A lightweight and powerful collection of string utility functions for Node.js - trimming, casing, formatting, and more.
Find and replace text in github file
N8N node for generating text embeddings using Transformer.js with direct text input
🔪 chunk/split a string by length without cutting/truncating words.
A library of stream classes for semantic text processing, including sources like Wikipedia and news articles.
Here is a README generated from the code snippet:
Uzbek to Cyrillic transliterator
Extended MDAST types and custom node data for mdast2docx with support for DOCX formatting.
Corrects text typed with the wrong Thai/English keyboard layout
split tracklist text to object contains artist and title of each track
React components for DamkarAI - AI-powered text and code assistant
A powerful React hook for text summarization using Google's Generative AI API. Easily integrate advanced text summarization capabilities into your React applications.
A tool for semantic chunking of text
MDAST to DOCX plugin for resolving and embedding images. Supports base64, URLs, and custom resolvers for seamless DOCX image integration.
A powerful and flexible text search library for JavaScript that enables you to build a simple text search engine.
Process links in text. Ben Alman's linkify.js adaptation
Professional AI prompt engineering toolkit with advanced template features, real-time dashboards, conditional logic, template inheritance, live monitoring, OpenRouter integration, and 310+ model support
MDAST to DOCX plugin for resolving and embedding images. Supports base64, URLs, and custom resolvers for seamless DOCX image integration.
Plugin to convert mathematical expressions in Markdown (MDAST) to DOCX using LaTeX-style syntax. Integrates seamlessly with mdast2docx.
Plugin to convert Markdown tables (MDAST) to DOCX with support for rich formatting and seamless integration into mdast2docx.
Find and replace text in github file in all repositories
Plugin to convert Markdown tables (MDAST) to DOCX with support for rich formatting and seamless integration into mdast2docx.
A simple utility library for string manipulation
Convert Markdown Abstract Syntax Tree (MDAST) to DOCX seamlessly. Supports footnotes, images, links, and customizable document properties.
A powerful TypeScript library designed to simplify a wide range of string operations and manipulations.
Extended MDAST types and custom node data for mdast2docx with support for DOCX formatting.
BulkAI is a powerful Node.js CLI tool designed to automate the processing of markdown and text files using OpenAI's GPT-4.
Comprehensive string manipulation utilities with zero dependencies
Remove diacritics (accents, special characters, and marks) from text, making it easier to normalize, search, and process text across multiple languages.
A plugin for @m2d/core that parses emoji shortcodes like :smile: and replaces them with their corresponding Unicode emoji characters for DOCX output.
A lightweight and easy-to-use npm package for performing sentiment analysis on text. Analyze the positivity, negativity, or neutrality of any string input with ease, and process multiple texts in batch for more efficient analysis.
Plugin to convert ordered and unordered lists from Markdown (MDAST) to DOCX. Supports nesting, custom bullets, and numbering styles.
A plugin for @m2d/core that parses emoji shortcodes like :smile: and replaces them with their corresponding Unicode emoji characters for DOCX output.
<h1 align="center">Welcome to Shiba - your strings helper! 👋 </h1>
A tool to generate AI fine-tuning datasets from text files
A simple utility library for string manipulation.
Chunked Augmented Generation (CAG) algorithm for processing large text inputs with AI models
strkit is a utility library offering a collection of essential string functions including validation, case conversion, truncation, and more. Ideal for both JavaScript and TypeScript developers to simplify string operations in their applications.
Non linear text processing system
Konversi teks Bahasa Indonesia dari kasual ke formal untuk surat, email, dan laporan resmi
A template for padding utils function for Strings in JavaScript / NodeJs
TOML front-matter parser and combiner. Minimal and perfect
generate excerpt from html text while preserving html structure
A utility function to convert a string into a URL-friendly slug, with support for string sanitization, normalization, and transformation.
TDK (Türk Dil Kurumu) sözlüğünden kelime anlamlarını, köken bilgilerini ve atasözlerini getiren Node.js paketi.
Slugify and search Vietnamese text with diacritics support
ArabicEdita is n8n community node to fix the arabic writing problem in editimage builin node in n8n
Famous leftpad library for open source reasons
Semantically create chunks from large texts. Useful for workflows involving large language models (LLMs).
Visualization of statistic peaks and valleys TE ranks
Öğrenmeye çalıştığım bir kod
Plugin to convert ordered and unordered lists from Markdown (MDAST) to DOCX. Supports nesting, custom bullets, and numbering styles.
A unified data-ingestion CLI that auto-detects and converts text, image, audio and tabular sources into standardized training datasets
Famous leftpad library for open source reasons
YAML front-matter parser and combiner. Minimal and perfect
AI-powered text extraction using Zod schemas
Famous leftpad library for open source reasons
Utility for extracting chinese characters from a string
🤖 Smart text analysis package for detecting positive and negative words with AI support. Features customizable word lists, multiple languages, and AI-powered sentiment analysis. Perfect for content moderation, sentiment analysis, and text filtering in an
A simple utility library for string manipulations including case transformations and hexadecimal conversions.
🧑🏭 Node.js package for restoring punctuation and casing to strings via ONNX Model `punctuation_fullstop_truecase_english`
One Dionys (String Utils) - Provides useful functions for manipulating strings and can be used in typescript/javascript.
Transform stream enumeratee generators for stream-driven data extract and transformation (i.e. ETL).
GbDetector is an advanced text analysis module designed to identify gambling-related content through sophisticated pattern matching and text processing techniques.
Encode and decode Unicode escapes in a string
A powerful tool to anonymize sensitive data, allowing reversible decoding.
A library for generating slug based on the input string and the ability to configure parameters
A CLI tool for text search
Plugin to convert mathematical expressions in Markdown (MDAST) to DOCX using LaTeX-style syntax. Integrates seamlessly with mdast2docx.
A simple Node.js utility module for common string operations.
A JavaScript library for handling singular possessive apostrophes with support for international names
Batch process text files using OpenAI API to clean and transform content
A real-time word counting utility for text input
Utility to clean up text by removing or translating common 'slop' patterns
Universal outlining engine. Generate an outline of any text-based document! CLI included.
this is a lightweight and modular library providing a comprehensive set of utility functions to streamline development workflows. It is designed to simplify common tasks in software projects, including string manipulation, array operations, date handling,
A Node.js package to extract unique titles based on cosine similarity.
Famous leftpad library for open source reasons
it is small word counter cli app
Complete TypeScript port of rivo/uniseg with 100% API compatibility. Unicode text segmentation for grapheme clusters, word boundaries, and text width calculation.
Extract keywords from text content
Returns lines matching a pattern in a string. Supports inverse operation as well, to exclude lines with matches.
A comprehensive utility library for file operations, HTTP requests, data processing, and more
Remove or replace em dashes (—) in strings with a simple boolean parameter