Package Exports
This package does not declare an exports field, so the exports above have been automatically detected and optimized by JSPM instead. If any package subpath is missing, it is recommended to post an issue to the original package (bun-docx) to support the "exports" field. If that is not possible, create a JSPM override to customize the exports field for this package.
Readme
docx-cli
A Bun-built CLI for AI agents (Claude, Codex) to read, edit, and comment on .docx files with full format fidelity. Outputs JSON-AST for precise locator-based editing; preserves anything it doesn't model by mutating XML in place.
Install
Standalone binary (no Bun required):
curl -fsSL https://raw.githubusercontent.com/kklimuk/docx-cli/main/install.sh | shHonors PREFIX (default $HOME/.local/bin) and VERSION (default latest):
PREFIX=/usr/local sh -c "$(curl -fsSL https://raw.githubusercontent.com/kklimuk/docx-cli/main/install.sh)"
VERSION=v0.2.0 sh -c "$(curl -fsSL https://raw.githubusercontent.com/kklimuk/docx-cli/main/install.sh)"Pre-built binaries are published for linux/x64, linux/arm64, darwin/x64, darwin/arm64, windows/x64.
npm (requires Bun >= 1.3):
bun add -g bun-docx
# or
bunx bun-docx read doc.docxCommands
docx create FILE [--title T] [--author A] [--text "..."]
docx read FILE
docx insert FILE --after p3 --text "..." [--style HeadingN] [--color HEX] [--bold] [--italic] [--url URL]
docx insert FILE --after p3 --runs '[{"type":"text","text":"X","bold":true}]'
docx edit FILE --at p3 --text "..." | --runs '[...]'
docx delete FILE --at p3
docx find FILE QUERY [--regex] [--ignore-case] [--all] [--nth N]
docx replace FILE PATTERN REPLACEMENT [--regex] [--ignore-case] [--all] [--limit N] [--dry-run]
docx comments add FILE --range p3:5-20 --text "..." [--author NAME]
docx comments reply FILE --to c0 --text "..."
docx comments resolve FILE --id c0 [--unset]
docx comments delete FILE --id c0
docx comments list FILE [--include-resolved] [--thread c0]
docx images list FILE
docx images extract FILE --to ./media [--id imgN]
docx images replace FILE --at imgN --with ./new.png
docx hyperlinks list FILE
docx hyperlinks add FILE --at pN:S-E --url URL
docx hyperlinks replace FILE --at linkN --with URL
docx hyperlinks delete FILE --at linkN
docx track-changes FILE on|off
docx info schema [--ts]
docx info locators [--json]Every command has --help. Mutating commands accept --dry-run and -o/--output PATH (write to a parallel file instead of overwriting FILE). JSON output by default for read and *.list; structured {ok, code, error, hint} on failure.
When <w:trackChanges/> is set in the doc (toggle via docx track-changes FILE on), insert/edit/delete/replace automatically emit <w:ins>/<w:del> markers attributed to $DOCX_AUTHOR (default docx-cli). To make a one-off untracked edit, flip the flag off, edit, then flip it back on. find results inside tracked-change wrappers carry a trackedChanges array so agents can decide what to do with hits in pending insertions/deletions.
Locators
pN paragraph N (e.g., p3)
pN:S-E characters S..E within paragraph N
pN:S-pM:E cross-paragraph range
tN table N; tN:rRcC for cell at row R, col C
cN, imgN, linkN comment / image / hyperlink idsRun docx info locators for the full reference.
Development
bun install && bun run prepare # set up + git hooks
bun dev <subcommand> # run via source
bun run check # biome + knip + tsc
bun run test:unit # core + cli tests (fast)
bun run test:integration # LibreOffice round-trip (needs `soffice` on PATH)
bun test # everything
bun run build # produce dist/docx via bun build --compileLibreOffice (for integration tests)
- macOS:
brew install --cask libreoffice - Linux:
sudo apt-get install libreoffice-core libreoffice-writer - Windows: https://www.libreoffice.org/download/
Architecture
src/
index.ts # binary entrypoint
cli/
index.ts # parseArgs dispatch
help.ts # top-level --help
respond.ts # JSON ack / structured error helpers
create/ # create FILE
read/ # read FILE
insert/ # insert FILE (uses ./emit Paragraph component)
edit/ # edit FILE
delete/ # delete FILE
comments/ # add | reply | resolve | delete | list
images/ # list | extract | replace
hyperlinks/ # add | list | replace | delete
track-changes/ # FILE on|off
info/ # schema | locators (reference output)
core/
package/ # JSZip open/close, named-part read/write
parser/ # XmlNode class + parse/serialize + JSX factory
jsx/ # h, Fragment, namespaces (w, r, a, wp, pic, ...)
ast/ # types + DocView + XML→AST reader
locators/ # parse "p3:5-20" + resolve to refs
tests/
core/, cli/, integration/
fixtures/Stack
- Runtime: Bun (
node:utilparseArgs, JSX with custom factory, native zlib) - Parser:
jszip+fast-xml-parser+fast-xml-builder - Quality: Biome + Knip + tsc; LibreOffice headless for round-trip integration tests
- Standard: ECMA-376 Part 1 §17 (WordprocessingML), Transitional profile
How It Works
In-place XML mutation. The AST returned by read is a view over the parsed XML tree, not a separate model. When you edit or comments add, we mutate the underlying XML nodes directly and serialize back. Anything we don't model in the AST (custom styles, theme colors, schema extensions) survives because we never re-emit untouched regions.
JSX for emitters. Constructing OOXML fragments imperatively (<w:rPr> → <w:b/> → <w:color w:val="800080"/>) gets verbose. We write emitters in JSX with a custom factory: <w.rPr><w.b/><w.color w-val="800080"/></w.rPr> becomes the right XmlNode tree. Component names are PascalCase (<Paragraph>, <RunProperties>); they return XmlNode | null so empty wrappers get omitted automatically.
Span-aware comments. comments add --range p3:5-20 finds the runs that contain offsets 5 and 20, splits them at the boundaries (preserving rPr formatting on both halves), and inserts <w:commentRangeStart> / <w:commentRangeEnd> markers between the slices. The <w:commentReference> run goes after the end marker.
ParaId auto-injection. Comments authored by tools like mammoth or older Word versions lack w14:paraId, which commentsExtended.xml requires for resolve/reply. We detect this on resolve/reply and inject a fresh paraId, also adding the xmlns:w14 namespace declaration to the <w:comments> root if missing.
Cross-format image replacement. images replace --at img0 --with new.png detects the new MIME type via Bun.file().type, renames the part (word/media/image1.jpeg → word/media/image1.png), rewrites the relationship Target, and ensures [Content_Types].xml has a <Default> for the new extension.
Hyperlink CRUD. hyperlinks list enumerates <w:hyperlink> elements with positional linkN ids; hyperlinks add --at p3:5-20 --url URL wraps an existing span (splitting runs at offsets); hyperlinks replace --at link0 --with URL updates the rels Target, allocating a new rId if the existing one is shared so siblings stay pointed at the original URL; hyperlinks delete --at link0 unwraps the link (text survives) and prunes the rels entry when no longer referenced.
CI
GitHub Actions (.github/workflows/ci.yml) runs four jobs on push to main and on PRs:
| Job | What |
|---|---|
check |
biome check . && knip-bun && tsc --noEmit |
unit-tests |
bun run test:unit (core + cli, fast) |
integration-tests |
Installs LibreOffice, runs bun run test:integration |
build-binary |
Smoke-builds via bun build --compile and runs --version |
.github/workflows/release.yml triggers on v* tags, matrix-builds the five binaries, and uploads them to a GitHub Release via softprops/action-gh-release.