processing-docx
Installation
SKILL.md
DOCX Processing
.docx is a ZIP archive of XML and resources. Different tasks have different tools and workflows.
Workflow Decision
| Intent | Workflow | Reference |
|---|---|---|
| Read/analyze text only | pandoc → markdown | raw-xml-access.md |
| Read structure, comments, media, formatting | unpack → raw XML | raw-xml-access.md |
| Create new document | docx-js (JS/TS) | docx-js.md |
| Edit own document, simple changes | Document library (Python) | ooxml.md |
| Edit someone else's document | Redlining (tracked changes) | redlining.md |
| Legal / academic / business / gov docs | Redlining — REQUIRED | redlining.md |
| Visual analysis | soffice → PDF → pdftoppm | raw-xml-access.md |