document-converter
Document Converter (MarkItDown)
Convert Office and structured documents to Markdown for LLM consumption using Microsoft MarkItDown.
Overview
MarkItDown is a Microsoft open-source tool that converts a wide range of document formats into clean Markdown text. This is essential for academic research workflows where source materials arrive in diverse formats (PDFs from journals, PPTX from conferences, DOCX from collaborators, XLSX data tables).
Supported Formats
| Format | Extensions | Notes |
|---|---|---|
| PDF | .pdf | Text extraction; scanned PDFs require OCR |
| Word | .docx | Paragraphs, tables, lists, headings preserved |
| PowerPoint | .pptx | Slide-by-slide with speaker notes |
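
As a rough illustration of the workflow described above, the following sketch wraps MarkItDown's Python interface (`MarkItDown().convert(...)`, whose result exposes `text_content`) in a small helper. The helper name `convert_to_markdown`, the `SUPPORTED` set, and the optional `converter` parameter are assumptions made for this example and are not part of MarkItDown itself; the extension check covers only the formats listed in the table.

```python
from pathlib import Path

# Extensions taken from the table above; this set is illustrative,
# not an exhaustive list of what MarkItDown accepts.
SUPPORTED = {".pdf", ".docx", ".pptx"}


def convert_to_markdown(path, converter=None):
    """Return the Markdown text for a document at `path`.

    `converter` is a hypothetical injection point: any object with a
    `convert(source)` method whose result has a `text_content` attribute.
    When omitted, a MarkItDown instance is created.
    """
    path = Path(path)
    if path.suffix.lower() not in SUPPORTED:
        raise ValueError(f"Unsupported extension: {path.suffix!r}")

    if converter is None:
        # Imported lazily so the helper can be exercised without the package.
        from markitdown import MarkItDown
        converter = MarkItDown()

    result = converter.convert(str(path))
    return result.text_content


if __name__ == "__main__":
    # "paper.pdf" is a placeholder file name for this sketch.
    print(convert_to_markdown("paper.pdf"))
```

Running this requires the `markitdown` package to be installed (for example with `pip install markitdown`). As the table notes, scanned PDFs need OCR before any text can be extracted.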