document-diff
Document Diff
Overview
Compare two versions of a document and see what changed in the content. SoMark first parses both files into clean Markdown, and the diff is then generated on that text. The result shows what changed between two versions of a contract, report, policy document, or any other file.
Why parse before diffing?
Diffing raw PDF or Word binaries is not meaningful. Parsing both documents into clean Markdown first lets the diff capture semantic changes (actual content additions, deletions, and modifications) rather than binary noise.
In short: parse both documents with SoMark, then diff the structured output.
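As a rough illustration of the diff step only, the sketch below compares two Markdown strings with Python's standard `difflib`. It assumes both documents have already been parsed to Markdown; the SoMark call itself is not shown, and the `diff_markdown` helper, the file labels, and the sample text are hypothetical names chosen for this example, not part of the skill.

```python
import difflib


def diff_markdown(old_md: str, new_md: str,
                  old_name: str = "old.md", new_name: str = "new.md") -> str:
    """Return a unified diff of two Markdown strings (hypothetical helper)."""
    # Keep line endings so the joined diff output is well formed.
    old_lines = old_md.splitlines(keepends=True)
    new_lines = new_md.splitlines(keepends=True)
    diff = difflib.unified_diff(old_lines, new_lines,
                                fromfile=old_name, tofile=new_name)
    return "".join(diff)


# Sample text standing in for the Markdown produced by the parsing step.
old_md = "# Agreement\n\nPayment is due within 30 days.\n"
new_md = "# Agreement\n\nPayment is due within 45 days.\n"

result = diff_markdown(old_md, new_md)
print(result)
```

Lines prefixed with `-` come from the old version and lines prefixed with `+` from the new one; unchanged lines appear as context.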
When to trigger
- Compare two versions of a document
- Find what changed between two contracts, reports, or policies
- Identify added or removed clauses in an agreement
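For the last case, one possible way to list added and removed clauses is to compare the parsed Markdown line by line. This is a minimal sketch that assumes each clause sits on its own line in the parsed output; `added_and_removed` and the sample clauses are hypothetical and only illustrate the idea.

```python
import difflib


def added_and_removed(old_md: str, new_md: str) -> tuple[list[str], list[str]]:
    """Return (added, removed) non-empty lines between two Markdown strings."""
    # Drop blank lines and surrounding whitespace so layout changes are ignored.
    old_lines = [line.strip() for line in old_md.splitlines() if line.strip()]
    new_lines = [line.strip() for line in new_md.splitlines() if line.strip()]

    matcher = difflib.SequenceMatcher(a=old_lines, b=new_lines, autojunk=False)
    added, removed = [], []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag in ("delete", "replace"):
            removed.extend(old_lines[i1:i2])
        if tag in ("insert", "replace"):
            added.extend(new_lines[j1:j2])
    return added, removed


# Sample clauses standing in for parsed agreement text.
old_md = ("## Terms\n\n1. Payment within 30 days.\n"
          "2. Either party may terminate with notice.\n")
new_md = ("## Terms\n\n1. Payment within 30 days.\n"
          "3. Disputes go to arbitration.\n")

added, removed = added_and_removed(old_md, new_md)
print("Added:", added)
print("Removed:", removed)
```

A clause that was reworded rather than dropped shows up in both lists, once in its old form and once in its new form.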