Document to Text Reborn (Digital Archaeologist)

Overview

This skill uses a three-layer extraction model to "excavate" meaning and aesthetics from documents in several formats. It separates pure content from design and metadata, so each layer can be analyzed and reused on its own.

3-Layer Extraction Model

Content Layer (Soul): High-fidelity text extraction maintaining structural elements like headings and tables (Markdown output).
Aesthetic Layer (Mask): Extraction of design parameters, colors, fonts, and layout grid information.
Metadata Layer (Context): File properties, authorship, and contextual markers.
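
One way to picture the three layers is as a single result object with one field per layer. The sketch below is illustrative only: the class name `ExtractionResult`, its field names, and the sample values are assumptions made for this example, not the skill's actual output schema.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class ExtractionResult:
    """Hypothetical container for the three layers (not the skill's real schema)."""

    # Content Layer (Soul): Markdown text with headings and tables preserved.
    content: str = ""
    # Aesthetic Layer (Mask): design parameters such as colors, fonts, layout grid.
    aesthetic: Dict[str, Any] = field(default_factory=dict)
    # Metadata Layer (Context): file properties, authorship, contextual markers.
    metadata: Dict[str, Any] = field(default_factory=dict)

    def layers_present(self) -> List[str]:
        """Return the names of the layers that actually contain data."""
        return [
            name
            for name in ("content", "aesthetic", "metadata")
            if getattr(self, name)
        ]


# Made-up sample values, used only to show the shape of the object.
result = ExtractionResult(
    content="# Quarterly Report\n\n| Region | Sales |\n|---|---|\n| East | 10 |",
    aesthetic={"fonts": ["Helvetica"], "colors": ["#1A1A1A"]},
)
print(result.layers_present())
```

Keeping the layers in separate fields means a caller can reuse the Markdown content without touching the design data, which is the separation the model describes.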

Supported Formats

PDF: Text and metadata. (Aesthetic: Coordinate-based analysis)
Word (.docx): Structural Markdown conversion. (Aesthetic: Style extraction)
Excel (.xlsx): Multi-sheet CSV extraction.
PowerPoint (.pptx): Slide-based content extraction.
Images: OCR supporting English and Japanese.
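
The format list above suggests a dispatch step that picks an extraction strategy from the file type. The following is a minimal sketch of that idea, assuming selection by file extension. The function name `detect_format`, the format labels, and the image extensions (`.png`, `.jpg`, `.jpeg`) are assumptions for illustration; the source names only `.docx`, `.xlsx`, `.pptx`, PDF, and "Images".

```python
from pathlib import Path

# Extension-to-format table. The image extensions are assumed; the source
# says only "Images" without naming specific file types.
FORMAT_BY_EXTENSION = {
    ".pdf": "pdf",          # text and metadata; coordinate-based aesthetic analysis
    ".docx": "word",        # structural Markdown conversion; style extraction
    ".xlsx": "excel",       # multi-sheet CSV extraction
    ".pptx": "powerpoint",  # slide-based content extraction
    ".png": "image",        # OCR (English and Japanese)
    ".jpg": "image",
    ".jpeg": "image",
}


def detect_format(path: str) -> str:
    """Map a file path to a format label, ignoring extension case."""
    ext = Path(path).suffix.lower()
    if ext not in FORMAT_BY_EXTENSION:
        raise ValueError(f"unsupported format: {ext or '(no extension)'}")
    return FORMAT_BY_EXTENSION[ext]


print(detect_format("Report.PDF"))
print(detect_format("slides/deck.pptx"))
```

A real implementation might inspect file signatures rather than trusting extensions; the lookup table is used here only because it keeps the mapping to the supported formats easy to read.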


Repository

famaoai-creator…i-skills

First Seen

Feb 13, 2026

Security Audits

Gen Agent Trust Hub: Pass
Socket: Pass
Snyk: Fail
