content-extraction
This skill extracts ALL content from an existing website and outputs it as structured, reusable data files. It crawls every page, downloads every asset, and produces a complete content inventory.
The user provides: the URL of the site to extract from, and optionally the target format (TypeScript, JSON, Markdown).
What Gets Extracted
For each page on the site:
| Content type | Output |
|---|---|
| Text | Headings, paragraphs, lists, quotes — preserved with hierarchy |
| Metadata | <title>, <meta description>, OG tags, canonical URL, lang |
| Images | Downloaded to public/images/ with original filenames. Alt text cataloged |
| Links | Internal + external, with anchor text and destination URL |
| PDFs & assets | Downloaded to public/assets/. Filenames and original URLs cataloged |
| Forms | Field names, types, labels, validation rules, action URLs |
| Navigation | Menu structure, link hierarchy, active states |
| Structured data | JSON-LD, microdata, schema.org markup |
More from saccoai/agent-skills
website-analysis
Crawl any website in a single pass to produce both a complete structural map and full content extraction. Discovers all pages, routes, navigation, multilingual variants, and issues while simultaneously extracting all text, images, metadata, and assets. Use before any migration, redesign, or audit.
16nextjs-fullstack
Opinionated Next.js fullstack patterns — App Router, Tailwind CSS v4, shadcn/ui, Better Auth, Drizzle ORM, Server Actions, and Vercel deployment. Use when scaffolding a new project or enforcing consistent architecture across client projects.
13seo-migration
SEO preservation during website migrations — redirect mapping, canonical URLs, sitemap generation, structured data, meta tags, and Search Console verification. Use when rebuilding a site to ensure zero SEO loss from URL changes, content moves, or domain switches.
9project-handoff
Generate complete client handoff documentation — deployment guide, environment setup, CMS instructions, maintenance checklist, architecture overview, and operational runbook. Use when delivering a finished project to a client or their team.
8client-proposal
Generate a professional project proposal from a website audit. Analyzes the prospect's current site, identifies issues, and produces a structured proposal with scope, deliverables, tech recommendations, and phased timeline. Use as a sales tool or for scoping client engagements.
6web-audit
Comprehensive website quality audit — Lighthouse scores, accessibility (axe-core), cross-browser testing, performance budgets, and mobile responsiveness. Generates actionable reports with pass/fail per page. Use to audit any live website or as a QA gate before deployment.
6