article-extractor
Installation
SKILL.md
Article Extractor
Extract main content from web articles and blog posts, removing navigation, ads, and clutter. Saves clean, readable text.
Prerequisites
This skill requires UV for dependency management. Run from the tapestry-skills project root.
Workflow
URL → Validate → Detect Tool → Extract Content → Sanitize Filename → Save
Tools (in priority order):
reader(Mozilla Readability) - best for most articlestrafilatura- excellent for blogs/news (included in dependencies)- Fallback (
tapestry-extract-html) - works without external tools