firecrawl-data-handling
FireCrawl Data Handling
Overview
Manage scraped web content from FireCrawl pipelines. Covers content extraction filtering, HTML sanitization, markdown cleaning, structured data validation, and storage patterns for crawled content.
Prerequisites
- FireCrawl API key
- @mendable/firecrawl-js SDK
- Storage system for crawled content
- Understanding of web scraping data formats
Instructions
Step 1: Content Format Selection and Cleaning
import FirecrawlApp from '@mendable/firecrawl-js';

// Create the client; the API key is read from the FIRECRAWL_API_KEY environment variable.
const firecrawl = new FirecrawlApp({
  apiKey: process.env.FIRECRAWL_API_KEY!,
});
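The cleaning half of this step can be sketched as a small pure function applied to a scraped markdown string. This is only an illustration under stated assumptions: cleanMarkdown is a hypothetical helper, not part of the FireCrawl SDK, and the rules it applies (dropping images, trimming trailing whitespace, collapsing blank lines) are one reasonable choice rather than a prescribed set.

```typescript
// Hypothetical helper (not part of the FireCrawl SDK): tidy a scraped markdown string.
function cleanMarkdown(markdown: string): string {
  return markdown
    // Drop image syntax such as ![alt](url); assumed to be noise for text pipelines.
    .replace(/!\[[^\]]*\]\([^)]*\)/g, '')
    // Strip trailing spaces and tabs at the end of every line.
    .replace(/[ \t]+$/gm, '')
    // Collapse runs of three or more newlines into a single blank line.
    .replace(/\n{3,}/g, '\n\n')
    // Remove leading and trailing whitespace from the whole document.
    .trim();
}

// Example input standing in for the markdown field of a scrape result.
const raw = '# Title  \n\n\n\n![logo](https://example.com/logo.png)\nBody text\t\n';
console.log(cleanMarkdown(raw));
```

Keeping the cleaner free of network calls means it can be unit-tested in isolation and reused on content loaded back from storage. Running it twice gives the same result as running it once, so re-processing stored content is safe.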