using-web-scraping
Web Scraping Skill — Chrome (Playwright) + DuckDuckGo
A privacy-minded, agent-facing web-scraping skill that uses headless Chrome (Playwright/Puppeteer) for page visits and DuckDuckGo for search. It focuses on reliable navigation, structured text extraction, robots.txt compliance, and rate limiting.
When to use
- Collect public webpage content for summarization, metadata extraction, or link discovery.
- Use DuckDuckGo for queries when you want a privacy-respecting search source.
- NOT for bypassing paywalls, scraping private/logged-in content, or violating Terms of Service.
Safety & etiquette
- Always check and respect /robots.txt before scraping a site.
- Rate-limit requests (default: 1 request/sec) and use polite User-Agent strings.
- Avoid executing arbitrary user-provided JavaScript on scraped pages.
- Only scrape public content; if login is required, return login_required instead of attempting to bypass.
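The robots.txt and rate-limit rules above can be sketched in Python with only the standard library. This is a minimal illustration, not the skill's actual implementation: the names `build_robots_parser`, `may_fetch`, `RateLimiter`, and the `USER_AGENT` value are hypothetical, and the sketch assumes the robots.txt text has already been downloaded.

```python
import time
from urllib import robotparser

# Hypothetical User-Agent string; a real deployment would identify itself honestly.
USER_AGENT = "example-skill-bot/0.1"


def build_robots_parser(robots_txt: str) -> robotparser.RobotFileParser:
    """Parse robots.txt text that has already been downloaded."""
    parser = robotparser.RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def may_fetch(parser: robotparser.RobotFileParser, url: str,
              user_agent: str = USER_AGENT) -> bool:
    """Return True only if the parsed rules allow this agent to fetch the URL."""
    return parser.can_fetch(user_agent, url)


class RateLimiter:
    """Enforce a minimum interval between requests (default: 1 request/sec)."""

    def __init__(self, min_interval: float = 1.0) -> None:
        self.min_interval = min_interval
        self._last = None  # monotonic timestamp of the previous request

    def wait(self) -> float:
        """Sleep if the previous request was too recent; return seconds slept."""
        delay = 0.0
        if self._last is not None:
            elapsed = time.monotonic() - self._last
            delay = max(0.0, self.min_interval - elapsed)
        if delay > 0:
            time.sleep(delay)
        self._last = time.monotonic()
        return delay
```

A caller would check `may_fetch(...)` first, skip the URL when it returns `False`, and call `limiter.wait()` immediately before each page visit.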
Capabilities
- Search DuckDuckGo and return top-N result links.
- Visit result pages in headless Chrome and extract title, meta description, main text (or best-effort article text), and canonical URL.
- Return results as structured JSON for downstream consumption.
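As a rough sketch of the extraction step, the following Python code pulls the listed fields out of an HTML string and serializes them as JSON. It assumes the rendered HTML has already been obtained (for example from Playwright's `page.content()`); the class `PageMetadataParser`, the function `extract_page`, and the JSON field names are hypothetical, and the "main text" here is a naive best-effort join of visible text rather than real article detection.

```python
import json
from html.parser import HTMLParser

SKIPPED_TAGS = ("script", "style", "noscript")


class PageMetadataParser(HTMLParser):
    """Collect title, meta description, canonical URL, and visible text."""

    def __init__(self) -> None:
        super().__init__(convert_charrefs=True)
        self.title = ""
        self.meta_description = None
        self.canonical_url = None
        self.text_parts = []
        self._in_title = False
        self._skip_depth = 0  # > 0 while inside script/style/noscript

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag in SKIPPED_TAGS:
            self._skip_depth += 1
        elif tag == "meta" and (attr.get("name") or "").lower() == "description":
            self.meta_description = attr.get("content")
        elif tag == "link" and "canonical" in (attr.get("rel") or "").lower().split():
            self.canonical_url = attr.get("href")

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in SKIPPED_TAGS and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        elif not self._skip_depth and data.strip():
            self.text_parts.append(data.strip())


def extract_page(html: str, url: str) -> dict:
    """Return the extracted fields as a JSON-serializable dict."""
    parser = PageMetadataParser()
    parser.feed(html)
    parser.close()
    return {
        "url": url,
        "title": parser.title.strip(),
        "meta_description": parser.meta_description,
        "main_text": " ".join(parser.text_parts),
        "canonical_url": parser.canonical_url,
    }


def to_json(results: list) -> str:
    """Serialize a list of page results for downstream consumption."""
    return json.dumps(results, ensure_ascii=False, indent=2)
```

Calling `to_json([extract_page(html, url)])` yields one JSON array entry per visited page; fields the page does not provide come back as `null`.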