xiaohongshu-search-summarizer
SKILL.md
Xiaohongshu Search and Summarize
This skill automates the extraction of high-quality multi-modal content (text + images) from Xiaohongshu (小红书) and then helps you produce a deeply integrated, analytical final report for the user. Because Xiaohongshu employs aggressive anti-scraping mechanisms, direct HTTP requests or naive scraping often result in 404 responses or blocks. This skill works around them by simulating a real user through playwright-cli in a headed browser window.
It operates in two distinct phases:
Phase 1: Subagent Data Collection
- Simulate a search for the keyword on Xiaohongshu in a headed browser.
- Advance through image sliders to fully load all lazy-loaded images from the top N posts.
- Extract titles, descriptions, top comments, and all high-resolution images.
- Download those images to a local directory and generate a raw data document ([keyword]_raw_data.md).
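
To make the output of Phase 1 concrete, here is a minimal Python sketch of how collected posts might be rendered into the raw data document. The `Post` class, the function names, and the markdown layout (headings, comment bullets, image links) are all assumptions for illustration; only the [keyword]_raw_data.md naming comes from this skill, and the actual collector may structure the file differently.

```python
from dataclasses import dataclass, field


@dataclass
class Post:
    """Hypothetical container for one collected post (not the skill's real type)."""
    title: str
    description: str
    comments: list[str] = field(default_factory=list)
    image_paths: list[str] = field(default_factory=list)


def raw_data_filename(keyword: str) -> str:
    # Follows the [keyword]_raw_data.md naming described above.
    return f"{keyword}_raw_data.md"


def render_raw_data(keyword: str, posts: list[Post]) -> str:
    """Render collected posts as markdown. The layout is an assumption."""
    lines = [f"# Raw data for: {keyword}", ""]
    for index, post in enumerate(posts, start=1):
        lines += [f"## Post {index}: {post.title}", "", post.description, ""]
        lines.append("### Top comments")
        lines += [f"- {comment}" for comment in post.comments]
        lines.append("")
        lines.append("### Images")
        # Local paths of the downloaded images, written as markdown image links.
        lines += [
            f"![post {index} image {n}]({path})"
            for n, path in enumerate(post.image_paths, start=1)
        ]
        lines.append("")
    return "\n".join(lines)
```

Writing image references as local paths rather than remote URLs keeps the document usable after the browser session ends, which is what Phase 2 relies on.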
Phase 2: AI Multi-Modal Synthesis (Your Job)
- You MUST use your file reading capabilities to read the [keyword]_raw_data.md file.
- Inside the raw data markdown, you will find paths to image files. You MUST use your file reading / vision capabilities on these image file paths to actually ingest and "see" their visual content. If you skip this step, you are only reading file names, not the images themselves!
- Analyze the text, summarize the genuinely useful comments (discarding noise such as "pm me"), and interpret the semantic content of the images you just viewed (e.g., diagrams, guidelines, step-by-step UI flows).
- Compile everything into a single, comprehensive, well-synthesized report rather than a linear list of posts.
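
As a rough aid for Phase 2, the Python sketch below lists the image paths that must be opened and drops obvious noise comments before summarizing. It assumes images appear in the raw data file as standard markdown image links, which this document does not confirm. The helper names and the noise patterns beyond "pm me" are illustrative assumptions.

```python
import re

# Matches standard markdown image links: ![alt](path)
IMAGE_LINK = re.compile(r"!\[[^\]]*\]\(([^)\s]+)\)")

# "pm me" comes from the skill description; "dm me" is an illustrative addition.
NOISE_PATTERNS = [
    re.compile(pattern, re.IGNORECASE)
    for pattern in (r"\bpm me\b", r"\bdm me\b")
]


def extract_image_paths(markdown_text: str) -> list[str]:
    """Return image paths in document order, without duplicates."""
    seen: set[str] = set()
    ordered: list[str] = []
    for path in IMAGE_LINK.findall(markdown_text):
        if path not in seen:
            seen.add(path)
            ordered.append(path)
    return ordered


def is_noise(comment: str) -> bool:
    """True if the comment matches a known low-value pattern."""
    return any(pattern.search(comment) for pattern in NOISE_PATTERNS)


def filter_comments(comments: list[str]) -> list[str]:
    """Keep only comments that are not flagged as noise."""
    return [comment for comment in comments if not is_noise(comment)]
```

Each path returned by `extract_image_paths` still has to be opened with your file reading / vision capabilities. The list only tells you which files to view and does not replace viewing them.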