Xiaohongshu Search and Summarize

This skill automates the extraction of high-quality multi-modal content (text and images) from Xiaohongshu (小红书) and helps you produce a deeply integrated, analytical final report for the user. Because Xiaohongshu enforces aggressive anti-scraping mechanisms, direct HTTP requests and naive scrapers are often blocked or answered with 404s. This skill avoids those failures by driving a headed browser window through the playwright-cli, so that the traffic resembles a real user's session.

It operates in two distinct phases:

Phase 1: Subagent Data Collection

Simulate a search for the keyword on Xiaohongshu in a headed browser.
Advance through each post's image slider so that every lazy-loaded image in the top N posts is fully loaded.
Extract titles, descriptions, top comments, and all high-resolution images.
Download those images to a local directory and generate a raw data document ([keyword]_raw_data.md).
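The last step of Phase 1 can be sketched as follows. This is a minimal illustration, not the skill's actual implementation: the `Post` fields, the heading layout, and the use of markdown image links are all assumptions, since the format of the raw data document is not specified here. Only the `[keyword]_raw_data.md` file name comes from the description above.

```python
from dataclasses import dataclass, field
from pathlib import Path


@dataclass
class Post:
    """One collected post. The field names are illustrative assumptions."""
    title: str
    description: str
    comments: list[str] = field(default_factory=list)
    image_paths: list[str] = field(default_factory=list)


def render_raw_data(keyword: str, posts: list[Post]) -> str:
    """Render the collected posts as one markdown document (assumed layout)."""
    lines = [f"# Raw data for: {keyword}", ""]
    for index, post in enumerate(posts, start=1):
        lines += [f"## Post {index}: {post.title}", "", post.description, ""]
        lines.append("### Top comments")
        lines += [f"- {comment}" for comment in post.comments]
        lines.append("")
        lines.append("### Images")
        # Local paths of the already downloaded images, one link per line.
        lines += [f"![post {index} image]({path})" for path in post.image_paths]
        lines.append("")
    return "\n".join(lines)


def write_raw_data(keyword: str, posts: list[Post], out_dir: Path) -> Path:
    """Write [keyword]_raw_data.md into out_dir and return its path."""
    out_dir.mkdir(parents=True, exist_ok=True)
    target = out_dir / f"{keyword}_raw_data.md"
    target.write_text(render_raw_data(keyword, posts), encoding="utf-8")
    return target
```

Keeping the image links in the same document as the text lets Phase 2 resolve every image from a single file.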

Phase 2: AI Multi-Modal Synthesis (Your Job)

You MUST use your file reading capabilities to read the [keyword]_raw_data.md file.
Inside the raw data markdown, you will find paths to image files. You MUST use your file reading / vision capabilities on these image file paths to actually ingest and "see" their visual content. If you skip this step, you are only reading file names, not the images themselves!
Analyze the text, summarize the genuinely useful comments (discarding noise such as "pm me"), and interpret the semantic content of the images you just viewed (e.g. diagrams, guidelines, step-by-step UI flows).
Compile everything into a single, comprehensive, well-synthesized report rather than a linear list of posts.
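Two of the Phase 2 steps lend themselves to a small helper: listing the image paths that must be opened, and discarding noisy comments. The sketch below assumes the raw data file references images with standard markdown image syntax, which is not confirmed by this document. The `NOISE_MARKERS` tuple is a hypothetical starting list built around the "pm me" example.

```python
import re
from pathlib import Path

# Matches markdown image links such as ![alt](path/to/image.jpg).
IMAGE_LINK = re.compile(r"!\[[^\]]*\]\(([^)\s]+)\)")

# Hypothetical noise markers; extend to fit the comments actually collected.
NOISE_MARKERS = ("pm me", "dm me", "私信")


def extract_image_paths(markdown_text: str) -> list[str]:
    """Return every image path referenced in the raw data markdown, in order."""
    return IMAGE_LINK.findall(markdown_text)


def missing_images(markdown_text: str, base_dir: Path) -> list[str]:
    """Return referenced image paths that do not exist under base_dir."""
    return [
        path
        for path in extract_image_paths(markdown_text)
        if not (base_dir / path).is_file()
    ]


def is_noise(comment: str) -> bool:
    """Flag comments that only solicit private contact."""
    lowered = comment.lower()
    return any(marker in lowered for marker in NOISE_MARKERS)
```

Each path returned by `extract_image_paths` still has to be opened with your file reading or vision capability. The helper only tells you which files to view, it does not replace viewing them.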

xiaohongshu-search-summarizer


Dependencies