tavily-extract
Extract clean markdown or text from up to 20 URLs, with JavaScript rendering and query-focused chunking support.
- Handles JavaScript-rendered pages with configurable extraction depth (basic for simple pages, advanced for dynamic SPAs and tables)
- Supports query-focused extraction to return only relevant content chunks instead of full pages
- Returns LLM-optimized markdown by default, with options for plain text format and structured JSON output
- Processes up to 20 URLs in a single call; integrates into the Tavily workflow as the extraction step following search results
tavily extract
Extract clean markdown or text content from one or more URLs.
Before running any command
If tvly is not found on PATH, install it first:
curl -fsSL https://cli.tavily.com/install.sh | bash && tvly login
Do not skip this step or fall back to other tools.
See tavily-cli for alternative install methods and auth options.
When to use
- You have a specific URL and want its content
More from tavily-ai/skills
tavily-search
|
16.5Ksearch
Search the web using Tavily's LLM-optimized search API. Returns relevant results with content snippets, scores, and metadata. Use when you need to find web content on any topic without writing code.
11.9Ktavily-best-practices
Build production-ready Tavily integrations with best practices baked in. Reference documentation for developers using coding assistants (Claude Code, Cursor, etc.) to implement web search, content extraction, crawling, and research in agentic workflows, RAG systems, or autonomous agents.
9.6Ktavily-research
|
9.3Kresearch
Comprehensive research grounded in web data with explicit citations. Use when you need multi-source synthesis—comparisons, current events, market analysis, detailed reports.
6.6Ktavily-cli
|
6.3K