extract
Extract clean content from specific URLs using Tavily's extraction API.
- Supports up to 20 URLs per request with optional query-based reranking to focus on relevant content chunks
- Two extraction modes: basic for fast text extraction, advanced for JavaScript-rendered pages and structured data
- Automatic OAuth authentication via browser on first run, or manual API key configuration in settings
- Returns markdown or plain text format with optional image URLs and configurable timeout up to 60 seconds
Extract Skill
Extract clean content from specific URLs. Ideal when you know which pages you want content from.
Authentication
The script uses OAuth via the Tavily MCP server. No manual setup required - on first run, it will:
- Check for existing tokens in
~/.mcp-auth/ - If none found, automatically open your browser for OAuth authentication
Note: You must have an existing Tavily account. The OAuth flow only supports login — account creation is not available through this flow. Sign up at tavily.com first if you don't have an account.
Alternative: API Key
If you prefer using an API key, get one at https://tavily.com and add to ~/.claude/settings.json:
{
"env": {
"TAVILY_API_KEY": "tvly-your-api-key-here"
More from tavily-ai/skills
tavily-search
|
16.7Ksearch
Search the web using Tavily's LLM-optimized search API. Returns relevant results with content snippets, scores, and metadata. Use when you need to find web content on any topic without writing code.
11.9Ktavily-best-practices
Build production-ready Tavily integrations with best practices baked in. Reference documentation for developers using coding assistants (Claude Code, Cursor, etc.) to implement web search, content extraction, crawling, and research in agentic workflows, RAG systems, or autonomous agents.
9.7Ktavily-research
|
9.4Kresearch
Comprehensive research grounded in web data with explicit citations. Use when you need multi-source synthesis—comparisons, current events, market analysis, detailed reports.
6.6Ktavily-extract
|
6.6K