cloudflare-vectorize

Installation
Summary

Semantic search and RAG with Cloudflare Vectorize V2, supporting 5M vectors, 31ms latency, and async mutations.

  • Vectorize V2 (GA September 2024) brings 25× capacity increase (200K → 5M vectors), 18× faster queries (549ms → 31ms), and async mutations with breaking API changes including returnMetadata boolean → string enum
  • Metadata filtering with 10 indexes per collection, range operators ($gte, $lt, etc.), and cardinality-aware query optimization for RAG and semantic search workflows
  • Batch insert optimization: 5000-vector batches deliver 18× throughput improvement; individual inserts impractical at scale
  • Native Workers AI integration for embeddings (@cf/baai/bge-base-en-v1.5, 768 dims) plus OpenAI support; dimension limit of 1536 requires reduction for larger models
  • 14 common errors prevented: dimension mismatches, metadata index timing, V2 async mutation handling, TypeScript type gaps, Windows dev registry issues, and topK conditional limits
SKILL.md

Cloudflare Vectorize

Complete implementation guide for Cloudflare Vectorize - a globally distributed vector database for building semantic search, RAG (Retrieval Augmented Generation), and AI-powered applications with Cloudflare Workers.

Status: Production Ready ✅ Last Updated: 2026-01-21 Dependencies: cloudflare-worker-base (for Worker setup), cloudflare-workers-ai (for embeddings) Latest Versions: wrangler@4.59.3, @cloudflare/workers-types@4.20260109.0 Token Savings: ~70% Errors Prevented: 14 Dev Time Saved: ~4 hours

What This Skill Provides

Core Capabilities

  • Index Management: Create, configure, and manage vector indexes
  • Vector Operations: Insert, upsert, query, delete, and list vectors (list-vectors added August 2025)
  • Metadata Filtering: Advanced filtering with 10 metadata indexes per index
  • Semantic Search: Find similar vectors using cosine, euclidean, or dot-product metrics
Related skills
Installs
332
GitHub Stars
776
First Seen
Jan 20, 2026