transloadit-media-processing

Summary

Cloud-based media processing for video, audio, images, and documents using 86+ specialized robots.

  • Supports video encoding (HLS, MP4, WebM), thumbnail generation, image resizing/watermarking, audio transcoding, document OCR, and speech-to-text via chainable processing steps
  • Accessible through an MCP server (recommended for IDE integration) or a CLI; requires a free Transloadit account with API credentials
  • Build multi-step pipelines by chaining robot operations together using the "use" field; reuse pipelines as templates with dynamic variables
  • Includes preset configurations for common formats (HLS-1080p, MP3, WebP) and batch processing via HTTP import from URLs, S3, GCS, or other cloud storage
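
The chaining described above can be sketched as plain data. This is a minimal sketch, assuming the instructions are a JSON object with a top-level `steps` map in which each step names a `robot` and points at its input through `use`. The robot paths (`/http/import`, `/image/resize`), the parameter names, the placeholder URL, and the helper `check_use_references` are assumptions made for illustration, not taken from this page; check them against the Transloadit robot documentation before relying on them.

```python
import json

# Hypothetical pipeline: import a file over HTTP, then resize it to WebP.
# Step names ("imported", "resized") are arbitrary labels chosen here.
instructions = {
    "steps": {
        "imported": {
            "robot": "/http/import",                 # assumed robot path
            "url": "https://example.com/photo.jpg",  # placeholder URL
        },
        "resized": {
            "robot": "/image/resize",                # assumed robot path
            "use": "imported",  # chain: consume the output of "imported"
            "width": 640,
            "format": "webp",
        },
    }
}


def check_use_references(instr):
    """Return (step, missing_input) pairs for "use" values that name no step."""
    steps = instr["steps"]
    problems = []
    for name, step in steps.items():
        use = step.get("use")
        if use is None:
            continue
        sources = use if isinstance(use, list) else [use]
        for source in sources:
            # ":original" is assumed to denote the uploaded input file.
            if source != ":original" and source not in steps:
                problems.append((name, source))
    return problems


print(check_use_references(instructions))  # empty list when every "use" resolves
print(json.dumps(instructions, indent=2))  # JSON form of the pipeline
```

Checking the `use` references locally catches a misspelled step name before the pipeline is submitted; the check itself is a local convenience and not part of any Transloadit tooling.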
SKILL.md

Transloadit Media Processing

Process, transform, and encode media files using Transloadit's cloud infrastructure. Supports video, audio, images, and documents with 86+ specialized processing robots.

When to Use This Skill

Use this skill when you need to:

  • Encode video to HLS, MP4, WebM, or other formats
  • Generate thumbnails or animated GIFs from video
  • Resize, crop, watermark, or optimize images
  • Convert between image formats (JPEG, PNG, WebP, AVIF, HEIF)
  • Extract or transcode audio (MP3, AAC, FLAC, WAV)
  • Concatenate video or audio clips
  • Add subtitles or overlay text on video
  • OCR documents (PDF, scanned images)
  • Run speech-to-text or text-to-speech
  • Apply AI-based content moderation or object detection
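
Several of the tasks listed above can run against the same input. The following is a minimal sketch, assuming two steps may both name the uploaded file as their input so that one upload yields an encoded video and a set of thumbnails. The robot paths (`/upload/handle`, `/video/encode`, `/video/thumbs`), the `count` parameter, the exact preset spelling `hls-1080p`, and the helper `final_steps` are assumptions for illustration and should be verified against the Transloadit documentation.

```python
# Hypothetical fan-out: two steps both read the uploaded file.
instructions = {
    "steps": {
        ":original": {"robot": "/upload/handle"},  # assumed upload step
        "encoded": {
            "robot": "/video/encode",              # assumed robot path
            "use": ":original",
            "preset": "hls-1080p",                 # assumed preset spelling
        },
        "thumbnails": {
            "robot": "/video/thumbs",              # assumed robot path
            "use": ":original",
            "count": 4,                            # assumed parameter name
        },
    }
}


def final_steps(instr):
    """Return names of steps whose output no other step consumes."""
    steps = instr["steps"]
    consumed = set()
    for step in steps.values():
        use = step.get("use", [])
        consumed.update(use if isinstance(use, list) else [use])
    return [name for name in steps if name not in consumed]


print(final_steps(instructions))
```

Here `final_steps` reports `encoded` and `thumbnails`, the two branches that produce deliverable files, while the upload step is excluded because both branches consume it.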
Installs: 8.4K
GitHub Stars: 32.8K
First Seen: Feb 18, 2026