videoagent-video-studio

Installation
Summary

Generate short AI videos from text or images using 7 backend models with zero API key setup.

  • Supports three generation modes: text-to-video, image-to-video, and reference-based generation for consistent output
  • Seven models available (minimax, kling, veo, hunyuan, grok, seedance, pixverse) with automatic selection or manual override via --model flag
  • Configurable duration (4–12 seconds), aspect ratios (16:9, 9:16, 1:1, 4:3, 3:4), and automatic prompt enhancement for better results
  • Simple Node.js CLI interface with async job status checking and JSON response format returning video URLs
SKILL.md

🎬 VideoAgent Video Studio

Use when: User asks to generate a video, create a video from text, animate an image, make a short clip, or produce AI video.

Generate short AI videos with 7 backends. This skill picks the right mode (text-to-video or image-to-video), enhances the prompt for best results, and returns the video URL.


Quick Reference

User Intent Mode Typical Duration
"Make a video of..." (no image) text-to-video 4–10 s
"Animate this image" / "Make this move" image-to-video 4–6 s
"Turn this into a video with..." image-to-video 4–6 s
Cinematic, story, ad Prefer text-to-video with detailed prompt 5–10 s

Generation Modes

Related skills
Installs
10.0K
GitHub Stars
732
First Seen
Mar 7, 2026