videoagent-video-studio
Installation
Summary
Generate short AI videos from text or images using 7 backend models with zero API key setup.
- Supports three generation modes: text-to-video, image-to-video, and reference-based generation for consistent output
- Seven models available (minimax, kling, veo, hunyuan, grok, seedance, pixverse) with automatic selection or manual override via
--modelflag - Configurable duration (4–12 seconds), aspect ratios (16:9, 9:16, 1:1, 4:3, 3:4), and automatic prompt enhancement for better results
- Simple Node.js CLI interface with async job status checking and JSON response format returning video URLs
SKILL.md
🎬 VideoAgent Video Studio
Use when: User asks to generate a video, create a video from text, animate an image, make a short clip, or produce AI video.
Generate short AI videos with 7 backends. This skill picks the right mode (text-to-video or image-to-video), enhances the prompt for best results, and returns the video URL.
Quick Reference
| User Intent | Mode | Typical Duration |
|---|---|---|
| "Make a video of..." (no image) | text-to-video |
4–10 s |
| "Animate this image" / "Make this move" | image-to-video |
4–6 s |
| "Turn this into a video with..." | image-to-video |
4–6 s |
| Cinematic, story, ad | Prefer text-to-video with detailed prompt |
5–10 s |
Generation Modes
Related skills