muapi-ugc-video-factory
Installation
SKILL.md
UGC Video Factory
Turn a person photo + product photo (+ optional script & environment) into a vertical 9:16 UGC-style video ad with native dialogue audio.
A three-stage pipeline:
- GPT writes a director-grade ultra-realistic lifestyle photography prompt from your inputs.
- Nano-Banana Pro Edit fuses the person + product into a single hero photo (1K, 9:16).
- Seedance 2.0 VIP Image-to-Video animates the hero photo into a 10s vertical UGC clip with synced spoken audio.
Inputs
| Name | Type | Required | Default | Description |
|---|---|---|---|---|
person |
image_url | yes | — | Photo of the person who will appear in the ad (face + upper body works best). |
product |
image_url | yes | — | Clear photo of the product (preferably on neutral background, logo/text legible). |
script |
text | no | Okay… first of all, ship happens. And this hat is honestly my favorite. It also comes in navy and black, so you can pick your vibe. |
The exact line the on-screen person will say (kept short — 1–2 sentences fit 10s comfortably). |
environment |
text | no | study room, laptop in front of it |
Scene / context where the person is using the product (e.g. "bathroom mirror, morning routine", "coffee shop window seat"). |