muapi-ugc-video-factory

Installation
SKILL.md

UGC Video Factory

Turn a person photo + product photo (+ optional script & environment) into a vertical 9:16 UGC-style video ad with native dialogue audio.

A three-stage pipeline:

  1. GPT writes a director-grade ultra-realistic lifestyle photography prompt from your inputs.
  2. Nano-Banana Pro Edit fuses the person + product into a single hero photo (1K, 9:16).
  3. Seedance 2.0 VIP Image-to-Video animates the hero photo into a 10s vertical UGC clip with synced spoken audio.

Inputs

Name Type Required Default Description
person image_url yes Photo of the person who will appear in the ad (face + upper body works best).
product image_url yes Clear photo of the product (preferably on neutral background, logo/text legible).
script text no Okay… first of all, ship happens. And this hat is honestly my favorite. It also comes in navy and black, so you can pick your vibe. The exact line the on-screen person will say (kept short — 1–2 sentences fit 10s comfortably).
environment text no study room, laptop in front of it Scene / context where the person is using the product (e.g. "bathroom mirror, morning routine", "coffee shop window seat").
Installs
601
GitHub Stars
3.6K
First Seen
May 18, 2026
muapi-ugc-video-factory — samuraigpt/generative-media-skills