shorts-editor

SKILL.md

Shorts Editor — Raw Footage to Platform-Ready in One Description

Short-form editing has its own grammar. It is not long-form editing compressed into 60 seconds — it is a fundamentally different discipline with different rules: every second must earn its place (no slow starts, no filler, no natural pauses), visual changes must happen every 2-4 seconds (the human attention span on vertical feeds is measured in moments), the first frame must stop the scroll (a split-second decision by the viewer determines the video's fate), captions are the primary content delivery for the sound-off majority, and music is not background — it is structural (beats define cut timing, drops define emphasis, energy level defines pacing). Editing software designed for long-form content — timelines, keyframes, layer stacks, effect panels — is overkill for Shorts and yet simultaneously inadequate. Overkill because a 30-second Short does not need a 47-track timeline. Inadequate because the software does not understand short-form grammar: it does not know that captions should be word-by-word animated, that cuts should land on beats, that hooks belong in the first frame, or that vertical safe zones differ by platform. NemoVideo understands short-form grammar natively. Describe the edit and every short-form convention is applied: silence removal, attention-maintaining zoom cuts, word-by-word caption animation, beat-synced transitions, hook frame insertion, platform-safe text positioning, and duration targeting for algorithmic optimization.

Use Cases

  1. Talking-Head Polish — Raw to Viral (15-60s) — A creator records 3 minutes of talking into their phone. The content is good but the delivery is raw: pauses, ums, false starts, flat energy in places. NemoVideo: removes all silences over 0.6 seconds (tightens pacing by 30-40%%), cuts the "ums" and false starts, selects the strongest 35-second segment, applies zoom-cuts every 4 seconds (100%/115% alternating — creates visual energy from a static camera), adds word-by-word captions (white bold, accent color highlight, dark pill background), inserts hook text in the first frame, overlays lo-fi music at -22dB with speech ducking, and exports at exactly 35 seconds for the Shorts algorithm sweet spot. Unpolished phone footage becomes a professional Short.
  2. Multi-Clip Assembly — Best Moments Compilation (15-55s) — A food creator has 12 short clips from a cooking session: chopping, sizzling, plating, tasting. NemoVideo: selects the most visually appealing moment from each clip (2-4 seconds per clip), arranges by cooking workflow (prep → cook → plate → taste), applies smooth transitions synced to upbeat music beats, color grades for food content (warm saturation, enhanced oranges and greens), adds ingredient text overlays on each prep clip, and creates a 45-second cooking Short that makes viewers hungry. Twelve scattered clips become one compelling story.
  3. Repurpose Long-Form — Extract the Best Short (15-55s) — A podcaster has a 45-minute episode and needs 3 Shorts extracted from it. NemoVideo: transcribes the full episode, identifies the 3 most quotable/insightful moments (based on information density, emotional peaks, and hook potential), extracts each as a standalone clip, reframes to 9:16 vertical with speaker face tracking, adds word-by-word captions, inserts hook text per clip (generated from the clip's content), and exports all 3 as individual Shorts. Three pieces of viral-potential content from one long recording.
  4. Speed Edit — Velocity Effects for Gaming/Action (15-45s) — A gaming creator has a highlight clip that needs the velocity edit treatment: fast-forward through setup, snap to slow-mo on the kill. NemoVideo: accelerates low-action segments (3-4x), snaps to slow-mo at peak moments (0.2x), returns to normal speed between highlights, syncs the speed changes to music beat structure, applies zoom effect at each slow-mo moment, adds impact sound effects, and overlays kill counter and game context text. The "velocity edit" style that dominates gaming Shorts content.
  5. Batch Edit — Weekly Content Production (multiple) — A brand needs 7 Shorts for the week: 3 talking-head tips, 2 product showcases, 2 behind-the-scenes clips. NemoVideo: batch-processes all 7 with consistent branding (same caption style, color grade, music genre, intro/outro format) but varied editing style per content type (zoom-cuts for talking head, smooth transitions for product, handheld energy for BTS). A full week of platform-ready Shorts from one editing session.

How It Works

Step 1 — Upload Raw Footage

Single clip or multiple clips. Phone footage, camera footage, screen recording, or extracted segment from long-form content.

Step 2 — Describe the Edit

Plain language: "Remove the pauses, add captions, put music, make it 30 seconds for TikTok." Or detailed: specify exact edits, timing, styles, and effects.

Installs
7
First Seen
Apr 11, 2026