extract-moves-from-video

Installation
SKILL.md

Extract Moves From Video

Overview

This skill provides a systematic approach for extracting text commands from video recordings. Common use cases include extracting gameplay commands from text adventure games (like Zork), capturing terminal commands from screen recordings, or transcribing any typed input visible in video content.

Workflow

Step 1: Analyze the Source Video

Before processing, understand the video characteristics:

  1. Determine video properties: Resolution, duration, frame rate
  2. Identify text regions: Where commands appear on screen (e.g., after a prompt character like >)
  3. Assess text style: Font type, color, background contrast (terminal text on dark backgrounds requires specific handling)
  4. Check for audio: Determine if audio transcription could supplement OCR (verify audio contains relevant content before installing large packages like Whisper)
  5. Understand typing patterns: Estimate how frequently new commands appear to inform frame sampling rate

Step 2: Download and Prepare Video

Related skills

More from letta-ai/skills

Installs
36
Repository
letta-ai/skills
GitHub Stars
97
First Seen
Jan 24, 2026