youtube-thumbnail-design
AI-generated YouTube thumbnails optimized for mobile preview and click-through rates.
- Requires 1280×720px minimum (1920×1080px recommended) with high contrast color pairs and max 3 colors per thumbnail
- Includes the 120px mobile test: thumbnail must clearly show mood, subject, and readable text when viewed at that width
- Safe zone guidelines prevent critical elements from being obscured by video duration timestamps (bottom-right) and chapter markers (bottom-left)
- Face expressions significantly impact CTR; surprise and curiosity outperform neutral, with faces filling 30–50% of frame and eyes directed toward text or subject
- Provides content-type patterns (tutorial, before/after, review, listicle) and A/B testing strategies for single-variable optimization
Install the belt CLI skill:
npx skills add belt-sh/cli
YouTube Thumbnail Design
Create high-CTR YouTube thumbnails with AI image generation via inference.sh CLI.
Quick Start
Requires inference.sh CLI (
belt). Install instructions
belt login
More from inferen-sh/skills
nano-banana-2
Generate images with Google Gemini 3.1 Flash Image Preview (Nano Banana 2) via inference.sh CLI. Capabilities: text-to-image, image editing, multi-image input (up to 14 images), Google Search grounding. Triggers: nano banana 2, nanobanana 2, gemini 3.1 flash image, gemini 3 1 flash image preview, google image generation
0press-release-writing
Press release writing in AP style with inverted pyramid structure. Covers formatting, datelines, quotes, boilerplates, and fact-checking. Use for: product launches, funding announcements, partnerships, company news, events. Triggers: press release, pr writing, media release, news release, announcement, product launch announcement, funding announcement, company news, media advisory, ap style, press statement, news wire
0speech-to-text
0ai-marketing-videos
Create AI marketing videos for ads, promos, product launches, and brand content. Models: Veo, Seedance, Wan, FLUX for visuals, Kokoro for voiceover. Types: product demos, testimonials, explainers, social ads, brand videos. Use for: Facebook ads, YouTube ads, product launches, brand awareness. Triggers: marketing video, ad video, promo video, commercial, brand video, product video, explainer video, ad creative, video ad, facebook ad video, youtube ad, instagram ad, tiktok ad, promotional video, launch video
0text-to-speech
Convert text to natural speech with Inworld TTS, ElevenLabs, DIA TTS, Kokoro, Chatterbox, and more via inference.sh CLI. Models: Inworld TTS-2 (100+ languages, emotion steering), Inworld TTS 1.5 (ultra-low latency), ElevenLabs (premium, 22+ voices, 32 languages), DIA TTS (conversational), Kokoro TTS, Chatterbox, Higgs Audio, VibeVoice (podcasts). Capabilities: text-to-speech, voice cloning, multi-speaker dialogue, podcast generation, expressive speech, emotion/delivery steering, character voices. Use for: voiceovers, audiobooks, podcasts, accessibility, video narration, IVR, voice assistants, gaming characters, avatar audio. Triggers: text to speech, tts, voice generation, ai voice, speech synthesis, voice over, generate speech, ai narrator, voice cloning, text to audio, elevenlabs, eleven labs, voice ai, ai voiceover, speech generator, natural voice, inworld, inworld tts, character voice, game voice, npc voice
0competitor-teardown
0