character-design-sheet
Maintain consistent character appearance across AI-generated images using reference sheets and detailed descriptions.
- Create turnaround sheets (front, 3/4, side, back views), expression sheets (6+ emotions), outfit variations, and color palettes to document character design
- Use a 50+ word detailed description anchor reused exactly in every prompt as the most practical consistency technique for small projects
- Train character-specific LoRA models for projects requiring many images, with 10-20 reference images and trigger words for high consistency
- Includes proportion guides (realistic 7.5:1 to chibi 2-3:1), color hex documentation, and troubleshooting for common drift issues like hair color and eye changes
Install the belt CLI skill:
npx skills add belt-sh/cli
Character Design Sheet
Create consistent characters across multiple AI-generated images via inference.sh CLI.
Quick Start
Requires inference.sh CLI (
belt). Install instructions
belt login
More from inferen-sh/skills
technical-blog-writing
0ai-video-generation
Generate AI videos with Google Veo, Seedance 2.0, HappyHorse, Wan, Grok and 40+ models via inference.sh CLI. Models: Veo 3.1, Veo 3, Seedance 2.0, HappyHorse 1.0, Wan 2.5, Grok Imagine Video, OmniHuman, Fabric, HunyuanVideo. Capabilities: text-to-video, image-to-video, reference-to-video, video editing, lipsync, avatar animation, video upscaling, foley sound. Use for: social media videos, marketing content, explainer videos, product demos, AI avatars. Triggers: video generation, ai video, text to video, image to video, veo, animate image, video from image, ai animation, video generator, generate video, t2v, i2v, ai video maker, create video with ai, runway alternative, pika alternative, sora alternative, kling alternative, seedance, happyhorse
0tools-ui
0video-ad-specs
0text-to-speech
Convert text to natural speech with Inworld TTS, ElevenLabs, DIA TTS, Kokoro, Chatterbox, and more via inference.sh CLI. Models: Inworld TTS-2 (100+ languages, emotion steering), Inworld TTS 1.5 (ultra-low latency), ElevenLabs (premium, 22+ voices, 32 languages), DIA TTS (conversational), Kokoro TTS, Chatterbox, Higgs Audio, VibeVoice (podcasts). Capabilities: text-to-speech, voice cloning, multi-speaker dialogue, podcast generation, expressive speech, emotion/delivery steering, character voices. Use for: voiceovers, audiobooks, podcasts, accessibility, video narration, IVR, voice assistants, gaming characters, avatar audio. Triggers: text to speech, tts, voice generation, ai voice, speech synthesis, voice over, generate speech, ai narrator, voice cloning, text to audio, elevenlabs, eleven labs, voice ai, ai voiceover, speech generator, natural voice, inworld, inworld tts, character voice, game voice, npc voice
0twitter-thread-creation
0