flux-image
Text-to-image and image-to-image generation using FLUX models via inference.sh CLI.
- Supports multiple FLUX variants: Dev LoRA (highest quality), Klein LoRA (fastest), and Pruna-optimized versions ranging from 4B to full-size models
- Enables LoRA fine-tuning for custom style adaptation and image-to-image transformations alongside standard text-to-image generation
- Requires inference.sh CLI (
infsh) installation and authentication; models are invoked as named apps with JSON input payloads - Pricing ranges from $0.0001 per image (Klein 4B) to higher costs for production-quality Dev LoRA outputs
Install the belt CLI skill:
npx skills add belt-sh/cli
FLUX Image Generation
Generate images with FLUX models via inference.sh CLI.

Quick Start
Requires inference.sh CLI (
belt). Install instructions
belt login
belt app run falai/flux-dev-lora --input '{"prompt": "a futuristic city at night"}'
More from inferen-sh/skills
text-to-speech
Convert text to natural speech with Inworld TTS, ElevenLabs, DIA TTS, Kokoro, Chatterbox, and more via inference.sh CLI. Models: Inworld TTS-2 (100+ languages, emotion steering), Inworld TTS 1.5 (ultra-low latency), ElevenLabs (premium, 22+ voices, 32 languages), DIA TTS (conversational), Kokoro TTS, Chatterbox, Higgs Audio, VibeVoice (podcasts). Capabilities: text-to-speech, voice cloning, multi-speaker dialogue, podcast generation, expressive speech, emotion/delivery steering, character voices. Use for: voiceovers, audiobooks, podcasts, accessibility, video narration, IVR, voice assistants, gaming characters, avatar audio. Triggers: text to speech, tts, voice generation, ai voice, speech synthesis, voice over, generate speech, ai narrator, voice cloning, text to audio, elevenlabs, eleven labs, voice ai, ai voiceover, speech generator, natural voice, inworld, inworld tts, character voice, game voice, npc voice
0twitter-thread-creation
0javascript-sdk
0elevenlabs-stt
0customer-persona
0ai-rag-pipeline
0