agent-speak
Agent Speak
Use rawgenai <provider> tts to speak, rawgenai <provider> dialogue for multi-speaker audio, and rawgenai <provider> voice to manage voices. Always read the chosen provider's reference file before running commands.
Prerequisites
brew install WHQ25/tap/rawgenai
Before using a provider, read its setup guide at references/setup/ to configure credentials.
Input Sources
- Positional argument:
rawgenai <provider> tts "text" [flags] - File:
rawgenai <provider> tts --file input.txt [flags] - Stdin:
echo "text" | rawgenai <provider> tts [flags]
General Guidelines
More from whq25/rawgenai
agent-right-brain
Give agents creative abilities using `rawgenai` — speak, listen, generate images/videos/music/sound effects, create multi-speaker dialogue, and manage voices. Use this skill when the user asks to "speak", "talk", "read aloud", "transcribe", "generate an image", "create a picture", "draw", "edit an image", "generate a video", "create a video", "animate", "generate music", "create a song", "generate sound effects", "create dialogue", "design a voice", "clone a voice", or any request involving voice, audio, image, or video creation.
7agent-gen-image
Give agents image generation and editing abilities using `rawgenai` — create images from text, edit existing images, inpaint, and generate with reference images. Use this skill when the user asks to "generate an image", "create a picture", "draw", "edit an image", "inpaint", "remove background", or any request involving image creation or manipulation.
2agent-gen-video
Give agents video generation abilities using `rawgenai` — create videos from text/images, extend videos, and remix existing footage. Use this skill when the user asks to "generate a video", "create a video", "animate", "video from image", "extend video", "remix video", or any request involving video creation.
1agent-gen-audio
Give agents music and sound effect generation abilities using `rawgenai` — create music from prompts or lyrics, and generate sound effects from text descriptions. Use this skill when the user asks to "generate music", "create a song", "compose music", "make a beat", "generate sound effects", "create sfx", or any request involving music or sound effect creation.
1agent-listen
Give agents listening abilities using `rawgenai` — transcribe audio/video to text with timestamps, subtitles, and speaker diarization. Use this skill when the user asks to "transcribe", "speech to text", "convert audio to text", "generate subtitles", "diarize speakers", or any request involving audio/video transcription.
1