agent-listen
Agent Listen
Use rawgenai <provider> stt to transcribe audio and video files. Always read the chosen provider's reference file before running commands.
Prerequisites
brew install WHQ25/tap/rawgenai
Before using a provider, read its setup guide at references/setup/ to configure credentials.
General Guidelines
- On first use, ask user to pick a provider. Remember for the session.
- All output is JSON. Always show transcription results to the user.
- If a command fails, try a different provider or inform the user.
More from whq25/rawgenai
agent-right-brain
Give agents creative abilities using `rawgenai` — speak, listen, generate images/videos/music/sound effects, create multi-speaker dialogue, and manage voices. Use this skill when the user asks to "speak", "talk", "read aloud", "transcribe", "generate an image", "create a picture", "draw", "edit an image", "generate a video", "create a video", "animate", "generate music", "create a song", "generate sound effects", "create dialogue", "design a voice", "clone a voice", or any request involving voice, audio, image, or video creation.
7agent-gen-image
Give agents image generation and editing abilities using `rawgenai` — create images from text, edit existing images, inpaint, and generate with reference images. Use this skill when the user asks to "generate an image", "create a picture", "draw", "edit an image", "inpaint", "remove background", or any request involving image creation or manipulation.
2agent-speak
Give agents voice abilities using `rawgenai` — text-to-speech, multi-speaker dialogue, and voice management (design, clone, create voices). Use this skill when the user asks to "speak", "talk", "read aloud", "say this", "create dialogue", "design a voice", "clone a voice", or any request involving spoken audio output and voice creation.
1agent-gen-video
Give agents video generation abilities using `rawgenai` — create videos from text/images, extend videos, and remix existing footage. Use this skill when the user asks to "generate a video", "create a video", "animate", "video from image", "extend video", "remix video", or any request involving video creation.
1agent-gen-audio
Give agents music and sound effect generation abilities using `rawgenai` — create music from prompts or lyrics, and generate sound effects from text descriptions. Use this skill when the user asks to "generate music", "create a song", "compose music", "make a beat", "generate sound effects", "create sfx", or any request involving music or sound effect creation.
1