The Agent Skills Directory

[EXTERNAL_DOWNLOADS]: The skill references external resources for installation, including a GitHub-hosted markdown file for CLI instructions and the 'belt-sh/cli' package from the npm registry.
[REMOTE_CODE_EXECUTION]: The skill's primary functionality involves running remote AI applications (such as Whisper and ElevenLabs Scribe) on the inference.sh platform via the 'belt app run' command.
[COMMAND_EXECUTION]: Instructs the agent to use the 'belt' CLI tool for logging in, running applications, and sampling inputs, which is necessary for the intended audio processing workflow.
[PROMPT_INJECTION]: The skill processes untrusted audio and video content from external URLs; this represents a surface for indirect prompt injection if the transcribed text is subsequently used as instructions for an AI agent without proper sanitization or boundary markers.
Ingestion points: 'audio_url' and 'video_url' parameters in 'SKILL.md'.
Boundary markers: Not specified in the provided instructions.
Capability inventory: Network and compute access via the 'belt' CLI.
Sanitization: No explicit sanitization of transcribed content is mentioned.

speech-to-text