safety-filter-bypass
Safety Filter Bypass Testing
Test AI system safety filters and content moderation to identify weaknesses in protective mechanisms.
Quick Reference
Skill: safety-filter-bypass
Agent: 02-prompt-injection-specialist
OWASP: LLM01 (Prompt Injection), LLM05 (Improper Output Handling)
Risk Level: HIGH
Filter Type Analysis
┌─────────────────┬───────────────┬─────────────┬──────────────┐
│ Filter Type     │ Bypass Diff.  │ Latency     │ Coverage     │
├─────────────────┼───────────────┼─────────────┼──────────────┤