pii-sanitizer
PII Sanitizer
Purpose and Intent
The pii-sanitizer is a data protection tool designed to identify and mask Personally Identifiable Information (PII) from datasets, logs, or communications to comply with privacy regulations like GDPR and CCPA.
When to Use
- Log Scrubbing: Clean application logs before sending them to centralized logging platforms (e.g., ELK, Datadog).
- Dataset Preparation: Sanitize production data before using it in staging or training environments.
- Customer Support: Mask sensitive info in support tickets before sharing them with engineering teams.
When NOT to Use
- Encryption: This is a redaction tool, not an encryption tool. It is for removing data, not securing it for later retrieval.
- Structured Database Migration: While it handles some structure, specialized ETL tools are better for massive DB sanitization.
Error Conditions and Edge Cases
- False Positives: Strings that resemble PII (like internal serial numbers) might be accidentally redacted.
- Ambiguous Context: "Rose" could be a name (PII) or a flower; the tool may err on the side of caution.
- Encoding Issues: Ensure input text is UTF-8 to avoid detection failures on special characters.
More from jorgealves/agent_skills
python-security-scanner
Detect common Python vulnerabilities such as SQL injection, unsafe deserialization, and hardcoded secrets. Use as part of a secure SDLC for Python projects.
187gdpr-ccpa-privacy-auditor
Audits web applications to ensure declared privacy policies match actual technical data collection practices. Use to identify discrepancies in cookie usage, tracking scripts, and user data handling.
150prompt-injection-scanner
Audits agent skill instructions and system prompts for vulnerabilities to prompt hijacking and indirect injection. Use when designing new agent skills or before deploying agents to public environments where users provide untrusted input.
147documentation-generator
Analyzes source code to automatically generate technical documentation and architecture diagrams. Use to maintain up-to-date API references and onboarding materials for engineering teams.
127hipaa-compliance-guard
Audits HealthTech applications for HIPAA technical safeguards like encryption and audit logging. Use when reviewing healthcare infrastructure or ensuring PHI is handled according to legal security standards.
127python-data-pipeline-designer
Design ETL workflows with data validation using tools like Pandas, Dask, or PySpark. Use when building robust data processing systems in Python.
123