If you discover a security vulnerability in SimpleAudit, please report it responsibly.
If you find a security issue in the SimpleAudit codebase:
-
Do not open a public GitHub issue
-
Contact the maintainers privately through their affiliated organizations:
- Simula Research Laboratory: https://www.simula.no/contact
- Or open a private security advisory on GitHub
-
Include in your report:
- Description of the vulnerability
- Steps to reproduce
- Potential impact
- Suggested fix (if any)
We will acknowledge receipt within 48 hours and provide a detailed response within 7 days.
If you use SimpleAudit to discover vulnerabilities in AI systems:
- Follow responsible disclosure practices
- Report findings to the affected system's owners first
- Allow reasonable time for fixes before public disclosure
- Do not exploit vulnerabilities beyond what's necessary to demonstrate them
| Version | Supported |
|---|---|
| 0.1.x | ✅ |
- Never commit API keys to version control
- Use environment variables (
ANTHROPIC_API_KEY,OPENAI_API_KEY, etc.) - Rotate keys if accidentally exposed
- Audit results may contain sensitive information about AI system vulnerabilities
- Store results securely and limit access
- Redact sensitive content before sharing reports externally
- Review custom scenarios for unintended harmful content
- Do not include real PII in test scenarios
- Be cautious when testing scenarios that could generate harmful outputs
- For sensitive testing, consider using local models (Ollama, HuggingFace)
- Be aware that cloud API providers may log requests
- Review provider data retention policies for compliance needs
SimpleAudit is a testing tool, not a comprehensive security solution:
- It tests for known vulnerability patterns, not unknown ones
- Passing all scenarios does not guarantee an AI system is "safe"
- Results require human interpretation and context
- The tool itself could theoretically be misused for malicious purposes
Security updates will be released as patch versions (e.g., 0.1.1, 0.1.2).
Subscribe to GitHub releases to be notified of security updates.
Last updated: December 2025