Control layer for LLM integrations that evaluates model output risks (SQL, command execution, etc.) before execution.
Updated Apr 20, 2026 - Java
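The control-layer idea above can be sketched as a pre-execution check that scans model output for high-risk patterns (destructive SQL, shell commands) and returns a verdict before anything runs. This is a minimal illustration with hypothetical names and a deliberately small deny-list, not the repository's actual implementation.

```java
import java.util.List;
import java.util.regex.Pattern;

/** Minimal sketch of a pre-execution risk guard for LLM output (hypothetical API). */
public class OutputRiskGuard {

    // Illustrative deny-patterns only; a real control layer would use a fuller policy.
    private static final List<Pattern> DENY = List.of(
        Pattern.compile("\\b(drop|truncate|delete)\\s+table\\b", Pattern.CASE_INSENSITIVE),
        Pattern.compile("\\brm\\s+-rf\\b"),
        Pattern.compile("\\bcurl\\b.*\\|\\s*(sh|bash)\\b")
    );

    public enum Verdict { ALLOW, BLOCK }

    /** Evaluate model output before execution; BLOCK if any deny-pattern matches. */
    public static Verdict evaluate(String modelOutput) {
        for (Pattern p : DENY) {
            if (p.matcher(modelOutput).find()) {
                return Verdict.BLOCK;
            }
        }
        return Verdict.ALLOW;
    }

    public static void main(String[] args) {
        System.out.println(evaluate("SELECT name FROM users;"));      // ALLOW
        System.out.println(evaluate("DROP TABLE users; -- cleanup")); // BLOCK
    }
}
```

The key design point is that the guard sits between the model and the executor, so risky output is rejected (or escalated for review) rather than run directly.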
Field research exposing how LLM safeguards collapse under polite, persistent interaction. Includes full report, metrics, session logs, and the AION conditioning protocol.
Local pre-send risk guard for Claude Code prompts with safe rewrites and audit reports.
Multimodal RAG system for generating test cases and use cases from documents using hybrid retrieval, safety guards, and LLMs.
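One common way to implement the hybrid retrieval mentioned above is reciprocal rank fusion (RRF), which merges a keyword ranking and a vector ranking without needing comparable raw scores. The sketch below assumes two pre-computed rankings; document IDs and the choice of RRF are illustrative, not taken from the repository.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Sketch: Reciprocal Rank Fusion (RRF) for combining keyword and vector rankings. */
public class RrfFusion {

    /**
     * Each document scores 1 / (k + rank) per ranking it appears in, summed
     * across rankings; k (commonly 60) damps the influence of top ranks.
     */
    public static Map<String, Double> fuse(List<List<String>> rankings, int k) {
        Map<String, Double> scores = new HashMap<>();
        for (List<String> ranking : rankings) {
            for (int rank = 0; rank < ranking.size(); rank++) {
                scores.merge(ranking.get(rank), 1.0 / (k + rank + 1), Double::sum);
            }
        }
        return scores;
    }

    public static void main(String[] args) {
        List<String> keywordHits = List.of("doc1", "doc2", "doc3"); // e.g. BM25 order
        List<String> vectorHits  = List.of("doc2", "doc3", "doc1"); // e.g. embedding order
        Map<String, Double> fused = fuse(List.of(keywordHits, vectorHits), 60);
        System.out.println(fused); // doc2 ranks highest: near the top of both lists
    }
}
```

RRF is attractive here because lexical and embedding retrievers produce scores on different scales; fusing ranks instead of scores sidesteps any normalization step.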