adversarial-audit

Here are 2 public repositories matching this topic...

priyanshuphenomenal007 / gemini-meta-cognitive-audit

Independent, forensic-style audit of the publicly available Gemini 2.5 Pro model by Google. Examines meta-cognitive reasoning failures and self-evaluation behavior. Not affiliated with or endorsed by Google.

research ai-safety interpretability ai-alignment noncommercial meta-cognition llm adversarial-audit reasoning-failure priyanshu-research

Updated Oct 7, 2025
PowerShell

priyanshuphenomenal007 / AI-Model-Failure-Analysis-Sonnet4

Star

A systematic investigation into the state volatility, memory contradictions, and logical inconsistencies of Anthropic's Claude Sonnet 4 large language model.

research ai-safety reasoning interpretability llm confabulation adversarial-audit

Updated Oct 7, 2025

Improve this page

Add a description, image, and links to the adversarial-audit topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the adversarial-audit topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly