Human-in-the-loop adversarial workflows for high-stakes research audits — from ChatGPT–Gemini duels to 4-model MAD (multi-agent debate).
Updated Mar 19, 2026
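The duel/MAD workflow named above can be sketched as a round-robin debate loop. This is a purely illustrative skeleton with stub agents standing in for real models — no ChatGPT/Gemini API calls are assumed, and all names here (`debate`, `proposer`, `critic`) are hypothetical:

```python
from typing import Callable

# An agent maps (question, transcript-so-far) to a reply.
Agent = Callable[[str, list[str]], str]

def debate(question: str, agents: dict[str, Agent], rounds: int = 2) -> list[str]:
    """Round-robin multi-agent debate: each agent sees the full transcript.

    A human auditor would review the returned transcript between rounds
    in a human-in-the-loop setup.
    """
    transcript: list[str] = []
    for r in range(rounds):
        for name, agent in agents.items():
            reply = agent(question, transcript)
            transcript.append(f"[round {r + 1}] {name}: {reply}")
    return transcript

# Stub agents: one proposes, one critiques what came before.
proposer: Agent = lambda q, t: "Proposed answer to: " + q
critic: Agent = lambda q, t: f"Critique of {len(t)} prior turn(s)"

log = debate("Is 0.999... equal to 1?", {"A": proposer, "B": critic}, rounds=1)
```

A 4-model MAD run under this sketch is just a four-entry `agents` dict and more rounds; the human stays in the loop by inspecting `log` before authorizing the next round.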
Evolved ruozhiba questions — adversarial logic puzzles designed to challenge AI systems.
Scientific QA robustness evaluation pipeline for evidence-missing RAG scenarios on PeerQA, with EM/F1 reliability analysis.
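For reference, the EM/F1 metrics such a pipeline reports are typically SQuAD-style token-level scores. A minimal self-contained sketch (function names are illustrative, not this repository's API):

```python
import re
import string
from collections import Counter

def normalize(text: str) -> str:
    """Lowercase, drop punctuation and articles, collapse whitespace (SQuAD-style)."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())

def exact_match(pred: str, gold: str) -> float:
    """1.0 iff the normalized prediction equals the normalized gold answer."""
    return float(normalize(pred) == normalize(gold))

def f1_score(pred: str, gold: str) -> float:
    """Token-overlap F1 between prediction and gold answer."""
    pred_toks = normalize(pred).split()
    gold_toks = normalize(gold).split()
    overlap = sum((Counter(pred_toks) & Counter(gold_toks)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_toks)
    recall = overlap / len(gold_toks)
    return 2 * precision * recall / (precision + recall)
```

In an evidence-missing RAG setting, the reliability question is whether these scores stay meaningful when the gold evidence is absent from the retrieved context — e.g. whether a model is rewarded for abstaining versus guessing.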
RevealVLLMSafetyEval is a comprehensive pipeline for evaluating Vision-Language Models (VLMs) on their compliance with harm-related policies. It automates the creation of adversarial multi-turn datasets and the evaluation of model responses, supporting responsible AI development and red-teaming efforts.
Multi-agent deep research engine with SIA (Semantic Intelligence Architecture) — thermodynamic entropy control, adversarial critique, multi-reactor swarm orchestration