Demo: https://www.youtube.com/watch?v=hWhCMGOmW5Q&t=3s

spider search : multimodal classifier is a claim-centric, multimodal (text + image) system that correlates misinformation across memes, articles, and social-media-style claims. It uses Qdrant as the primary vector memory and retrieval engine, providing evidence-grounded responses, traceability, and long-term memory updates.
Meme-based misinformation spreads fast, often with text paraphrases and image variants. spider search : multimodal classifier helps analysts and journalists surface:
- Related claim clusters (paraphrases/variants)
- Similar meme variants (image similarity)
- Supporting/contradicting evidence snippets
- A traceable retrieval path (scores, filters, IDs)
spider search : multimodal classifier is a correlation and evidence system — not a truth oracle. It reports what the indexed corpus supports and flags insufficient/conflicting evidence when needed.
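The claim-cluster idea above boils down to nearest-neighbour search over claim embeddings. As a minimal sketch (plain cosine similarity over toy 3-dim vectors standing in for the project's real text encoder and for Qdrant's search):

```python
import math

def cosine(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def cluster_claims(query_vec, claim_vecs, threshold=0.8):
    """Return (index, score) pairs for claims similar to the query,
    highest score first -- a stand-in for Qdrant's vector search."""
    scored = [(i, cosine(query_vec, v)) for i, v in enumerate(claim_vecs)]
    return sorted([(i, s) for i, s in scored if s >= threshold],
                  key=lambda t: t[1], reverse=True)

# Toy "embeddings": two paraphrases point the same way, one unrelated claim does not.
query = [1.0, 0.0, 0.0]
claims = [[0.9, 0.1, 0.0],   # paraphrase
          [0.0, 1.0, 0.0],   # unrelated
          [1.0, 0.05, 0.0]]  # near-duplicate
matches = cluster_claims(query, claims)
print([i for i, _ in matches])  # [2, 0] -- the unrelated claim is excluded
```

In the real pipeline the vectors come from the text encoder and the search runs inside Qdrant; the threshold here is an illustrative assumption.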
```
[Streamlit UI]
 |-- Analyze Meme (Image Upload)
 |-- Analyze Claim/Text
 |-- Ingest Corpus
        | OCR / Embeddings / Claim Extraction
        v
[Ingestion Pipeline] -----> [Qdrant Collections]
        |                    - claims (text_dense)
        |                    - evidence_snippets (snippet_dense)
        |                    - media_memes (image_dense + ocr_text_dense)
        v
[SQLite Audit + Mappings]
        |
        v
[Retrieval + Trace Panel]
```
See docs/architecture_diagram.txt for the ASCII diagram and docs/report_outline.md for the report outline.
- Multimodal retrieval: CLIP image embeddings + OCR text embeddings for memes.
- Qdrant-centric memory: canonical claims and evidence stored in Qdrant collections with metadata filtering.
- Long-term memory: claim canonicalization, reinforcement, contradiction updates, decay, and audit logs in SQLite.
- Evidence-based responses: stance-grouped evidence snippets with visible trace panel.
- LLM deduction (optional): Ollama generates a short claim verdict using Qdrant-retrieved evidence.
- Agentic memory maintenance: autonomous claim evolution tracking with trend, contradiction, and alert signals.
```
app/           # Streamlit UI
core/          # config, schemas, utils
models/        # text/image embedding, OCR, rule-based extraction
qdrant_store/  # Qdrant client + CRUD helpers
ingestion/     # meme/text ingestion pipelines
memory/        # canonicalization, confidence, decay, events
agents/        # agentic monitoring + orchestration
storage/       # SQLite + file storage helpers
docs/          # report outline + architecture diagram
```
```bash
docker run -p 6333:6333 -p 6334:6334 -v $(pwd)/qdrant_storage:/qdrant/storage qdrant/qdrant
```

Health check:

```bash
curl http://localhost:6333/collections
```

Create a virtual environment and install dependencies:

```bash
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```

Install Tesseract OCR:

- Linux:

  ```bash
  sudo apt-get install tesseract-ocr
  ```

- Windows: install from the official installer and set `TESSERACT_CMD` in `.env`.
Copy `.env.example` to `.env` and adjust if needed:

```bash
cp .env.example .env
```

By default `USE_OLLAMA=true` for stance classification. Set it to `false` to fall back to the local NLI + rule-based classifier. Qdrant remains the primary retrieval/memory engine in both modes.
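A short `.env` might look like the following; only `USE_OLLAMA` and `TESSERACT_CMD` are named in this README, so treat any other keys in your own file as project-specific:

```
USE_OLLAMA=true
# Windows only: path to the Tesseract binary
TESSERACT_CMD=C:\Program Files\Tesseract-OCR\tesseract.exe
```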
Install Ollama (Ubuntu snap):

```bash
sudo snap install ollama
```

Start the service and pull a model:

```bash
ollama serve
ollama pull llama3
```

Verify it is running:

```bash
curl http://localhost:11434/api/tags
```

Launch the app:

```bash
streamlit run streamlit_app.py
```

Keep the Qdrant Docker container running while the app is in use. On first launch, open Ingest Corpus and add your memes and text files before running analysis.
spider search : multimodal classifier includes an agent that monitors new ingested sources, updates claim trend signals, computes contradiction ratios, tracks meme variants, and logs its reasoning into SQLite events.
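The contradiction-ratio signal the agent computes can be sketched as simple counting over stance-labelled evidence. The function names and the alert threshold below are illustrative assumptions, not the project's actual code:

```python
def contradiction_ratio(stances):
    """Fraction of stance-labelled evidence that contradicts the claim.
    `stances` is a list of labels such as 'support', 'contradict', 'neutral'."""
    relevant = [s for s in stances if s in ("support", "contradict")]
    if not relevant:
        return 0.0  # no usable evidence yet
    return sum(1 for s in relevant if s == "contradict") / len(relevant)

def alert_level(ratio, alert_threshold=0.4):
    """Map a contradiction ratio to a coarse alert signal."""
    return "conflicting" if ratio >= alert_threshold else "stable"

stances = ["support", "contradict", "support", "neutral", "contradict"]
r = contradiction_ratio(stances)
print(round(r, 2), alert_level(r))  # 0.5 conflicting
```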
- Streamlit UI: open Agent Insights and click Run Agent Now.
- Event-driven: ingestion automatically triggers the agent after meme/text ingestion.
The `agent_state` table is created automatically on first run. The `events` table includes `agent_name` for agent-specific logs (auto-migrated).
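A minimal sketch of that event log using the standard-library `sqlite3` module; the column set here is an assumption for illustration and may differ from the project's actual schema:

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # the project uses a file-backed database
conn.execute("""
    CREATE TABLE IF NOT EXISTS events (
        id INTEGER PRIMARY KEY AUTOINCREMENT,
        agent_name TEXT,           -- which agent produced the entry
        event_type TEXT NOT NULL,  -- e.g. 'trend_update', 'contradiction_alert'
        detail TEXT,
        created_at TEXT DEFAULT CURRENT_TIMESTAMP
    )
""")
conn.execute(
    "INSERT INTO events (agent_name, event_type, detail) VALUES (?, ?, ?)",
    ("memory_monitor", "contradiction_alert", "claim ratio crossed threshold"),
)
rows = conn.execute(
    "SELECT agent_name, event_type FROM events WHERE agent_name = ?",
    ("memory_monitor",),
).fetchall()
print(rows)  # [('memory_monitor', 'contradiction_alert')]
```

Filtering on `agent_name` is what lets the UI show per-agent reasoning trails.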
1. Ingest Corpus
   - Upload 10+ meme images and 10+ text documents (TXT files).
   - The system extracts claims, embeds them, and stores them in Qdrant.
2. Analyze Meme
   - Upload a meme and view:
     - OCR text
     - Claim candidates
     - Similar meme variants (image similarity)
     - Matched claims + evidence snippets
     - Trace panel (IDs + scores)
3. Analyze Claim/Text
   - Enter a claim sentence.
   - View matched claims + evidence, plus the Ollama deduction panel if enabled.
4. Run Decay (optional)
   - An admin button triggers memory decay and logs updates.
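The decay step can be sketched as exponential down-weighting of a claim's confidence the longer it goes without reinforcement; the half-life below is an illustrative assumption:

```python
def decayed_confidence(confidence, days_since_seen, half_life_days=30.0):
    """Exponentially decay a claim's confidence as it goes unreinforced."""
    factor = 0.5 ** (days_since_seen / half_life_days)
    return confidence * factor

# A claim last reinforced 30 days ago loses half its confidence.
print(round(decayed_confidence(0.8, 30.0), 2))  # 0.4
```

Ingesting fresh supporting evidence would reset the clock and restore confidence through the reinforcement path.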
- `claims`
  - Vector: `text_dense` (384 or 768 dim, cosine)
  - Payload: canonical claim, counts, timestamps, confidence, status, links
- `evidence_snippets`
  - Vector: `snippet_dense` (text embedding)
  - Payload: snippet text, stance, source metadata
- `media_memes`
  - Vectors: `image_dense` (CLIP) + `ocr_text_dense`
  - Payload: OCR text, pHash, linked claim IDs
- If evidence is insufficient or conflicting, the UI signals insufficient/conflicting evidence.
- Retrieval traces show collection, point IDs, similarity scores, and payload previews.
- OCR errors can lead to imperfect claim extraction.
- Meme templates might bias similarity results.
- No personal data should be ingested without consent.
- spider search : multimodal classifier does not assert truth — it surfaces evidence and uncertainties.
- No paid APIs are used.
- Optional enhancement: enable Ollama locally by setting `USE_OLLAMA=true`.