🔬 Official implementation of ExPO-HM: Learning to Explain-then-Detect for Hateful Meme Detection (ICLR 2026). Novel multimodal RL approach for interpretable and explainable content moderation.
-
Updated
Mar 1, 2026 - Python
🔬 Official implementation of ExPO-HM: Learning to Explain-then-Detect for Hateful Meme Detection (ICLR 2026). Novel multimodal RL approach for interpretable and explainable content moderation.
Visualize and compare reinforcement learning algorithms for LLM training with interactive formulas, pipeline flow, and algorithm metrics.
Add a description, image, and links to the multimodal-rl topic page so that developers can more easily learn about it.
To associate your repository with the multimodal-rl topic, visit your repo's landing page and select "manage topics."