multimodal-rl

Here are 2 public repositories matching this topic...

JingbiaoMei / ExPO-HM

🔬 Official implementation of ExPO-HM: Learning to Explain-then-Detect for Hateful Meme Detection (ICLR 2026). Novel multimodal RL approach for interpretable and explainable content moderation.

multimodal-learning explainable-ai content-moderation vision-language-models preference-optimization grpo iclr-2026 hateful-meme-detection multimodal-rl

Updated Mar 1, 2026
Python

squamulenudestatue531 / rl-explainer

Star

Visualize and compare reinforcement learning algorithms for LLM training with interactive formulas, pipeline flow, and algorithm metrics.

reinforcement-learning rl sac multimodal-learning explainable-ai content-moderation multi-agent-reinforcement-learning graph-networks soft-actor-critic marl gnn cooperative-environments layerwiserelevancepropogation vision-language-models preference-optimization iclr-2026 multimodal-rl

Updated Mar 30, 2026

Improve this page

Add a description, image, and links to the multimodal-rl topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multimodal-rl topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly