Pinned

  1. Recap-DataComp-1B Public

    [ICML 2025] This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3?"

    131 1

  2. MedTrinity-25M Public

    [ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine"

    Python 336 21

  3. story-adapter Public

    A Training-free Iterative Framework for Long Story Visualization

    Python 889 125

  4. MedReason Public

    MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs

    Python 159 15

  5. VLAA-Thinking Public

    SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

    Python 109 1

  6. OpenVision Public

    OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

    Python 230 14

Repositories

Showing 10 of 33 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.
