Skip to content
View Siyou-Li's full-sized avatar
🧨
🧨
  • Queen Mary University of London
  • London

Highlights

  • Pro

Block or report Siyou-Li

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Siyou-Li/README.md

👋 Hi, I'm a Ph.D. student at the Computational Linguistics Lab at Queen Mary University of London (QMUL), focusing on Multimodal Large Language Models (MLLMs).

My research explores innovative ways to enhance interactions between language and Image/Vidio/Audio, aiming to advance the capabilities of AI in understanding and generating multimodal content.

LinkedinZhihu

Pinned Loading

  1. u2Tokenizer u2Tokenizer Public

    a multiscale multimodal large language models for radiology report generation (RRG) tasks

    Python 276 21

  2. QTSplus QTSplus Public

    Query-aware Token Selector (QTSplus), a lightweight yet powerful visual token selection module that serves as an information gate between the vision encoder and LLMs.

    Python 129 9