Skip to content
View vanzll's full-sized avatar

Block or report vanzll

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
vanzll/README.md

Hi, I'm Carlos Zhenglin Wan 👋

Ph.D Student in Computer Science at National University of Singapore (NUS) focused on RL and LLM Agents.

Academic Homepage: vanzll.github.io


Research Interests

  • Reinforcement Learning for LLM Agents
  • LLM Agentic Systems & Tool Use

Experiences

  • Intern (Remote) @ Hong Kong Generative AI Research & Development Center (HKGAI), HKUST
  • Research Assistant @ CCDS, NTU
  • Research Intern @ Centre for Frontier AI Research, A*STAR

Pinned Loading

  1. Uni-RLHF-Platform Uni-RLHF-Platform Public

    Forked from pickxiguapi/Uni-RLHF-Platform

    Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)

    Python

  2. EBC EBC Public

    [ICML'25] Diversifying Policy Behaviors via Extrinsic Behavioral Curiosity

    Python 14

  3. Johnny221B/OSCAR Johnny221B/OSCAR Public

    This is official github for our paper

    Python 3

  4. FM_IRL FM_IRL Public

    Official pytorch Implementation of FM-IRL.

    Python 4

  5. acodercat/cave-agent acodercat/cave-agent Public

    Stateful runtime management for LLM agents—inject, manipulate, and retrieve Python objects across turns.

    Python 76 2

  6. bennidict23/GoRL bennidict23/GoRL Public

    An Algorithm-Agnostic Framework for Online Reinforcement Learning with Generative Policies

    Python 21