PsychAgent: An Experience-Driven Lifelong Learning Agent for Self-Evolving Psychological Counselor
Chinese README: README_CN.md
PsychAgent is a research codebase for multi-session AI psychological counseling. It focuses on longitudinal interaction: carrying memory across sessions, retrieving explicit skills during counseling, and using reward-guided rollouts to select stronger trajectories.
Paper: PDF | arXiv 2604.00931
Model: ecnu-icalk/PsychAgent-Qwen3-32B
This repository is the public research release of PsychAgent. It includes runnable pipelines for:
- multi-session dialogue generation
- evaluation on counselor-side and client-side metrics
- reward-driven best-of-n rollout selection
- prompts, configs, paper assets, and minimal example data
The current release is enough to run the public generation, evaluation, and RFT workflows. Full paper-scale training assets and the complete post-session skill evolution pipeline are not included in this snapshot.
PsychAgent is organized around three practical components:
- Memory and planning keep cross-session continuity so later sessions build on earlier ones instead of restarting from scratch.
- Skill retrieval surfaces explicit counseling skills during interaction.
- Reward-guided rollouts generate multiple candidate trajectories and keep the better ones for downstream selection and internalization.
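The reward-guided rollout idea above boils down to a best-of-n loop: sample several candidate trajectories, score each with a reward function, and keep the highest-scoring one. A minimal self-contained sketch of that pattern — the `generate` and `reward` callables here are hypothetical stand-ins, not the repository's actual API:

```python
import random
from typing import Callable, List, Tuple

def best_of_n(
    generate: Callable[[], str],     # hypothetical candidate generator
    reward: Callable[[str], float],  # hypothetical reward scorer
    n: int = 4,
) -> Tuple[str, float]:
    """Sample n candidate trajectories and keep the highest-reward one."""
    candidates: List[str] = [generate() for _ in range(n)]
    scored = [(c, reward(c)) for c in candidates]
    return max(scored, key=lambda pair: pair[1])

# Toy demo: candidates are random drafts, and the toy reward favors
# longer (more elaborated) responses.
random.seed(0)
drafts = ["ok", "tell me more", "it sounds like this has been weighing on you"]
pick, score = best_of_n(lambda: random.choice(drafts), reward=len, n=8)
```

In the real pipeline the generator is the counselor model and the reward comes from an evaluation backend; the selection logic stays the same.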
In the codebase, these ideas map to:
- `src/sample/` for multi-session generation
- `src/eval/` for evaluation and reward-related metrics
- `src/rft/` for best-of-n rollout and reward selection
The most important directories are:
- `paper/`: paper PDF and figure assets used in the README
- `configs/`: baseline, dataset, eval, and RFT configuration files
- `assets/profiles/`: bundled profile assets used by `sample` and `rft`
- `data/eval/`: native evaluation examples
- `prompts/`: public prompts, client prompts, PsychAgent prompts, and eval prompts
- `src/sample/`: multi-session generation pipeline
- `src/eval/`: evaluation orchestration and metric implementations
- `src/rft/`: rollout generation and reward-based selection
- `src/web/`: web workspace for demo and local interaction
- `src/shared/`: shared YAML and file utilities
If you are new to the repository, start with:
- `src/README.md` for the main code entry points
- `configs/README.md` for configuration responsibilities
PsychAgent does not ship with a pinned environment file yet. The following setup is a good starting point for the current public release:
```bash
python -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install openai httpx jinja2 pydantic PyYAML aiolimiter tenacity pytest
```

- Python 3.10+ is recommended.
- Install `torch` only if you need embedding-based skill library loading or backfilling.
- The bundled configs point to internal OpenAI-compatible endpoints. Replace them with your own service settings before running.
- The released checkpoint is available on Hugging Face: ecnu-icalk/PsychAgent-Qwen3-32B.
The default examples rely on a few environment variables:
- `SGLANG_API_KEY`: counselor backend key used by the `sample` and `rft` examples
- `API_KEY`: client simulator key for `sample`, or client/reward key for `rft`
- `CHAT_API_KEY`: fallback key used by `eval`, and optionally by reward evaluation
- `CHAT_API_BASE`: base URL used by `eval` when it is not set in config or CLI
- `CHAT_MODEL_NAME`: optional eval model override
- `PSYCHAGENT_EMBEDDING_API_KEY`: required by the current skill retrieval setup
```bash
export SGLANG_API_KEY=your_key_here
export API_KEY=your_client_simulator_key_here
python -m src.sample \
  --mode psychagent \
  --baseline configs/baselines/psychagent_sglang_local.yaml \
  --runtime configs/runtime/psychagent_sglang_local.yaml \
  --dataset configs/datasets/profiles_sample.yaml \
  --strict-config
```

Outputs are written to:

```
sample_outputs/before_rft/<modality>/<case_id>/
```
Each case directory typically contains `course.json` and `session_*.json`.
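The output layout can be inspected with a few lines of Python. The sketch below builds a throwaway tree in the same shape (`before_rft/<modality>/<case_id>/`) and lists one case's session files in order; it assumes only the layout described above, not any repository code:

```python
import json
import tempfile
from pathlib import Path

def list_sessions(case_dir: Path) -> list[str]:
    """Return a case directory's session_*.json file names in name order."""
    return sorted(p.name for p in case_dir.glob("session_*.json"))

# Build a throwaway tree mirroring sample_outputs/before_rft/<modality>/<case_id>/.
root = Path(tempfile.mkdtemp()) / "before_rft" / "cbt" / "case_001"
root.mkdir(parents=True)
(root / "course.json").write_text(json.dumps({"case_id": "case_001"}))
for i in (2, 1, 3):
    (root / f"session_{i}.json").write_text("{}")

sessions = list_sessions(root)
# sessions -> ['session_1.json', 'session_2.json', 'session_3.json']
```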
To evaluate the built-in eval examples:
```bash
export CHAT_API_KEY=your_eval_key_here
export CHAT_API_BASE=https://your-openai-compatible-endpoint/v1
python -m src.eval \
  --config configs/runtime/eval_default.yaml \
  --input-format auto \
  --modalities bt,cbt,het,pdt,pmt
```

To evaluate outputs generated by `sample`:
```bash
python -m src.eval \
  --config configs/runtime/eval_default.yaml \
  --input-format sample \
  --data-root sample_outputs/before_rft \
  --output-root data/eval_outputs_from_sample \
  --modalities cbt
```

To run reward-guided rollout selection with `rft`:

```bash
export SGLANG_API_KEY=your_key_here
export API_KEY=your_client_or_reward_key_here
python -m src.rft \
  --baseline configs/baselines/psychagent_sglang_local.yaml \
  --runtime configs/runtime/psychagent_sglang_local.yaml \
  --dataset configs/datasets/psycheval_bt_cbt_het_pdt_pmt.yaml \
  --rft-config configs/runtime/rft_default.yaml \
  --save_dir rft_outputs \
  --strict-config
```

To run `rft` on the bundled profile assets, switch `--dataset` to `configs/datasets/profiles_rft.yaml`.
If you want to experience the multi-session counseling flow in a browser, use the Web workspace under src/web/. For the full usage guide, see src/web/README.md.
The recommended startup order is:
- deploy the counselor model service
- start the web backend
- start the frontend
You can launch the counselor model service with sglang like this:
```bash
nohup python -m sglang.launch_server \
  --model-path /path/to/psychagent-checkpoint \
  --trust-remote-code \
  --port 30000 \
  --tp 8 \
  --host 0.0.0.0 \
  > /path/to/logs/sglang_server.log 2>&1 &
```

Update `base_url` in `configs/baselines/psychagent_sglang_local.yaml` so it matches your model service endpoint.
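For the launch command above, the matching edit might look like the fragment below. Only `base_url` is confirmed by this README; the surrounding key names and the `/v1` suffix are hypothetical assumptions for a typical OpenAI-compatible setup — check the actual file for the real structure:

```yaml
# configs/baselines/psychagent_sglang_local.yaml -- illustrative fragment only.
# Only base_url is confirmed by this README; other keys are hypothetical.
counselor:
  base_url: http://localhost:30000/v1   # must match the sglang --host/--port above
  api_key_env: SGLANG_API_KEY
```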
Then create a minimal .env.local in the project root:
```
SGLANG_API_KEY=your-sglang-key
PSYCHAGENT_EMBEDDING_API_KEY=your-embedding-key
BACKEND_PORT=8000
FRONTEND_PORT=5173
BACKEND_HOST=localhost
```

From the project root, start:

```bash
./run_backend.sh
./run_frontend.sh
```

Default endpoints:
- Frontend: http://localhost:5173
- Backend: http://localhost:8000
- Health: http://localhost:8000/health
Typical usage flow:
- sign in or register
- choose a counseling school
- create a course
- start the session
- close the current session and continue to the next
Example UI screenshots:
The three main workflows use different config combinations:
- `sample`: baseline + runtime + dataset
- `eval`: runtime
- `rft`: baseline + runtime + dataset + rft-config
Use configs/README.md when you need field-level detail, such as:
- where to change model endpoints and API keys
- how to change output language, concurrency, or resume behavior
- how to switch datasets, splits, and modality coverage
- how to configure reward endpoints and rollout counts
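As one concrete illustration of the kinds of fields such a config might carry, the fragment below sketches a reward/rollout section. Every key name here is hypothetical — none are confirmed by this README — so treat it as a shape to look for, and consult `configs/README.md` for the real field names:

```yaml
# Hypothetical RFT runtime fragment, for illustration only.
rollout:
  num_candidates: 8                      # best-of-n rollout count
reward:
  base_url: https://your-reward-endpoint/v1
  api_key_env: API_KEY
output:
  language: en
  resume: true
```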
This repository already includes:
- an initialization skill library under `assets/skills/sect/`
- bundled profile assets under `assets/profiles/`
- native evaluation examples under `data/eval/`
- prompts for `bt`, `cbt`, `het`, `pdt`, and `pmt`
- evaluation prompts and method implementations for shared and therapy-specific metrics
- paper assets under `paper/`
- the released checkpoint on Hugging Face
This release does not include:
- the full `PsychEval` training and evaluation assets used in the paper
- the complete post-session skill extraction and evolution pipeline
- a full end-to-end post-training recipe for reinforced internalization
- a repository license file
The paper evaluates PsychAgent on the multi-session, multi-therapy PsychEval benchmark.
- PsychAgent (Qwen3-32B) reports 7.32 / 7.91 / 5.92 / 8.24 on counselor-shared, counselor-specific, client-shared, and client-specific metrics.
- Against Qwen3-Max, the reported gains are +1.44 / +0.17 / +0.51 / +0.43.
- Against TheraMind, the reported gains are +1.07 / +0.97 / +0.44 / +0.41.
- Across 522 matched dialogues, PsychAgent is ranked first by both human raters and the Gemini-3 LLM rater; human-human QWK is 0.675.
Related paper assets include paper/radar_v1.pdf, paper/trend.pdf, and paper/fig_11_qwk_heatmap_abc.png.
PsychAgent should currently be read as a research code release rather than a polished production package. The repository already provides runnable public workflows for generation, evaluation, and best-of-n reward selection. A few items are still best treated as follow-up work for a broader public release:
- a repository license
- a pinned environment file
- public-ready service templates in the example configs
- fuller reproduction notes for paper-scale experiments
- more explicit deployment guidance for the released checkpoint
Paper page: arXiv:2604.00931
```bibtex
@misc{yang2026psychagent,
  title = {PsychAgent: An Experience-Driven Lifelong Learning Agent for Self-Evolving Psychological Counselor},
  author = {Yutao Yang and Junsong Li and Qianjun Pan and Jie Zhou and Kai Chen and Qin Chen and Jingyuan Zhao and Ningning Zhou and Xin Li and Liang He},
  year = {2026},
  eprint = {2604.00931},
  archiveprefix = {arXiv},
  primaryclass = {cs.AI},
  url = {https://arxiv.org/abs/2604.00931},
  doi = {10.48550/arXiv.2604.00931}
}
```

- This repository is for research and experimental use only.
- It is not a substitute for licensed mental-health professionals, medical diagnosis, emergency response, or crisis intervention.
- Real-world deployment would require additional work on safety, privacy, informed consent, monitoring, escalation, and clinical governance.
- If someone may be at immediate risk of harm, contact local emergency or crisis-support services rather than relying on an automated system.
- `PsychEval` for the benchmark setting and evaluation protocol adopted in the paper
- OpenAI-compatible API tooling, Jinja2 templating, and Pydantic-based validation used throughout the released code



