PsyLLM

🌸 About • 📰 News • 📦 Dataset • 🧠 PsyLLM • 🔥 Quick Start • 📜 Citation

🌸 About

This repository contains the official evaluation code and data for the paper "Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling". See more details in our paper.

PsyLLM is the first large language model explicitly designed to combine diagnostic and therapeutic reasoning for mental health counseling. Unlike traditional LLM-based systems that mainly provide empathetic or surface-level responses, PsyLLM simulates the reasoning process of professional therapists — assessing symptoms, applying international diagnostic standards (DSM/ICD), and selecting suitable therapeutic strategies (such as CBT, ACT, and psychodynamic approaches) to produce clinically grounded, context-sensitive counseling dialogues.

📰 News

[2025-11-02] We released a new work on arXiv — TheraMind: A Strategic and Adaptive Agent for Longitudinal Psychological Counseling. We warmly welcome everyone to check it out and join the discussion!
[2025-10-28] Created the official project website: https://github.com/Emo-gml/PsyLLM.
[2025-10-21] We open-sourced the model weights and dataset on Hugging Face!
[2025-05-12] Paper submitted to arXiv: https://arxiv.org/abs/2505.15715.

📦 Dataset

Figure: Overview of the OpenR1-Psy dataset construction pipeline.

OpenR1-Psy is a large-scale psychological counseling dataset that integrates diagnostic reasoning and therapeutic reasoning to train and evaluate large language models for mental health dialogue generation. It goes beyond empathy-focused corpora by incorporating explicit reasoning traces grounded in DSM/ICD diagnostic standards and diverse psychotherapy frameworks such as CBT, ACT, psychodynamic, and humanistic therapy.

🧠 PsyLLM

PsyLLM is a large language model specialized in psychological counseling and mental health dialogue generation. It unifies diagnostic reasoning and therapeutic reasoning, grounded in established clinical frameworks such as DSM and ICD, and integrates diverse therapeutic paradigms including CBT (Cognitive Behavioral Therapy), ACT (Acceptance and Commitment Therapy), and psychodynamic therapy.

PsyLLM is trained on the OpenR1-Psy dataset, which features multi-turn counseling dialogues enriched with explicit reasoning traces. These traces enable clinically informed, empathetic, and interpretable AI-assisted therapeutic interactions.

The model training and fine-tuning pipeline are implemented using the open-source framework LLaMA-Factory. For more details, please refer to the Code.

🔥 Quick Start

from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "GMLHUHE/PsyLLM"

# load the tokenizer and the model
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    torch_dtype="auto",
    device_map="auto"
)

# prepare the model input
prompt = "I have participated in big group sessions before where I was left to find my own safe place, but it hasn't worked for me."
messages = [
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
    enable_thinking=True 
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

# conduct text completion
generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=32768
)
output_ids = generated_ids[0][len(model_inputs.input_ids[0]):].tolist()

# parsing thinking content
try:
    index = len(output_ids) - output_ids[::-1].index(151668)
except ValueError:
    index = 0

thinking_content = tokenizer.decode(output_ids[:index], skip_special_tokens=True).strip("\n")
content = tokenizer.decode(output_ids[index:], skip_special_tokens=True).strip("\n")

print("PsyLLM thinking content:", thinking_content)
print("PsyLLM content:", content)

📜 Citation

@article{hu2025beyond,
  title={Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling},
  author={Hu, He and Zhou, Yucheng and Si, Juzheng and Wang, Qianning and Zhang, Hengheng and Ren, Fuji and Ma, Fei and Cui, Laizhong},
  journal={arXiv preprint arXiv:2505.15715},
  year={2025}
}

🧩 License

For research and educational use only.

Please ensure compliance with ethical and legal standards in mental health AI research.

🔥Please contact huhe@gml.ac.cn if you encounter any issues.

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
Code		Code
PsyLLM.yaml		PsyLLM.yaml
PsyLLM_Inference.py		PsyLLM_Inference.py
README.md		README.md
logo.jpg		logo.jpg
openR1-psy.jpg		openR1-psy.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PsyLLM

🌸 About

📰 News

📦 Dataset

🧠 PsyLLM

🔥 Quick Start

📜 Citation

🧩 License

About

Uh oh!

Releases

Languages

Emo-gml/PsyLLM

Folders and files

Latest commit

History

Repository files navigation

PsyLLM

🌸 About

📰 News

📦 Dataset

🧠 PsyLLM

🔥 Quick Start

📜 Citation

🧩 License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Languages