Yingxue Xu, Fengtao Zhou, Chenyu Zhao, Yihui Wang, Can Yang, Hao Chen
This repository is the official implementation of DisPro. If you have any questions, please feel free to contact us: open an issue or email me (yxueb@connect.ust.hk). We also welcome any idea exchange and discussion.
The integration of multimodal data, including pathology images and gene profiles, is widely applied in precise survival prediction. Despite recent advances in multimodal survival models, collecting complete modalities for multimodal fusion still poses a significant challenge, hindering their application in clinical settings. Current approaches tackling incomplete modalities often fall short, as they typically compensate for only a limited part of the knowledge of missing modalities. To address this issue, we propose a Distilled Prompt Learning framework (DisPro) that exploits the strong robustness of Large Language Models (LLMs) to missing modalities, employing two-stage prompting to compensate for the comprehensive information of missing modalities. In the first stage, Unimodal Prompting (UniPro) distills the knowledge distribution of each modality, preparing to supplement the modality-specific knowledge of the missing modality in the subsequent stage. In the second stage, Multimodal Prompting (MultiPro) leverages available modalities as prompts for LLMs to infer the missing modality, which provides modality-common information. Simultaneously, the unimodal knowledge acquired in the first stage is injected into multimodal inference to compensate for the modality-specific knowledge of the missing modality. Extensive experiments covering various missing scenarios demonstrate the superiority of the proposed method.
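For orientation, the two-stage flow described above can be sketched as follows. This is only a minimal illustration of the idea; every class, method, and argument name here is a hypothetical placeholder, not the repository's actual API:

```python
def dispro_predict(wsi, omics, unipro_wsi, unipro_omics, multipro):
    """Illustrative DisPro inference flow (hypothetical interfaces).

    Stage-1 UniPro models carry distilled modality-specific knowledge;
    Stage-2 MultiPro prompts an LLM with the available modality to infer
    the missing one, then injects the UniPro knowledge before fusion.
    """
    if wsi is None and omics is not None:
        wsi = multipro.infer_missing("wsi", prompt=omics)   # modality-common information
        wsi = unipro_wsi.inject(wsi)                        # modality-specific compensation
    elif omics is None and wsi is not None:
        omics = multipro.infer_missing("omics", prompt=wsi)
        omics = unipro_omics.inject(omics)
    return multipro.fuse(wsi, omics)                        # survival prediction
```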
- Preprocess the WSI data with PrePATH and extract UNI features (or other foundation-model features you prefer) for each slide. PrePATH provides an easy-to-use tool for WSI preprocessing. Set the feature directory as `data_root_wsi`.
- The pathway signatures are provided in the `datasets` folder.
- The RNA-Seq expression data are provided on OneDrive. Unzip them and put them wherever you like, then set that path as `data_root_omics`.
- The data file for the complete modality is provided in `splits/[STUDY]_Splits.csv`.
- The data files for the various missing scenarios are provided in the `splits/csv_missing_cleaned` folder, where the number following 'W' or 'O' represents the missing rate for WSI or Omics, respectively. (Or you can simulate the missing scenarios yourself.)
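If you need to recover the missing rates programmatically from a split-file name, a small parser like the one below may help. The exact naming scheme of the files in `splits/csv_missing_cleaned` is an assumption here; adjust the pattern to the actual file names:

```python
import re

def parse_missing_rates(name):
    """Extract the WSI ('W') and Omics ('O') missing rates from a split-file
    name. The naming convention is assumed, not taken from the repository."""
    rates = {}
    for mod, val in re.findall(r"([WO])(\d+(?:\.\d+)?)", name):
        rates["wsi" if mod == "W" else "omics"] = float(val)
    return rates
```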
- Create a new conda environment.

```
conda create -n dispro python=3.10
conda activate dispro
```

- Install the required packages:

```
torch==2.3.0+cu121
timm==0.9.8
torchvision==0.18.0
numpy==1.24.3
```

or create the environment directly from the yaml file:

```
conda env create -n dispro -f dispro.yaml
```
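To verify the environment afterwards, a quick sanity check like the following can compare installed versions against the pins above. This helper is not part of the repository; the package list is taken from the requirements listed here:

```python
from importlib.metadata import version, PackageNotFoundError

# Version pins from the requirements above (the +cu121 build tag is dropped,
# since importlib.metadata reports the plain package version).
REQUIRED = {"torch": "2.3.0", "timm": "0.9.8",
            "torchvision": "0.18.0", "numpy": "1.24.3"}

def check_env(required=REQUIRED):
    """Return a dict mapping package -> (installed version or None, expected)."""
    report = {}
    for pkg, want in required.items():
        try:
            have = version(pkg)
        except PackageNotFoundError:
            have = None  # package is missing from the environment
        report[pkg] = (have, want)
    return report
```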
There are two stages in DisPro. You need to train a UniPro for each modality.

- To train the UniPro for pathology, specify the arguments in the `run_unipro_wsi.sh` script stored in `scripts` and run it:

```
bash scripts/run_unipro_wsi.sh
```

- To train the UniPro for genomics, specify the arguments in the `run_unipro_omics.sh` script stored in `scripts` and run it:

```
bash scripts/run_unipro_omics.sh
```
- You need to specify the save path `result_root` of the checkpoints in `utils/get_path_ckpt_dict.py`.
- Generate the json file storing the UniPro checkpoint paths for every modality by running:

```
python utils/get_path_ckpt_dict.py
```
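The resulting json is essentially a mapping from modality to checkpoint path. The snippet below shows one plausible shape of that file; the actual keys and paths produced by `utils/get_path_ckpt_dict.py` may differ:

```python
import json, os, tempfile

# Hypothetical shape of the checkpoint-path json; keys and paths are
# illustrative, not the repository's actual output.
ckpt_dict = {
    "wsi":   "result_root/unipro_wsi/best.pt",
    "omics": "result_root/unipro_omics/best.pt",
}

path = os.path.join(tempfile.mkdtemp(), "path_ckpt_dict.json")
with open(path, "w") as f:
    json.dump(ckpt_dict, f, indent=2)
```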
- To train the MultiPro for missing modalities, specify the arguments in the `run_multipro.sh` script stored in `scripts` and run it:

```
bash scripts/run_multipro.sh
```

This project is licensed under the Apache-2.0 License.
If you find this work useful, please cite our paper:
```
@InProceedings{Xu_2025_CVPR,
    author    = {Xu, Yingxue and Zhou, Fengtao and Zhao, Chenyu and Wang, Yihui and Yang, Can and Chen, Hao},
    title     = {Distilled Prompt Learning for Incomplete Multimodal Survival Prediction},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2025},
    pages     = {5102-5111}
}
```
