Worm Perturb-seq (WPS)

Please note that we are still actively improving this pipeline. Please contact us if you encounter any issues!

This repository provides a data processing pipeline for Worm Perturb-Seq (WPS) technology.

Introduction

WPS is an end-to-end, massively parallel RNAi and RNA-seq technology for the model animal C. elegans. It combines an experimental pipeline and a computational pipeline that together cover the process from culturing animals to identifying differentially expressed (DE) genes in each perturbation. The workflow is summarized in the following figure:

This repository details the procedures for processing WPS data, from the fastq files generated by next-generation sequencing (NGS) to the identification of DE genes in each condition.

To find out more details about WPS, please read our manuscript:

Hefei Zhang#, Xuhang Li#, Dongyuan Song, Onur Yukselen, Shivani Nanda, Alper Kucukural, Jingyi Jessica Li, Manuel Garber & Albertha J. M. Walhout. Worm Perturb-Seq: massively parallel whole-animal RNAi and RNA-seq. Nature Communications (2025). (available here)

WPS data processing involves multiple tools and requires considerable computational power. We recommend setting up a UNIX-based local server or using a computational cluster to process the data (caution: on a Windows server, some dependencies, such as the Python package pysam, may fail to install).

Software

- Via Foundry: raw data processing uses a pre-built dolphinNext pipeline for ease of use and easy management of large-scale data. Please register at Via Foundry before you start. If you intend to process the raw data manually (e.g., aligning the reads to the genome), please see 1_process_the_raw_data for more information; note that you will then need to install additional dependencies that are not included in this list. We strongly recommend using the dolphinNext pipeline for reproducibility and robustness.
- Python 2.7
- R > 3.5
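
Before running the modules, it can help to confirm that the interpreter versions listed above are available. The following is a minimal sketch, not part of the pipeline: the function names (`parse_version`, `r_version`, `check_environment`) are hypothetical, it assumes `Rscript` is on the PATH, and it reads "R > 3.5" as "major.minor strictly greater than 3.5". It is written to run under both Python 2.7 and Python 3.

```python
from __future__ import print_function

import re
import subprocess
import sys


def parse_version(text):
    """Return the first dotted version number found in `text` as a tuple of ints."""
    match = re.search(r"(\d+(?:\.\d+)+)", text)
    if match is None:
        return None
    return tuple(int(part) for part in match.group(1).split("."))


def r_version():
    """Ask Rscript for its version; return None if Rscript cannot be run."""
    try:
        # The version banner may be written to stdout or stderr,
        # so both streams are captured together.
        output = subprocess.check_output(
            ["Rscript", "--version"], stderr=subprocess.STDOUT
        )
    except (OSError, subprocess.CalledProcessError):
        return None
    return parse_version(output.decode("utf-8", "replace"))


def check_environment():
    """Compare the local Python and R versions with the listed requirements."""
    python_ok = tuple(sys.version_info[:2]) == (2, 7)
    found_r = r_version()
    # Assumption: "R > 3.5" means major.minor strictly greater than 3.5.
    r_ok = found_r is not None and found_r[:2] > (3, 5)
    return {"python_2.7": python_ok, "r_gt_3.5": r_ok}


if __name__ == "__main__":
    for name, ok in sorted(check_environment().items()):
        print("{}: {}".format(name, "OK" if ok else "not satisfied"))
```

The script only reports what it finds; it does not install or change anything.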

Walkthrough

The pipeline includes three major steps: step 1: process raw data, step 2: data quality control, and step 3: data analysis. Please see the instructions within each module to run a test.

Each major step is described below:

step 1: process raw data: This step illustrates how to process the raw reads (fastq files) generated by an NGS sequencer. The folder contains a real-data showcase of processing one WPS library using the Via Foundry platform and our pre-built pipeline.

step 2: data quality control: This step takes the files generated in step 1 and performs two complementary quality control (QC) analyses: the RNAi identity QC and the sample QC. This QC step is essential to ensure the rigor of downstream data interpretation and involves iterative correction of problems in the dataset. The folder includes a real-data showcase using the dataset from metabolic WPS plate 7.

step 3: data analysis: This step is a simple showcase of differential expression analysis using the WPS DE framework. It also serves as a template showing how downstream analyses can be integrated into the WPS data analysis pipeline.
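
The three steps correspond to three top-level folders in this repository (step1_process_raw_data, step2_quality_control, and step3_data_analysis). As a small convenience, the sketch below checks that a local copy of the repository contains all three before you start. The helper name `missing_step_folders` is hypothetical and is not provided by the pipeline itself.

```python
from __future__ import print_function

import os

# Top-level module folders of this repository, in the order the steps are run.
STEP_FOLDERS = [
    "step1_process_raw_data",
    "step2_quality_control",
    "step3_data_analysis",
]


def missing_step_folders(repo_root):
    """Return the step folders that are absent under `repo_root`, in step order."""
    return [
        name
        for name in STEP_FOLDERS
        if not os.path.isdir(os.path.join(repo_root, name))
    ]


if __name__ == "__main__":
    # Assumes the script is run from the root of the cloned repository.
    missing = missing_step_folders(".")
    if missing:
        print("Missing step folders: " + ", ".join(missing))
    else:
        print("All three step folders are present.")
```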

Contact

Questions and suggestions on WPS are welcome! Please report them on issues, or contact Xuhang Li (xuhang.li@umassmed.edu).

Related Manuscripts

WPS technology is the foundational method of three back-to-back papers. For further reading, please see:
- WPS method paper: Worm Perturb-Seq: massively parallel whole-animal RNAi and RNA-seq. Nature Communications (2025). (available here)
- Metabolic rewiring story: Systems-level design principles of metabolic rewiring in an animal. Nature (2025). (online link)
- Metabolic wiring story: A systems-level, semi-quantitative landscape of metabolic flux in C. elegans. Nature (2025). (online link)

Acknowledgement

We thank members of the Walhout lab, Mike Lee, and Chad Myers for discussion and critical reading of the manuscript. We thank former Garber lab members, Kyle Gellatly and Rachel Murphy, for their help in the early stage of this project. This work was supported by grants from the National Institutes of Health GM122502 and DK068429 to A.J.M.W., U01HG012064 to M.G., and NSF DBI-1846216 and NIGMS R35GM140888 to J.J.L.

