Enable zstd compression format by GilbertHan1011 · Pull Request #640 · OpenGene/fastp

GilbertHan1011 · 2025-11-19T12:00:09Z

zstdandard(zstd) is a compression algorithm with higher performance than zlib. This contribution enabled fastp to handle zstd format

bwlang

@sfchen I tested this locally. It compiled and worked as expected for reading zstd compressed inputs - including with larger files (10k reads). I also tested interleaved fastq.zst files successfully

I think it could be merged, maybe without the .gitignore modification.

bwlang · 2026-01-18T21:15:55Z

.gitignore

-*.html
+*.html
+
+out/


I think this is a local preference

Yes. This have been updated.

KimBioInfoStudio · 2026-03-10T03:33:11Z

we r working on this, BTW which downstream sw support consume fq.zst?

GilbertHan1011 · 2026-03-10T09:46:45Z

@KimBioInfoStudio Precellar https://github.com/regulatory-genomics/precellar, and lots of snakemake pipeline https://github.com/regulatory-genomics/ATAC-sm

KimBioInfoStudio · 2026-03-10T10:09:43Z

seems, most upstream basecalling not support output r1.fq.zst even BGZF, welcome pr, could u rebase it to latest master branch, BTW, we r seeking a better format which support indexed multi members and read parallelism

GilbertHan1011 · 2026-03-10T12:11:54Z

seems, most upstream basecalling not support output r1.fq.zst even BGZF, welcome pr, could u rebase it to latest master branch, BTW, we r seeking a better format which support indexed multi members and read parallelism

Done.
I’m also very interested in that kind of format. I'm planing work on it.

KimBioInfoStudio

Pls use lfs track those files?

GilbertHan1011 · 2026-03-11T05:54:15Z

I tried Git LFS, but GitHub does not allow uploading new LFS objects to my public fork (rejected by server).
So I removed the .zst testdata files from the PR to avoid adding binary blobs.

KimBioInfoStudio · 2026-03-12T05:21:17Z

could u help with one group of bench as evidence

xxx.fq.zst win at compression ratio
no perf regression compare to .fq.gz -> .fq.gz
cover pe and se
ge 10M reads/pairs

GilbertHan1011 · 2026-03-12T11:16:21Z

fastp_zstd_benchmark_report_maxcomp.pdf
@KimBioInfoStudio

KimBioInfoStudio · 2026-03-12T12:01:41Z

@sfchen plz kindly consider merge this pr

bwlang approved these changes Jan 18, 2026

View reviewed changes

litian han and others added 4 commits March 10, 2026 19:47

Enable zstd compression format

b8c9f08

minor

291d17a

Update .gitignore

79677d7

Update .gitignore

3f8ecde

GilbertHan1011 force-pushed the master branch from 4c7132d to 3f8ecde Compare March 10, 2026 12:08

KimBioInfoStudio reviewed Mar 11, 2026

View reviewed changes

Remove .zst testdata files (avoid Git LFS on forks)

115dd1f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable zstd compression format#640

Enable zstd compression format#640
GilbertHan1011 wants to merge 5 commits intoOpenGene:masterfrom
GilbertHan1011:master

GilbertHan1011 commented Nov 19, 2025

Uh oh!

bwlang left a comment

Uh oh!

bwlang Jan 18, 2026

Uh oh!

GilbertHan1011 Jan 19, 2026

Uh oh!

KimBioInfoStudio commented Mar 10, 2026

Uh oh!

GilbertHan1011 commented Mar 10, 2026

Uh oh!

KimBioInfoStudio commented Mar 10, 2026

Uh oh!

GilbertHan1011 commented Mar 10, 2026

Uh oh!

KimBioInfoStudio left a comment

Uh oh!

GilbertHan1011 commented Mar 11, 2026

Uh oh!

KimBioInfoStudio commented Mar 12, 2026 •

edited

Loading

Uh oh!

GilbertHan1011 commented Mar 12, 2026

Uh oh!

KimBioInfoStudio commented Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

-              *.html

                
                    No newline at end of file
+              *.html
+              out/

                
                    No newline at end of file

Conversation

GilbertHan1011 commented Nov 19, 2025

Uh oh!

bwlang left a comment

Choose a reason for hiding this comment

Uh oh!

bwlang Jan 18, 2026

Choose a reason for hiding this comment

Uh oh!

GilbertHan1011 Jan 19, 2026

Choose a reason for hiding this comment

Uh oh!

KimBioInfoStudio commented Mar 10, 2026

Uh oh!

GilbertHan1011 commented Mar 10, 2026

Uh oh!

KimBioInfoStudio commented Mar 10, 2026

Uh oh!

GilbertHan1011 commented Mar 10, 2026

Uh oh!

KimBioInfoStudio left a comment

Choose a reason for hiding this comment

Uh oh!

GilbertHan1011 commented Mar 11, 2026

Uh oh!

KimBioInfoStudio commented Mar 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

GilbertHan1011 commented Mar 12, 2026

Uh oh!

KimBioInfoStudio commented Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

KimBioInfoStudio commented Mar 12, 2026 •

edited

Loading