Skip to content
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
193 commits
Select commit Hold shift + click to select a range
4052204
laneprocess.py uses correct SAMPLE_NAME
jemma-nelson Apr 18, 2022
621b345
apply fix to right file, mark old file deprecated
Apr 24, 2022
58603ec
Use CopyComplete.txt to start processing
jemma-nelson Jul 5, 2022
536ed63
Switch initial flowcell processing to hpcz-1
jemma-nelson Jul 5, 2022
3b99fd3
chore: update default queue names for Altius
Jul 7, 2022
4c82c7c
Add module for bcl2fastq - contains samplesheet generation
jemma-nelson Jul 20, 2022
ca3a64d
Add test of alt-seq pipeline
jemma-nelson Jul 24, 2022
3858ef6
Refine altseq.nf and add process_altseq.bash
jemma-nelson Jul 25, 2022
b636f55
Connect altseq with LIMS
jemma-nelson Jul 26, 2022
e1c3ae7
altseq script optimizations - better caching
jemma-nelson Aug 2, 2022
27d61d6
Altseq - version 1.0.0
jemma-nelson Aug 2, 2022
2a3b1ec
Don't use scratch space for bcl2fastq & merge_fq
jemma-nelson Aug 14, 2022
24c9b4a
Use production LIMS instead of staging
jemma-nelson Aug 14, 2022
212a21b
Altseq - skip running alignment for now
jemma-nelson Aug 18, 2022
bf322cb
Altseq - handle pools with same pool barcodes
jemma-nelson Aug 18, 2022
1341024
setup.sh uses processing_information endpoint again
jemma-nelson Aug 18, 2022
ec1df14
Merge pull request #56 from StamLab/alt-seq
jemma-nelson Aug 18, 2022
cd3c06a
fix: encode_cram_no_ref now works again
jemma-nelson Aug 21, 2022
d20c2e1
fix for altseq setup.sh processing
jemma-nelson Sep 4, 2022
c273b87
nextflow_clean script proceeds w/o output symlinks
jemma-nelson Sep 8, 2022
69ebe3c
!fixup c273b879 - missed a simple bug.
Sep 8, 2022
fb8bd40
Improve alignprocess.py error logging
jemma-nelson Oct 19, 2022
85b565f
Fix alignprocess.py when library_kit_method=null
jemma-nelson Oct 19, 2022
b2cdab7
fixup: Can't use f-strings in current python ver
jemma-nelson Oct 19, 2022
c86a544
Merge pull request #57 from StamLab/fix/align_process_library_kits
jemma-nelson Nov 13, 2022
0248da8
Config: Add 137 to retry-with-more-mem exit codes
Nov 20, 2022
4760b6d
fix/rna-agg: two typos in anaquin processing
Nov 20, 2022
4cbfafc
altseq - use better publishing strategy
jemma-nelson Dec 5, 2022
d70fdc3
Add basic analysis
jemma-nelson Dec 9, 2022
df9c611
Update sample_config to new format
jemma-nelson Dec 12, 2022
e6c29d7
Add altseq config example template to show usage
jemma-nelson Dec 12, 2022
210e4c6
Changes toward altseq per-sample stats
jemma-nelson Dec 14, 2022
f5dcc76
Reformat ./fetch.sh as Makefile
jemma-nelson Dec 14, 2022
2ce9f1e
Update altseq/upload_data.py
jemma-nelson Jan 9, 2023
0ea0fa8
Commit some python tooling files
jemma-nelson Jan 9, 2023
67fad27
Update altseq README with more starsolo info
jemma-nelson Jan 19, 2023
232d536
altseq: drop filtering step (unnecessary)
jemma-nelson Jan 19, 2023
d2dea5a
third_party: update STAR to fix intronicAS count
jemma-nelson Jan 19, 2023
b5f01f3
fix: don't duplicate aggs for processing
jemma-nelson Jan 22, 2023
3771197
Altseq: Generate per-flowcell and per-pool stats
jemma-nelson Feb 13, 2023
4af9409
altseq/upload_data.py uploads flowcell_counts.json
jemma-nelson Feb 13, 2023
44b87eb
Altseq - update stats-upload logic
jemma-nelson Feb 27, 2023
3909a15
Altseq - use /net/seq/scratch for workdirs
jemma-nelson Feb 27, 2023
6ac7883
Merge branch 'main' into feat/altseq-dev
jemma-nelson Feb 27, 2023
a4d6a4a
Fix altseq file upload
jemma-nelson Mar 19, 2023
1469a04
WIP - altseq uses CRAM compression in 1 step
jemma-nelson Mar 21, 2023
21d17f8
STAR v1.2: Remove anaquin from pipeline
jemma-nelson Mar 21, 2023
995a276
Fix up cram sorting, add starsolo Log publishing
jemma-nelson Mar 28, 2023
82021c1
Merge branch 'feat/altseq-dev' of http://github.com/StamLab/stampipes…
Mar 28, 2023
1dd04f1
STAR uploads the right all-alignments-bam file
jemma-nelson Mar 29, 2023
c58d0cc
Altseq 1.1.0 alpha4 - fix ref file, publish `Gene`
jemma-nelson Apr 3, 2023
173ee8b
Alt-seq: fix for flowcells with 1 single pool
jemma-nelson Apr 20, 2023
ef8eeac
Merge branch 'feat/altseq-dev' of http://github.com/StamLab/stampipes…
Apr 20, 2023
a15e3ce
Update docker container path for dnase alns
jemma-nelson May 9, 2023
1794ab4
Fix docker config for dnase agg tests
jemma-nelson May 9, 2023
622abbc
Merge pull request #61 from StamLab/fix/local_docker_tests
jemma-nelson May 9, 2023
08e2c36
fix: add preseq to modules for bwa aggregation
jemma-nelson May 22, 2023
f4b438d
script to create barcode report from demux stats
jemma-nelson May 31, 2023
3ed32ee
(add) process for creating bigbeds for peak point1per files
markfrerker Jun 2, 2023
ad8f4d5
starch_to_bigbed - add 'modules' label
jemma-nelson Jun 4, 2023
8adb188
starch_to_bigbed - create chrom_sizes from param
jemma-nelson Jun 4, 2023
78a5c2e
v2.6.0 for DNase aggregation
jemma-nelson Jun 4, 2023
6feff6b
Merge pull request #65 from StamLab/feature/create-peaks-bigbed
jemma-nelson Jun 6, 2023
9bd3384
Add altcode process for alignment & quantification
jemma-nelson Jun 8, 2023
d2181d4
Altcode: Wire in rest of params
jemma-nelson Jun 11, 2023
ae70874
Add script to create multiple samplesheets
jemma-nelson Jul 19, 2023
530da1c
altcode -minor output publishing changes
jemma-nelson Jul 20, 2023
4a06465
make_samplesheets.py fixups
jemma-nelson Jul 23, 2023
10132f5
workaround: --allow_collisions for max_mismatch.py
jemma-nelson Jul 23, 2023
23da457
flowcells/setup.sh: Support alt-code novaseq runs
jemma-nelson Jul 23, 2023
01b7be3
setup.sh - fixups for some long-running issues
jemma-nelson Jul 23, 2023
e7ec83d
link_nextseq.py: Python3 compat and formatting
jemma-nelson Jul 23, 2023
f7fc9c3
fixups: extra fixes from testing
jemma-nelson Jul 23, 2023
8f31c10
Merge branch 'main' into feat/altcode-dev
jemma-nelson Jul 23, 2023
813883a
Merge pull request #69 from StamLab/feat/altcode-dev
jemma-nelson Jul 23, 2023
0c0abc7
Altcode - add scanpy file gen. Uses apptainer
jemma-nelson Jul 26, 2023
519aa89
Add process_altcode.bash
jemma-nelson Jul 27, 2023
7cae5da
altcode config: remove unneeded runOptions
jemma-nelson Jul 28, 2023
41f26ff
improve convert_to_hda5 error detection
jemma-nelson Jul 28, 2023
dbd0fad
Add 'scratch false' for jobs that don't need /tmp/
jemma-nelson Jul 30, 2023
331479f
Make sure altseq runs on hpcz-2
jemma-nelson Jul 30, 2023
3c586ef
Fix ref for altseq - was live but not committed
jemma-nelson Jul 30, 2023
4680d91
Various accumulated fixes from production
jemma-nelson Jul 30, 2023
90bdd3f
Merge remote-tracking branch 'origin/main' into feat/altcode-dev
jemma-nelson Jul 30, 2023
36d35fb
Add upload script
jemma-nelson Jul 30, 2023
42d0f1d
setup.sh fixes
jemma-nelson Jul 30, 2023
b1f1e65
Bunch of changes to support LibraryPool dirs
jemma-nelson Aug 6, 2023
05c2914
Remove pylint dep to solve version conflict
jemma-nelson Aug 6, 2023
8383e42
remove commented out config
jemma-nelson Aug 8, 2023
8e5b532
fix process_altcode.bash
jemma-nelson Aug 9, 2023
1a1b389
Allow collate_fastq to handle altcode
jemma-nelson Aug 9, 2023
29d3473
Fix naming of LibraryPool fastq files
jemma-nelson Aug 9, 2023
133cf1c
Add poolprocess.py, analogous to alignprocess.py
jemma-nelson Aug 9, 2023
d8a74cd
Add apptainer bind mounts
jemma-nelson Aug 9, 2023
0766e46
Fix typo in process name
jemma-nelson Aug 9, 2023
aa10ad0
Altseq is stable, remove alpha label
jemma-nelson Sep 11, 2023
0222980
Merge branch 'feat/altseq-dev' into feat/altcode-dev
jemma-nelson Sep 11, 2023
f34085c
Fix - copy SampleSheet*csv in run_bcl2fastq_2.sh
jemma-nelson Sep 11, 2023
ad5bd21
Fixes and QoL for make_samplesheets.py
akjohnson Sep 13, 2023
7cab837
Altcode - fetch additional metadata
jemma-nelson Sep 21, 2023
276ceda
Altcode - embed metadata in h5ad files
jemma-nelson Sep 24, 2023
f152c33
Altcode metadata - get lentitale info from TC
jemma-nelson Oct 1, 2023
96e9044
Add Agilent Agent adapter trimming
jemma-nelson Oct 3, 2023
fba0294
Altcode - save more multimapper variants.
jemma-nelson Oct 4, 2023
059a037
Altcode: Add sample plate wells to metadata
jemma-nelson Oct 4, 2023
5d752bb
Fix for miniseq linking
jemma-nelson Oct 4, 2023
a42425c
Alt-code: update to use 96-barcode file for R1
jemma-nelson Oct 9, 2023
1ae8c52
fix typo in altcode metadata
jemma-nelson Oct 9, 2023
ba4ea66
Altcode - add polyA trimming and adjust resources
jemma-nelson Oct 9, 2023
8e37f05
altcode: version bump to 1.0.0-alpha3
jemma-nelson Oct 9, 2023
d4a5f81
Altcode - improve workdir handling & cleaning
jemma-nelson Oct 9, 2023
0d3df54
altcode: Generate stats and upload to LIMS
jemma-nelson Oct 15, 2023
bf45a5e
added Effector Assembly QC to lentitale object
msbuckley Oct 25, 2023
33e7c41
Merge pull request #71 from StamLab/feat/altcode-dev-patch
jemma-nelson Oct 25, 2023
0910298
edgecase fix for altcode stat finding
jemma-nelson Oct 25, 2023
d8a8d43
altcode alpha4 - summarize by library
jemma-nelson Oct 25, 2023
3a20e41
Fix whitespace
jemma-nelson Oct 31, 2023
ee5fbd8
New metadata fmt
jemma-nelson Oct 31, 2023
eb838b0
Bump to version 1.0.0-alpha5
jemma-nelson Oct 31, 2023
2e57cb1
altcode metadata: Fix barcode field
jemma-nelson Nov 2, 2023
1b73daa
Add locus info
jemma-nelson Nov 26, 2023
12fc6b6
altcode: verify cram copy completion
jemma-nelson Jan 10, 2024
e49a73d
altcode: fix issue with double-copy
jemma-nelson Jan 10, 2024
b8a6b3e
altcode: improve metadata talen information
jemma-nelson Jan 10, 2024
715017a
altcode: retain unmapped reads in CRAM file
jemma-nelson Jan 10, 2024
4212fd3
altcode: alpha version bump
jemma-nelson Jan 10, 2024
6278fb9
Fixes for altcode alpha6
jemma-nelson Jan 16, 2024
02abd7b
Fix miniseq automatic processing
jemma-nelson Jan 23, 2024
37adb29
Altcode alpha7: more lenient cell barcode strategy
jemma-nelson Feb 1, 2024
0d4caac
altcode - alphta8: add CB & UB output
jemma-nelson Feb 11, 2024
80b4682
Add support for NovaSeq X 1.5B
jemma-nelson Mar 12, 2024
936e36b
Improve fastq linking for LP samples
jemma-nelson Mar 12, 2024
2075024
Collation works for LPs
jemma-nelson Mar 12, 2024
327bf79
poolprocess.py minimally works again
jemma-nelson Mar 12, 2024
256fa52
upload_data - improve flowcell regex detection
jemma-nelson Mar 12, 2024
0815e51
fix barcode count for masks with fewer lanes
jemma-nelson Mar 12, 2024
d89a995
altcode - 1.0 relase
jemma-nelson Mar 12, 2024
eeb6918
Merge pull request #72 from StamLab/feat/altcode-dev
jemma-nelson Mar 12, 2024
23a799c
Support Novaseq 10B
akjohnson Mar 24, 2024
87b5cd8
Fix barcode linking for LPs
jemma-nelson Mar 24, 2024
b4b6fbc
Simplify make_samplesheets.py with new pool system
jemma-nelson Mar 24, 2024
3cd1587
Altcode pipeline - some more completion checks
jemma-nelson Apr 8, 2024
0cf032a
upload_data.py: switch to bulk-count upload endpoint
jemma-nelson Apr 8, 2024
74ef1a5
altcode: stringify NaNs in stat upload
jemma-nelson Apr 8, 2024
4871c83
alignprocess.py: don't crash if library kit method missing
jemma-nelson Apr 8, 2024
209cdf5
Fix - vs _ confusion in fastq file names
jemma-nelson Apr 8, 2024
aa20f52
poolprocess.py - keep going even if errors encountered
jemma-nelson Apr 8, 2024
60a6d29
poolprocess.py - add sublibrary and cell_library
jemma-nelson Apr 9, 2024
3ed1b27
Fix mistaken lane merging/duplication
jemma-nelson Apr 23, 2024
0b0ebda
NovaSeq 10B fix: backport from prod
jemma-nelson Apr 23, 2024
4491a79
Add support for Novaseq X 25B
jemma-nelson May 21, 2024
e4c3b93
upload_data.py: wrap bulk_upload stats in list
jemma-nelson Jun 3, 2024
ffba535
Fix for NovaSeq run type
jemma-nelson Jun 9, 2024
2616122
Merge pull request #73 from StamLab/fix/NovaSeq_demuxing
jemma-nelson Jun 9, 2024
b4ed09f
demuxing novaseq: process at lane/samplesheet level
jemma-nelson Jun 13, 2024
3dceeee
Remove TODO that's already to-done
jemma-nelson Jun 13, 2024
ff7a8d1
Merge pull request #74 from StamLab/feat/fast_demuxing
jemma-nelson Jun 13, 2024
fa09404
setup.sh - fix copy for FC with no samples
jemma-nelson Jun 27, 2024
7be986d
Revert "setup.sh - fix copy for FC with no samples"
jemma-nelson Jun 27, 2024
385f655
Fix for copy for Flowcells without samples
jemma-nelson Jun 27, 2024
736787b
Merge pull request #75 from StamLab/fix/copy_for_fc_without_samples
jemma-nelson Jun 30, 2024
5bcb116
Update collate wait to include LPs
jemma-nelson Jul 7, 2024
b683211
Remove duplicate collate/fastqc-script creation
jemma-nelson Jul 7, 2024
5104779
Merge pull request #76 from StamLab/fix/lp_collation_timing
jemma-nelson Jul 9, 2024
92f6059
style: Format python codebase with ruff
jemma-nelson Jul 11, 2024
a42333f
style: Fix lint errors from ruff
jemma-nelson Jul 11, 2024
94e3c64
style: order imports in python scripts with ruff
jemma-nelson Jul 11, 2024
109d348
style: fix some possible bugs identifed by ruff
jemma-nelson Jul 11, 2024
daee133
style: Make sure to use raw strings for regex
jemma-nelson Jul 11, 2024
2738b49
style: use pythonic capitalization consistently
jemma-nelson Jul 11, 2024
1cbe3d6
style: executables are chmod +x and have shebang
jemma-nelson Jul 11, 2024
58d776c
style: logging-related changes
jemma-nelson Jul 11, 2024
9eeb4b1
First draft of fastq pipeline container
jemma-nelson Sep 12, 2024
41ce205
Add biopython to fastq container
jemma-nelson Sep 17, 2024
21ee6a2
Update script to use apptainer when able
jemma-nelson Sep 18, 2024
8cb0901
collation/fastq updates for container
jemma-nelson Sep 22, 2024
de4d687
rework for run-time cluster loc detection
jemma-nelson Oct 6, 2024
6255a8e
bcl2fastq actually runs in apptainer now
jemma-nelson Oct 6, 2024
d8ef59d
Final tweaks
jemma-nelson Oct 15, 2024
718d54e
Fix fastqc script to run on new cluster
jemma-nelson Oct 15, 2024
8f2059c
Merge pull request #78 from StamLab/feat/fastq_container
jemma-nelson Oct 16, 2024
899290b
Merge pull request #77 from StamLab/style/add_precommit
jemma-nelson Oct 16, 2024
2d03447
Fix: fastqc/alignments/pools wait for collation
jemma-nelson Oct 20, 2024
f94689c
Merge pull request #79 from StamLab/fix/wait_for_collation
jemma-nelson Oct 20, 2024
2638c0d
Alignprocess.py skips library pools
jemma-nelson Oct 21, 2024
2d1dd24
Merge pull request #80 from StamLab/fix/skip_lp_alignments
jemma-nelson Oct 21, 2024
857cdc8
aggregateprocess.py: fix typo-induced bug
Nov 20, 2024
30e533b
fix: handle missing `project_share_directory`
jemma-nelson Feb 5, 2025
8fec6d1
Fix setup.sh for miniseq on new cluster
jemma-nelson Mar 23, 2025
f44ddc0
link_nextseq.py supports R3 & R4 fastq files
jemma-nelson Jul 6, 2025
3245526
Fix collate/fastq/upload for up to 4 reads
jemma-nelson Jul 6, 2025
d28784a
Fixup for fastqc.bash
jemma-nelson Jul 7, 2025
d6a371e
Disable pool processing
jemma-nelson Aug 26, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 5 additions & 1 deletion .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,10 @@ environments/
**/.nextflow*

**/*html*
**/*trace.txt*
**/trace*txt*
**/report*txt*

**/dag*dot*


*.fq
Expand All @@ -24,6 +27,7 @@ environments/
!.gitattributes
!.gitmodules
!.pylintrc
!.pre-commit-config.yaml

work
genome_build
Expand Down
9 changes: 9 additions & 0 deletions .pre-commit-config.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,9 @@
repos:
- repo: https://github.com/astral-sh/ruff-pre-commit
# Ruff version.
rev: v0.5.1
hooks:
# Run the linter.
- id: ruff
# Run the formatter.
- id: ruff-format
13 changes: 7 additions & 6 deletions Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -65,14 +65,15 @@ RUN apt-get install -y \
g++ \
git \
libbz2-dev \
libcurl4-openssl-dev \
liblzma-dev \
make \
ncurses-dev \
wget \
zlib1g-dev
RUN wget --quiet https://github.com/samtools/samtools/releases/download/1.7/samtools-1.7.tar.bz2 \
&& tar xf samtools-1.7.tar.bz2 \
&& cd samtools-1.7 \
RUN wget --quiet https://github.com/samtools/samtools/releases/download/1.12/samtools-1.12.tar.bz2 \
&& tar xf samtools-1.12.tar.bz2 \
&& cd samtools-1.12 \
&& make install

#####################
Expand Down Expand Up @@ -151,9 +152,8 @@ RUN wget --quiet https://github.com/arq5x/bedtools2/releases/download/v2.25.0/be
from build-base as build-preseq
RUN apt-get install -y \
libgsl-dev
RUN git clone --recurse-submodules https://github.com/smithlabcode/preseq.git \
RUN git clone --recurse-submodules -b v2.0.1 --single-branch https://github.com/smithlabcode/preseq.git \
&& cd preseq \
&& git checkout v2.0.1 \
&& make


Expand All @@ -168,8 +168,9 @@ RUN apt-get install -y \
bc \
bowtie \
build-essential \
libboost-dev \
coreutils \
gawk \
libboost-dev \
libgsl-dev \
littler \
openjdk-8-jre \
Expand Down
1 change: 0 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -19,7 +19,6 @@ Modules

The current modules to load for working the pipeline are:

* `anaquin/2.0.1`
* `bcl2fastq/1.8.4`
* `bcl2fastq2/2.15.0.4`
* `bedops/2.4.35-typical`
Expand Down
46 changes: 46 additions & 0 deletions containers/fastq/fastq.def
Original file line number Diff line number Diff line change
@@ -0,0 +1,46 @@
Bootstrap: docker
From: rockylinux:8.6

%arguments
FASTQC_VERSION=v0.11.5

%help
This container contains the necessary software to run the first stage of
the pipeline, beginning from retriving metadata from LIMS, running
bcl2fastq and demultiplexing, collating fastq files, running FastQC, and
registering output with LIMS.

%files
./bcl2fastq /usr/local/bin/bcl2fastq
../../scripts/lims/stamlims_api /opt/

%post
set -e
dnf install -y python39
ln -s /usr/bin/python3 /usr/bin/python
python -m pip install /opt/stamlims_api/
python -m pip install Biopython
dnf install -y rsync

# install fastqc
# https://github.com/s-andrews/FastQC/
(
dnf install -y unzip perl java
cd /opt
curl https://www.bioinformatics.babraham.ac.uk/projects/fastqc/fastqc_{{ FASTQC_VERSION }}.zip --output fastqc.zip
unzip fastqc.zip
rm fastqc.zip
chmod +x FastQC/fastqc
ln -s /opt/FastQC/fastqc /usr/local/bin/fastqc
)


%test
set -e
bcl2fastq --version
rsync --version
fastqc --version
gzip --version
python3 --version

# vim: noexpandtab ts=4 sts=4 sw=4
127 changes: 127 additions & 0 deletions data/flowcells/miniseq/example_SampleSheet.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,127 @@
[Header],,,,,,,,,,
IEMFileVersion,4,,,,,,,,,
Investigator Name,Janghee,,,,,,,,,
Project Name,2015_09_30,,,,,,,,,
Experiment Name,,,,,,,,,,
Date,9/30/15,,,,,,,,,
Workflow,Resequencing,,,,,,,,,
Application,TruSeq DNA/RNA,,,,,,,,,
Assay,TruSeq DNA/RNA,,,,,,,,,
Description,,,,,,,,,,
Chemistry,Amplicon,,,,,,,,,
,,,,,,,,,,
[Reads],,,,,,,,,,
150,,,,,,,,,,
150,,,,,,,,,,
[Settings],,,,,,,,,,
OnlyGenerateFASTQ,1,,,,,,,,,
[Data],,,,,,,,,,
Sample_ID,Sample_Name,Sample_Plate,Sample_Well,I7_Index_ID,index,I5_Index_ID,index2,Sample_Project,Description,GenomeFolder
A1-i7-1-N501,,,,i7_1,AACGTGAT,N501,GCGATCTA,,,
B1-i7-1-N502,,,,i7_1,AACGTGAT,N502,ATAGAGAG,,,
C1-i7-1-N503,,,,i7_1,AACGTGAT,N503,AGAGGATA,,,
D1-i7-1-N504,,,,i7_1,AACGTGAT,N504,TCTACTCT,,,
E1-i7-1-N505,,,,i7_1,AACGTGAT,N505,CTCCTTAC,,,
F1-i7-1-N506,,,,i7_1,AACGTGAT,N506,TATGCAGT,,,
G1-i7-1-N507,,,,i7_1,AACGTGAT,N507,TACTCCTT,,,
H1-i7-1-N508,,,,i7_1,AACGTGAT,N508,AGGCTTAG,,,
X1-i7-1-N517,,,,i7_1,AACGTGAT,N517,TCTTACGC,,,
A2-i7-2-N501,,,,i7_2,AAACATCG,N501,GCGATCTA,,,
B2-i7-2-N502,,,,i7_2,AAACATCG,N502,ATAGAGAG,,,
C2-i7-2-N503,,,,i7_2,AAACATCG,N503,AGAGGATA,,,
D2-i7-2-N504,,,,i7_2,AAACATCG,N504,TCTACTCT,,,
E2-i7-2-N505,,,,i7_2,AAACATCG,N505,CTCCTTAC,,,
F2-i7-2-N506,,,,i7_2,AAACATCG,N506,TATGCAGT,,,
G2-i7-2-N507,,,,i7_2,AAACATCG,N507,TACTCCTT,,,
H2-i7-2-N508,,,,i7_2,AAACATCG,N508,AGGCTTAG,,,
X2-i7-2-N517,,,,i7_2,AAACATCG,N517,TCTTACGC,,,
A3-i7-3-N501,,,,i7_3,ATGCCTAA,N501,GCGATCTA,,,
B3-i7-3-N502,,,,i7_3,ATGCCTAA,N502,ATAGAGAG,,,
C3-i7-3-N503,,,,i7_3,ATGCCTAA,N503,AGAGGATA,,,
D3-i7-3-N504,,,,i7_3,ATGCCTAA,N504,TCTACTCT,,,
E3-i7-3-N505,,,,i7_3,ATGCCTAA,N505,CTCCTTAC,,,
F3-i7-3-N506,,,,i7_3,ATGCCTAA,N506,TATGCAGT,,,
G3-i7-3-N507,,,,i7_3,ATGCCTAA,N507,TACTCCTT,,,
H3-i7-3-N508,,,,i7_3,ATGCCTAA,N508,AGGCTTAG,,,
X3-i7-3-N517,,,,i7_3,ATGCCTAA,N517,TCTTACGC,,,
A4-i7-4-N501,,,,i7_4,AGTGGTCA,N501,GCGATCTA,,,
B4-i7-4-N502,,,,i7_4,AGTGGTCA,N502,ATAGAGAG,,,
C4-i7-4-N503,,,,i7_4,AGTGGTCA,N503,AGAGGATA,,,
D4-i7-4-N504,,,,i7_4,AGTGGTCA,N504,TCTACTCT,,,
E4-i7-4-N505,,,,i7_4,AGTGGTCA,N505,CTCCTTAC,,,
F4-i7-4-N506,,,,i7_4,AGTGGTCA,N506,TATGCAGT,,,
G4-i7-4-N507,,,,i7_4,AGTGGTCA,N507,TACTCCTT,,,
H4-i7-4-N508,,,,i7_4,AGTGGTCA,N508,AGGCTTAG,,,
X4-i7-4-N517,,,,i7_4,AGTGGTCA,N517,TCTTACGC,,,
A5-i7-5-N501,,,,i7_5,ACCACTGT,N501,GCGATCTA,,,
B5-i7-5-N502,,,,i7_5,ACCACTGT,N502,ATAGAGAG,,,
C5-i7-5-N503,,,,i7_5,ACCACTGT,N503,AGAGGATA,,,
D5-i7-5-N504,,,,i7_5,ACCACTGT,N504,TCTACTCT,,,
E5-i7-5-N505,,,,i7_5,ACCACTGT,N505,CTCCTTAC,,,
F5-i7-5-N506,,,,i7_5,ACCACTGT,N506,TATGCAGT,,,
G5-i7-5-N507,,,,i7_5,ACCACTGT,N507,TACTCCTT,,,
H5-i7-5-N508,,,,i7_5,ACCACTGT,N508,AGGCTTAG,,,
X5-i7-5-N517,,,,i7_5,ACCACTGT,N517,TCTTACGC,,,
A6-i7-6-N501,,,,i7_6,ACATTGGC,N501,GCGATCTA,,,
B6-i7-6-N502,,,,i7_6,ACATTGGC,N502,ATAGAGAG,,,
C6-i7-6-N503,,,,i7_6,ACATTGGC,N503,AGAGGATA,,,
D6-i7-6-N504,,,,i7_6,ACATTGGC,N504,TCTACTCT,,,
E6-i7-6-N505,,,,i7_6,ACATTGGC,N505,CTCCTTAC,,,
F6-i7-6-N506,,,,i7_6,ACATTGGC,N506,TATGCAGT,,,
G6-i7-6-N507,,,,i7_6,ACATTGGC,N507,TACTCCTT,,,
H6-i7-6-N508,,,,i7_6,ACATTGGC,N508,AGGCTTAG,,,
X6-i7-6-N517,,,,i7_6,ACATTGGC,N517,TCTTACGC,,,
A7-i7-7-N501,,,,i7_7,CAGATCTG,N501,GCGATCTA,,,
B7-i7-7-N502,,,,i7_7,CAGATCTG,N502,ATAGAGAG,,,
C7-i7-7-N503,,,,i7_7,CAGATCTG,N503,AGAGGATA,,,
D7-i7-7-N504,,,,i7_7,CAGATCTG,N504,TCTACTCT,,,
E7-i7-7-N505,,,,i7_7,CAGATCTG,N505,CTCCTTAC,,,
F7-i7-7-N506,,,,i7_7,CAGATCTG,N506,TATGCAGT,,,
G7-i7-7-N507,,,,i7_7,CAGATCTG,N507,TACTCCTT,,,
H7-i7-7-N508,,,,i7_7,CAGATCTG,N508,AGGCTTAG,,,
X7-i7-7-N517,,,,i7_7,CAGATCTG,N517,TCTTACGC,,,
A8-i7-8-N501,,,,i7_8,CATCAAGT,N501,GCGATCTA,,,
B8-i7-8-N502,,,,i7_8,CATCAAGT,N502,ATAGAGAG,,,
C8-i7-8-N503,,,,i7_8,CATCAAGT,N503,AGAGGATA,,,
D8-i7-8-N504,,,,i7_8,CATCAAGT,N504,TCTACTCT,,,
E8-i7-8-N505,,,,i7_8,CATCAAGT,N505,CTCCTTAC,,,
F8-i7-8-N506,,,,i7_8,CATCAAGT,N506,TATGCAGT,,,
G8-i7-8-N507,,,,i7_8,CATCAAGT,N507,TACTCCTT,,,
H8-i7-8-N508,,,,i7_8,CATCAAGT,N508,AGGCTTAG,,,
X8-i7-8-N517,,,,i7_8,CATCAAGT,N517,TCTTACGC,,,
A9-i7-9-N501,,,,i7_9,CGCTGATC,N501,GCGATCTA,,,
B9-i7-9-N502,,,,i7_9,CGCTGATC,N502,ATAGAGAG,,,
C9-i7-9-N503,,,,i7_9,CGCTGATC,N503,AGAGGATA,,,
D9-i7-9-N504,,,,i7_9,CGCTGATC,N504,TCTACTCT,,,
E9-i7-9-N505,,,,i7_9,CGCTGATC,N505,CTCCTTAC,,,
F9-i7-9-N506,,,,i7_9,CGCTGATC,N506,TATGCAGT,,,
G9-i7-9-N507,,,,i7_9,CGCTGATC,N507,TACTCCTT,,,
H9-i7-9-N508,,,,i7_9,CGCTGATC,N508,AGGCTTAG,,,
X9-i7-9-N517,,,,i7_9,CGCTGATC,N517,TCTTACGC,,,
A10-i7-10-N501,,,,i7_10,ACAAGCTA,N501,GCGATCTA,,,
B10-i7-10-N502,,,,i7_10,ACAAGCTA,N502,ATAGAGAG,,,
C10-i7-10-N503,,,,i7_10,ACAAGCTA,N503,AGAGGATA,,,
D10-i7-10-N504,,,,i7_10,ACAAGCTA,N504,TCTACTCT,,,
E10-i7-10-N505,,,,i7_10,ACAAGCTA,N505,CTCCTTAC,,,
F10-i7-10-N506,,,,i7_10,ACAAGCTA,N506,TATGCAGT,,,
G10-i7-10-N507,,,,i7_10,ACAAGCTA,N507,TACTCCTT,,,
H10-i7-10-N508,,,,i7_10,ACAAGCTA,N508,AGGCTTAG,,,
X10-i7-10-N517,,,,i7_10,ACAAGCTA,N517,TCTTACGC,,,
A11-i7-11-N501,,,,i7_11,CTGTAGCC,N501,GCGATCTA,,,
B11-i7-11-N502,,,,i7_11,CTGTAGCC,N502,ATAGAGAG,,,
C11-i7-11-N503,,,,i7_11,CTGTAGCC,N503,AGAGGATA,,,
D11-i7-11-N504,,,,i7_11,CTGTAGCC,N504,TCTACTCT,,,
E11-i7-11-N505,,,,i7_11,CTGTAGCC,N505,CTCCTTAC,,,
F11-i7-11-N506,,,,i7_11,CTGTAGCC,N506,TATGCAGT,,,
G11-i7-11-N507,,,,i7_11,CTGTAGCC,N507,TACTCCTT,,,
H11-i7-11-N508,,,,i7_11,CTGTAGCC,N508,AGGCTTAG,,,
X11-i7-11-N517,,,,i7_11,CTGTAGCC,N517,TCTTACGC,,,
A12-i7-12-N501,,,,i7_12,AGTACAAG,N501,GCGATCTA,,,
B12-i7-12-N502,,,,i7_12,AGTACAAG,N502,ATAGAGAG,,,
C12-i7-12-N503,,,,i7_12,AGTACAAG,N503,AGAGGATA,,,
D12-i7-12-N504,,,,i7_12,AGTACAAG,N504,TCTACTCT,,,
E12-i7-12-N505,,,,i7_12,AGTACAAG,N505,CTCCTTAC,,,
F12-i7-12-N506,,,,i7_12,AGTACAAG,N506,TATGCAGT,,,
G12-i7-12-N507,,,,i7_12,AGTACAAG,N507,TACTCCTT,,,
H12-i7-12-N508,,,,i7_12,AGTACAAG,N508,AGGCTTAG,,,
X12-i7-12-N517,,,,i7_12,AGTACAAG,N517,TCTTACGC,,,
23 changes: 23 additions & 0 deletions modules/adapter_trimming.nf
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,7 @@ process fastp_adapter_trim {
// 0.23.0 enables deterministic results, which is crucial
module 'fastp/0.21.0'
cpus 3
scratch false

input:
tuple path(r1), path(r2), val(adapterP5), val(adapterP7)
Expand All @@ -31,8 +32,30 @@ process fastp_adapter_trim {
"""
}

process agent_adapter_trim {
cpus 1
scratch false
module "jdk/11.0.16"

input:
tuple path(r1), path(r2)

output:
path 'output_R?.fastq.gz', emit: fastq

script:
"""
agent.sh trim -v2 \
-fq1 "${r1}" \
-fq2 "${r2}" \
-out \$PWD/output
"""
}

/// Our custom in-house adapter-trimming script
process adapter_trim {
cpus 3
scratch false
input:
tuple path(r1), path(r2), val(adapterP5), val(adapterP7)

Expand Down
Loading