This repository is the bioinformatics toolbox for Janis. It contains tools and data types directly related to the bioinformatics field.
You can see a full list of tools in the documentation.
The data types are a way of encapsulating information about the file (including secondary files), and it allows clarity when connecting inputs and steps together. Sometimes bioinformatics data types have associated files (like an indexed bam, or a fasta with various indexes); when you use these types, Janis will bundle your files together to be transported around.
Further information: Secondary / Accessory files
Janis is opinionated about the .bai index for a Bam. Specifically we use the pattern:
mysample.bammysample.bam.bai
If your tool expects, or creates a file in the other common format (.bam and .bai), you can use the secondaries_present_as attribute on a:
ToolInputto localise the index using a specific format (see: )ToolOutputto prepare your input for .
Documentation is generated on Janis. To generate new documentation you will need to:
- Commit your changes here,
- Update the submodule pointer on Janis,
- Checkout Janis (recursively),
- Run the regenerate script
janis/docs/regeneratedocumentation.py, - Commit these changes and the documentation will autobuild on ReadTheDocs.