Allow classification and reclassification to run as modules#109
Open
rmcolq wants to merge 5 commits intoartic-network:mainfrom
Open
Allow classification and reclassification to run as modules#109rmcolq wants to merge 5 commits intoartic-network:mainfrom
rmcolq wants to merge 5 commits intoartic-network:mainfrom
Conversation
…ganisms and viruses
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Changes include:
kraken_classificationmodule includes optional viral reclassificationkraken_reclassificationallows you to only reclassifyThis has included pulling out the
reclassifypart of theclassifyworkflow and calling it fromclassify.Example commands:
Run basic kraken classification with default db
Run reclassification, passing in the assignments/report from the first classification
Chain these steps together, extracting and passing in the viral and unclassified fraction to the second step. NB this is slightly different than running the above 2 steps as they pass the entire fastq to the second step.
NB this commit includes small bugfixes to
merge.pyandreport.pyto fix errors which were occurring when the reclassification was on the whole file, rather than just the viral+unclassified subset of the file. These scripts are mirrored inhttps://github.com/rmcolq/krakenpywhere new tests have been added to reflect/test these changes.NB2 to run on climb will probably want to specify
--kraken_database.default.host--kraken_database.default.portand--kraken_database.default.pathor the equivalent flags withviralinstead ofdefaultwhen there is an active kraken2 viral server