This tool predicts transporter proteins.
Input: proteins sequences in Fasta format Output: the predicted location,1=membrane, 0=non-membrane
The training dataset can be found here
There are a number of folders that support the running of TooT-M and its outputs.
Contains the homology details needed to extract the features. Details of the psiBlast hits for each sequence is found here.
intermediate_files/Compostions: Contains the extracted MSA_TPAAC features of test set
Contains the database to be used when performing BLAST.
The scripts needed to use the tool.
- This tool requires that
BLASTbe pre-installed - Usage:
Rscript src/TooT_M.R -query=<input> [-TooTM=<TooTMdir>] [-out=<outdir>] [-db=<path to db>][-topcons2=<TOPCONS2 results path>] <input>is your sequence input file in fasta format<out>is the output directory where you want the predicted results, formatted as csv<TooTMdir>is the directory where the base TooT-M files are located<path to db> is the directory where the database is stored<TOPCONS2 results path> is the directory where TOPCONS2 results (.txt) is stored; results can be obtained by there available website [here](https://tootsuite.encs.concordia.ca/datasets/membrane/) or by downloading the tool as described [here](https://github.com/ElofssonLab/TOPCONS2)- Pse-PSSM features (lambda in (0,1,...49) of each sequence in the test set is found under intermediate_files/Compositions/Testing/