A fast lightweight Windows GUI app that retrieves Identifiers from an imput fasta file
- Windows 11 or 10 version 16215.0 or higher(type
winveron the search button and clickwinverto find your OS version) - Windows Subsystem for linux(wsl1)
- the file in the
binfolder
Click on the FASTA IDs Retriever.exe file
Click browse and select an input file
Select whether or not you want to remove the ncbi/refseq/uniprot prefixes lcl|, gnl|, ref|, sp|, tr|
Select whether or not pipe(|) is used as an FASTA identifier/description seperator.
The default is space
Select the field number(which part to keep as ID after the pipe or space split of the fasta header). Default is 1
Click Run program
The example input/output files can be seen here
Input file for example 1 was downloaded from solgenomics.net tomato annotation version 4.1.
Input file for example 2 was downloaded from ncbi.
Input file for example 3 was downloaded from lotus base(MG20 v3.0 proteins file).
NOTE: This program also retrieves FASTA identifiers when no FASTA descriptions exist
