SEALS
(A System for Easy Analysis of Lots of Sequences) is a software package designed for large-scale data analysis in bioinformatics. Using a friendly command-line user interface, SEALS allows to automate many repetitive tasks as demonstrated below. |
1. Move to TCSH
tcsh
2. Activate SEALS
activate_seals
blastall -i my_protein.txt -d /flower/db/nr -p blastp -o results.br -I T
blast2blast results.br -ecut e-50 -save
blast2gi results_mod.br -save
gi2genbank results_mod.gi -timeout 50 -tries 0 -append
feature2fasta results_mod.gb -defline '$gi|$organism|$definition' -save
fauniq results_mod.fa -basis=sequence -save
clustalw results_mod_uniq.fa -align -type=protein
clustalw results_mod_uniq.aln -tree -kimura -tossgaps
blastall -i my_proteins.txt -d /flower/db/nr -p blastp -o results.br -I T
blast2blast results.br -ecut e-50 -save -delete
shatterblast results_mod.br -word 2
rm results_mod.br
blast2gi *.br -save
gi2genbank *.gi -timeout 50 -tries 0 -append
feature2fasta *.gb -defline '$gi|$organism|$definition' -save
fauniq *.fa -basis sequence -save
foreach_file *_uniq.fa -command='clustalw -infile={} -align -type=protein';
foreach_file *.aln -command='clustalw -infile={} -tree -kimura -tossgaps';
All commands separated by semicolons can be placed into a file (a script file, e.g. named script.sh) and than all you have to type is
./script.sh(!!!)