BLAST: Difference between revisions

11 bytes added, 5 years ago
Marked this version for translation
m (Note on zcat)
(Marked this version for translation)
Line 45: Line 45:
A BLAST search can benefit greatly from data parallelism: the query file is split into multiple smaller queries, and each of these is run against the database.


==== Preprocess ==== <!--T:16-->
To accelerate the search, the <tt>seq.fa</tt> file must be split into smaller chunks. Each chunk should be at least <tt>1MB</tt> in size; '''smaller''' chunks may hurt the performance of the parallel file system.
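
The splitting step described above can be sketched in Python as follows. This is only a minimal illustration, not the method used elsewhere on this page: the function names <tt>read_fasta_records</tt> and <tt>split_fasta</tt>, the <tt>chunk_NNNN.fa</tt> output naming, and the default threshold of 1,000,000 bytes are all assumptions made for the example. Records are never cut in the middle, and a trailing remainder is appended to the last chunk so that no chunk falls below the threshold (unless the whole input is smaller than it).

```python
from pathlib import Path
from typing import Iterator, List


def read_fasta_records(path: str) -> Iterator[str]:
    """Yield each FASTA record (header line plus its sequence lines) as one string."""
    record: List[str] = []
    with open(path, "r") as handle:
        for line in handle:
            # A new header closes the record collected so far.
            if line.startswith(">") and record:
                yield "".join(record)
                record = []
            record.append(line)
    if record:
        yield "".join(record)


def split_fasta(path: str, out_dir: str, min_bytes: int = 1_000_000) -> List[Path]:
    """Split a FASTA file into chunk files of at least min_bytes each.

    Hypothetical helper: file naming and the default threshold are assumptions.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    chunks: List[Path] = []
    buffer: List[str] = []
    size = 0

    for record in read_fasta_records(path):
        buffer.append(record)
        size += len(record.encode())
        # Close the chunk only at a record boundary, once it is large enough.
        if size >= min_bytes:
            chunk_path = out / f"chunk_{len(chunks):04d}.fa"
            chunk_path.write_text("".join(buffer))
            chunks.append(chunk_path)
            buffer, size = [], 0

    if buffer:
        if chunks:
            # Append the undersized remainder to the last chunk.
            with open(chunks[-1], "a") as handle:
                handle.write("".join(buffer))
        else:
            # The whole input is smaller than min_bytes: write a single chunk.
            chunk_path = out / "chunk_0000.fa"
            chunk_path.write_text("".join(buffer))
            chunks.append(chunk_path)
    return chunks
```

Calling <tt>split_fasta("seq.fa", "chunks")</tt> would then produce files such as <tt>chunks/chunk_0000.fa</tt>, each of which can be searched as an independent query.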


Line 124: Line 124:
}}
}}


<!--T:33-->
Note: the <tt>zcat</tt> utility can be used here to read a gzip-compressed FASTA file and pass its decompressed content to GNU Parallel.
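
For readers who prefer to do the decompression from a script, the behaviour of <tt>zcat</tt> can be approximated with Python's standard <tt>gzip</tt> module. The sketch below is an illustration only; the function name <tt>stream_gzip_fasta</tt> and the file name <tt>seq.fa.gz</tt> are hypothetical. It decompresses the file in a streaming fashion and copies the bytes to standard output (or to any binary stream passed as <tt>out</tt>), so the result can be piped to another program.

```python
import gzip
import shutil
import sys
from typing import BinaryIO, Optional


def stream_gzip_fasta(path: str, out: Optional[BinaryIO] = None) -> None:
    """Decompress a gzip-compressed file and copy its content to a binary stream.

    Hypothetical helper that mimics zcat; writes to standard output by default.
    """
    if out is None:
        out = sys.stdout.buffer
    with gzip.open(path, "rb") as handle:
        # copyfileobj streams in blocks, so the whole file is never held in memory.
        shutil.copyfileobj(handle, out)
```

For example, <tt>stream_gzip_fasta("seq.fa.gz")</tt> writes the decompressed FASTA records to standard output.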

