cc_staff
284
edits
m (Note on zcat) |
(Marked this version for translation) |
||
Line 45: | Line 45: | ||
Blast search can greatly benefit from data parallelism by splitting the query file into multiples queries and running these queries against the database. | Blast search can greatly benefit from data parallelism by splitting the query file into multiples queries and running these queries against the database. | ||
==== Preprocess ==== | ==== Preprocess ==== <!--T:16--> | ||
In order to accelerate the search, the <tt>seq.fa</tt> file must be split into smaller chunks. The file chunks should be at least <tt>1Mb</tt> or greater, but not '''smaller''' as it may hurt the parallel file system. | In order to accelerate the search, the <tt>seq.fa</tt> file must be split into smaller chunks. The file chunks should be at least <tt>1Mb</tt> or greater, but not '''smaller''' as it may hurt the parallel file system. | ||
Line 124: | Line 124: | ||
}} | }} | ||
<!--T:33--> | |||
Note: the <tt>zcat</tt> utility could be used here to read a gzip compressed FASTA file and pass its decompressed content to GNU Parallel. | Note: the <tt>zcat</tt> utility could be used here to read a gzip compressed FASTA file and pass its decompressed content to GNU Parallel. | ||