On 09/04/2016 10:00 PM, Dario Strbenac wrote:
Hello,

Functions such as vmatchPattern and vmatchPDict naturally lend
themselves to being parallelised. Could they be enhanced to accept a
BiocParallelParam object ? Or, is there no significant performance
difference using them as-is and having the bplapply loop surrounding
them and repeatedly calling DNAString (it's odd that vmatchPattern -
for searching BSgenome objects - requires a DNAString for the
pattern, rather than a DNAStringSet) or DNAStringSet ?

fwiw, and until there is direct support for this, probably one wants to look at bpvec() rather than bplapply() to parallelize these. Also, one would _not_ want to parallelize on the argument that becomes the PDict, because this part of the calculation scales favorably with number of elements.

Martin


-------------------------------------- Dario Strbenac University of
Sydney Camperdown NSW 2050 Australia

_______________________________________________
Bioc-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/bioc-devel



This email message may contain legally privileged and/or...{{dropped:2}}

_______________________________________________
Bioc-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/bioc-devel

Reply via email to