Re: [Scikit-learn-general] parallel GMM

Sturla Molden Wed, 02 Jul 2014 19:51:25 -0700

On 03/07/14 04:33, Sturla Molden wrote:

> I would also like to say (while we're at it) that parallelizing this
> outside BLAS and LAPACK, whether with threads or processes, will require
> a memory overhead roughly equal to the size of the data array per thread
> or process. That is because computation of the covariance matrix and the
> likelihood needs to produce "X - mean" in a temporary array. This
> temporary array cannot be shared but should be reused within each thread
> or process.


... But if LAPACK and BLAS have full responsibility for the 
multithreading, this memory overhead is avoided.

On the other hand the parallelization inside BLAS and LAPACK will be 
more fine grained, and therefore achieve lower performance and scalability.



Sturla


------------------------------------------------------------------------------
Open source business process management suite built on Java and Eclipse
Turn processes into business applications with Bonita BPM Community Edition
Quickly connect people, data, and systems into organized workflows
Winner of BOSSIE, CODIE, OW2 and Gartner awards
http://p.sf.net/sfu/Bonitasoft
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Re: [Scikit-learn-general] parallel GMM

Reply via email to