Re: [Numpy-discussion] parallel numpy (by Brian Granger) - any info?

2008-01-08 Thread dmitrey
Matthieu Brucher wrote: Matlab surely relies on MKL to do this (Matlab ships with MKL or ACML now). The latest Intel library handles multiprocessing, so if you want to use multithreading, use MKL (and it can handle quad-cores with no sweat). So Numpy is multithreaded. I have AMD processor

Re: [Numpy-discussion] parallel numpy (by Brian Granger) - any info?

2008-01-08 Thread Matthieu Brucher
MKL does the multithreading on its own for level 3 BLAS instructions (OpenMP). For ACML, the problem is that AMD does not provide a CBLAS interface and is not interested in doing so. With ACML, the compilation fails with the current Numpy, but hopefully with Scons it will work, at least

Re: [Numpy-discussion] parallel numpy (by Brian Granger) - any info?

2008-01-08 Thread Kevin Jacobs [EMAIL PROTECTED]
On 1/8/08, Matthieu Brucher [EMAIL PROTECTED] wrote: I have AMD processor so I guess I should use ACML somehow instead. However, at 1st I would prefer my code to be platform-independent, and at 2nd unfortunately I haven't encountered in numpy documentation (in website scipy.org and

Re: [Numpy-discussion] parallel numpy (by Brian Granger) - any info?

2008-01-08 Thread David Cournapeau
Matthieu Brucher wrote: MKL does the multithreading on its own for level 3 BLAS instructions (OpenMP). For ACML, the problem is that AMD does not provide a CBLAS interface and is not interested in doing so. With ACML, the compilation fails with the current

Re: [Numpy-discussion] parallel numpy (by Brian Granger) - any info?

2008-01-08 Thread Brian Granger
As others have mentioned, the quickest and easiest way of getting these things is to build numpy against a LAPACK/BLAS that has threading support enabled. I have not played with this, but there is no reason it shouldn't work out of the box. On Jan 7, 2008 2:26 PM, dmitrey [EMAIL PROTECTED]

Re: [Numpy-discussion] parallel numpy (by Brian Granger) - any info?

2008-01-08 Thread Brian Granger
Yes, the problem in this implementation is that it uses pthreads for synchronization instead of spin locks with a work pool implementation tailored to numpy. The thread synchronization overhead is horrible (300,000-400,000 clock cycles) and swamps anything other than very large arrays. I

Re: [Numpy-discussion] parallel numpy (by Brian Granger) - any info?

2008-01-08 Thread Brian Granger
On Jan 8, 2008 3:33 AM, Matthieu Brucher [EMAIL PROTECTED] wrote: I have AMD processor so I guess I should use ACML somehow instead. However, at 1st I would prefer my code to be platform-independent, and at 2nd unfortunately I haven't encountered in numpy documentation (in website

[Numpy-discussion] parallel numpy (by Brian Granger) - any info?

2008-01-07 Thread dmitrey
Some days ago there was mentioned a parallel numpy that is developed by Brian Granger. Does the project have any blog or website? Has it any description about API and abilities? When 1st release is intended? Regards, D ___ Numpy-discussion mailing

Re: [Numpy-discussion] parallel numpy (by Brian Granger) - any info?

2008-01-07 Thread Brian Granger
Dmitrey, This work is being funded by a new NASA grant that I have at Tech-X Corporation where I work. The grant officially begins as of Jan 18th, so not much has been done as of this point. We have however been having some design discussions with various people. Here is a broad overview of

Re: [Numpy-discussion] parallel numpy (by Brian Granger) - any info?

2008-01-07 Thread dmitrey
The only one thing I'm very interested in for now - why the most simplest matrix operations are not implemented to be parallel in numpy yet (for several-CPU computers, like my AMD Athlon X2). First of all it's related to matrix multiplication and devision, either point or matrix (i.e. like

Re: [Numpy-discussion] parallel numpy (by Brian Granger) - any info?

2008-01-07 Thread Robert Kern
dmitrey wrote: The only one thing I'm very interested in for now - why the most simplest matrix operations are not implemented to be parallel in numpy yet (for several-CPU computers, like my AMD Athlon X2). First of all it's related to matrix multiplication and devision, either point or

Re: [Numpy-discussion] parallel numpy (by Brian Granger) - any info?

2008-01-07 Thread Matthieu Brucher
2008/1/7, dmitrey [EMAIL PROTECTED]: The only one thing I'm very interested in for now - why the most simplest matrix operations are not implemented to be parallel in numpy yet (for several-CPU computers, like my AMD Athlon X2). First of all it's related to matrix multiplication and devision,

Re: [Numpy-discussion] parallel numpy (by Brian Granger) - any info?

2008-01-07 Thread eric jones
Robert Kern wrote: dmitrey wrote: The only one thing I'm very interested in for now - why the most simplest matrix operations are not implemented to be parallel in numpy yet (for several-CPU computers, like my AMD Athlon X2). First of all it's related to matrix multiplication and

Re: [Numpy-discussion] parallel numpy (by Brian Granger) - any info?

2008-01-07 Thread Andrew Straw
dmitrey wrote: The only one thing I'm very interested in for now - why the most simplest matrix operations are not implemented to be parallel in numpy yet (for several-CPU computers, like my AMD Athlon X2). For what it's worth, sometimes I *want* my numpy operations to happen only on one

Re: [Numpy-discussion] parallel numpy (by Brian Granger) - any info?

2008-01-07 Thread David Cournapeau
Andrew Straw wrote: dmitrey wrote: The only one thing I'm very interested in for now - why the most simplest matrix operations are not implemented to be parallel in numpy yet (for several-CPU computers, like my AMD Athlon X2). For what it's worth, sometimes I *want* my numpy