Hi Andreas,

> To make your life easier, I've just merged complex
> number support into PyCUDA proper.

It turned out pycudafft did not really need it, because I only passed
pointers to complex numbers to kernels, not complex numbers by value.
But thank you anyway, I needed this for my main phd project. I tested
pycudafft against mainstream pycuda, it seem to work fine.

> If you'd like pycudafft to be part of PyCUDA itself, we can
> discuss how that could happen.

I am not sure it is necessary. There is *nix ideology, which favors
separated functionality. And you will have to add mako templating
engine as a dependency for pycuda. But the final decision about
architecture of your package is on you of course; it is not a problem
for me to compose corresponding patch. I already changed plan
interface a little in order to make it use shape/dtype parameters in a
same way as numpy arrays and pycuda.gpuarrays.

> Otherwise, please let me know if there's anything you need from PyCUDA.

Well, a few weeks ago I would ask you to add compiler output
redirection, because I had lots of warnings from fft kernels. But I
fixed this issue right before redirection appeared in mainstream, so
currently I have no other complaints about pycuda. It just works!

Best regards,
Bogdan

On Tue, Mar 2, 2010 at 6:51 AM, Andreas Klöckner
<[email protected]> wrote:
> Hi Bogdan,
>
> On Sonntag 14 Februar 2010, Bogdan Opanchuk wrote:
>> The project I am working on relies heavily on batched 3D FFTs. You all
>> know about the situation with CUFFT and PyCuda, and I decided that I
>> must put some effort in it. So, I ported Apple's OpenCL implementation
>> of FFT to PyCuda. The result you can see on
>> http://pypi.python.org/pypi/pycudafft . It is currently in beta stage,
>> but I will work on it - in case somebody needs it. It works with
>> experimental PyCuda branch, the one with complex numbers support.
>>
>> In addition, the package contains CUFFT wrapper by Ying Wai (Daniel)
>> Fan (it appeared in this mail list, I just added class for plan and
>> batch support), if you prefer nVidia's implementation. I used it just
>> to test my code.
>>
>> Main problems at the moment:
>> - On some problem sizes it is much slower than CUFFT (see table on PyPi
>>  page). - The library requires heavy testing for different problem sizes
>>  and videocards For other plans, see TODO.txt in package.
>> [snip]
>>
>> To sum it all up: the development of this library depends mainly on
>> your reaction. Any comments/bug reports/propositions are appreciated.
>> For my personal purposes it works fine already )
>
> This looks like awesome work. Thanks for making it available. To make
> your life easier, I've just merged complex number support into PyCUDA
> proper. If you'd like pycudafft to be part of PyCUDA itself, we can
> discuss how that could happen. Otherwise, please let me know if there's
> anything you need from PyCUDA.
>
> Thanks again for your work,
> Andreas
>
>

_______________________________________________
PyCUDA mailing list
[email protected]
http://host304.hostmonster.com/mailman/listinfo/pycuda_tiker.net

Reply via email to