Hi Andreas,

> To make your life easier, I've just merged complex
> number support into PyCUDA proper.

It turned out pycudafft did not really need it, because I only passed
pointers to complex numbers to kernels, not complex numbers by value.
But thank you anyway, I needed this for my main PhD project. I tested
pycudafft against mainstream PyCUDA, and it seems to work fine.

> If you'd like pycudafft to be part of PyCUDA itself, we can
> discuss how that could happen.

I am not sure it is necessary. The *nix philosophy favors keeping
functionality in separate packages. You would also have to add the Mako
templating engine as a dependency of PyCUDA. But the final decision
about the architecture of your package is of course yours; it is not a
problem for me to put together the corresponding patch. I have already
changed the plan interface a little so that it takes shape/dtype
parameters in the same way as numpy arrays and pycuda.gpuarray arrays.

> Otherwise, please let me know if there's anything you need from PyCUDA.

Well, a few weeks ago I would have asked you to add compiler output
redirection, because I was getting lots of warnings from the FFT
kernels. But I fixed this issue right before redirection appeared in
mainstream, so currently I have no other complaints about PyCUDA. It
just works!

Best regards,
Bogdan

On Tue, Mar 2, 2010 at 6:51 AM, Andreas Klöckner <[email protected]> wrote:
> Hi Bogdan,
>
> On Sunday 14 February 2010, Bogdan Opanchuk wrote:
>> The project I am working on relies heavily on batched 3D FFTs. You all
>> know about the situation with CUFFT and PyCuda, and I decided that I
>> must put some effort in it. So, I ported Apple's OpenCL implementation
>> of FFT to PyCuda. The result you can see on
>> http://pypi.python.org/pypi/pycudafft . It is currently in beta stage,
>> but I will work on it - in case somebody needs it. It works with
>> experimental PyCuda branch, the one with complex numbers support.
>>
>> In addition, the package contains CUFFT wrapper by Ying Wai (Daniel)
>> Fan (it appeared in this mail list, I just added class for plan and
>> batch support), if you prefer nVidia's implementation.
>> I used it just to test my code.
>>
>> Main problems at the moment:
>> - On some problem sizes it is much slower than CUFFT (see table on PyPi
>> page).
>> - The library requires heavy testing for different problem sizes
>> and videocards
>>
>> For other plans, see TODO.txt in package.
>> [snip]
>>
>> To sum it all up: the development of this library depends mainly on
>> your reaction. Any comments/bug reports/propositions are appreciated.
>> For my personal purposes it works fine already )
>
> This looks like awesome work. Thanks for making it available. To make
> your life easier, I've just merged complex number support into PyCUDA
> proper. If you'd like pycudafft to be part of PyCUDA itself, we can
> discuss how that could happen. Otherwise, please let me know if there's
> anything you need from PyCUDA.
>
> Thanks again for your work,
> Andreas
>
> _______________________________________________
> PyCUDA mailing list
> [email protected]
> http://host304.hostmonster.com/mailman/listinfo/pycuda_tiker.net
