On Nov 16, 2008, at 12:47 PM, Mark Abraham wrote:

Apologies to people receiving this from the FFTW list too, but it should be of interest to a large slice of people using GROMACS. In particular, the improvements for SIMD and size of multiples of 5/10 should be of value for lots of people.

Does anyone (Carsten?) have an off-the-cuff opinion on whether the latter means that FFT grids with dimensions that are multiples of 5 might now be more desirable? How might that affect PME load-splitting?

Hi,

I fear that the performance gain in mdrun due to a faster FFT will not be too large - of course that depends on how 'bad' the performance was before for multiples of 5. The reason is that a significant amount of the time used for transforming to and from reciprocal space is actually communication time: MPI transposes of the FFT data, while the calculation itself is not so time consuming for a normal-sized MD system. Since these transposes are done in Gromacs anyway, the only thing that could be faster with the new FFTW library is the calculation itself - which should not make a big difference in the overall mdrun performance. But why not compare the old and new FFTW performance with a test MD system? I would choose zero PME nodes for that test, just to be shure that a possible FFTW performance gain does not get hidden behind a PME/PP load imbalance.

Carsten



Mark

-------- Original Message --------
Subject: [FFTW-announce] FFTW 3.2 is released
Date: Sat, 15 Nov 2008 22:01:44 -0500 (EST)
From: Steven G. Johnson <[EMAIL PROTECTED]>
Reply-To: Steven G. Johnson <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]

Dear FFTW users,

The official release of FFTW 3.2 is now available from the FFTW home page:

        http://www.fftw.org/

This release does *not* contain the experimental MPI transfroms from FFTW 3.2alpha; as those routines are still in alpha, they have been moved to FFTW
3.3alpha1 (also available from the FFTW web page).

Compared to version 3.1.3 of FFTW, highlights of the FFTW 3.2 release include:

* Significantly faster r2c/c2r transforms, especially on machines
 (e.g. x86) with SIMD instruction.

* Performance improvements for sizes with factors of 5 and 10.

* Rewritten threading support for improved performance.

* Support for the Cell processor, donated by IBM Research; see
 README.Cell and the Cell section of the manual.

* Support for MIPS paired-single SIMD instructions, donated by
 Codesourcery.

The complete release notes can be found in the NEWS and ChangeLog files of the
release, and at:

        http://fftw.org/release-notes

Specific changes compared to FFTW 3.2alpha3 are listed below.

Regards,
Steven G. Johnson and Matteo Frigo

FFTW 3.2 changes from 3.2alpha3

* Worked around apparent glibc bug that leads to rare hangs when freeing
 semaphores.

* Fixed segfault due to unaligned access in certain obscure problems
 that use SSE and multiple threads.

* MPI transforms not included, as they are still in alpha; the alpha
 versions of the MPI transforms have been moved to FFTW 3.3alpha1.

_______________________________________________
FFTW-announce mailing list
[EMAIL PROTECTED]
http://fftw.org/cgi-bin/mailman/listinfo/fftw-announce

_______________________________________________
gmx-users mailing list    [email protected]
http://www.gromacs.org/mailman/listinfo/gmx-users
Please search the archive at http://www.gromacs.org/search before posting! Please don't post (un)subscribe requests to the list. Use the www interface or send it to [EMAIL PROTECTED]
Can't post? Read http://www.gromacs.org/mailing_lists/users.php

Carsten Kutzner
[EMAIL PROTECTED]



_______________________________________________
gmx-users mailing list    [email protected]
http://www.gromacs.org/mailman/listinfo/gmx-users
Please search the archive at http://www.gromacs.org/search before posting!
Please don't post (un)subscribe requests to the list. Use the www interface or send it to [EMAIL PROTECTED]
Can't post? Read http://www.gromacs.org/mailing_lists/users.php

Reply via email to