Re: [petsc-dev] PETSc release?

Karl Rupp Mon, 24 Jul 2017 13:10:01 -0700

Hi Richard,

Karl: 1) thanks for the summary, and 2) please let me know when you think your section of the manual is in a state in which I can look at it and make some contributions. One thing I'd like to write about (if you haven't already covered it) is some basic info about controlling MPI process placement/pinning and the (sometimes surprisingly large) effects it can have on performance. This is getting a lot more complicated as systems add more NUMA domains and hardware threads. When I was at Intel I encountered a ton of performance problems that were mostly due to bad process placement (which, fortunately, meant they were actually easy to fix!).


sure, thanks. Expect to receive a bunch of text by tomorrow evening.

Regarding processor placement: I was running `make streams` to collect data for the manual chapter. It turned out that the first N/2 processes were indeed placed on the first socket only, so the (N/2+1)-th process added a significant boost in achieved overall bandwidth. This should make for a nice illustration of the subject in the manual. Also, it will benefit users if we get the processor mapping right in `make streams`, so that the output is more in line with our recommendation of "just a few processes per node to saturate memory bandwidth".
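
To make the effect concrete, here is a rough toy model (a sketch only: the socket count, core count, and bandwidth figures are made-up assumptions for illustration, not measurements from `make streams`, and `aggregate_bandwidth` is a hypothetical helper, not part of PETSc). It assumes each process can draw a fixed bandwidth until its socket's memory controller saturates:

```python
def aggregate_bandwidth(nprocs, placement, sockets=2, cores_per_socket=8,
                        per_proc_bw=10.0, socket_peak_bw=40.0):
    """Toy estimate of total achieved memory bandwidth (arbitrary units).

    All default numbers are illustrative assumptions, not measured data.
    placement: "fill"        -> fill socket 0 completely, then socket 1, ...
               "round_robin" -> alternate sockets rank by rank
    """
    if not 0 <= nprocs <= sockets * cores_per_socket:
        raise ValueError("nprocs must fit on the available cores")

    # Count how many processes land on each socket.
    counts = [0] * sockets
    for rank in range(nprocs):
        if placement == "fill":
            socket = rank // cores_per_socket
        elif placement == "round_robin":
            socket = rank % sockets
        else:
            raise ValueError("unknown placement: %r" % placement)
        counts[socket] += 1

    # Each socket delivers at most its peak, no matter how many processes use it.
    return sum(min(c * per_proc_bw, socket_peak_bw) for c in counts)


if __name__ == "__main__":
    for n in (4, 8, 9, 16):
        print(n,
              aggregate_bandwidth(n, "fill"),
              aggregate_bandwidth(n, "round_robin"))
```

With N = 16 cores in this model, the "fill" placement stays flat once the first socket is saturated and only rises again at process N/2+1 = 9, whereas the round-robin placement uses both memory controllers from the second process on.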


Best regards,
Karli
