Re: [Pdl-porters] Multithreading passes on my machine; suspicious about PLplot

John Cerney Mon, 16 May 2011 06:14:44 -0700


On 5/14/2011 8:12 AM, David Mertens wrote:

Hello everybody -

I recently pulled in the pthreading patch that John just pushed. I am
blown away that John was able to go deep into the internals and make
automated pthreading work without any additional user effort. Amazing!

A lot of the effort was just cleaning up and adding an interface toprevious work by Christian Soeller for the add_threading_magic function.

The standard tests compiled and ran on my laptop. I then ran the tests
with forced pthreading using (in bash)

$ PDL_AUTOPTHREAD_TARG=4 PDL_AUTOPTHREAD_SIZE=0 make --no-print-directory test

All of these passed, too. However, PLplot's tests *should not* have
passed. PLplot's C-implementation uses a global state machine and the
low-level plotting commands thread over piddle arguments. I don't
believe it's possible for that to be pthread-safe. However, there are
no tests to check against this, since until now all PDL threading has
run on a single pthread. I don't believe this is likely to cause
trouble with PLplot from most use cases. John explicitly recommends
against a pthread size of 0, and a pthread size of 1 refers to 1
megabyte of data. (Who would try plotting megabytes of data on one
plot?) However, almost certainly somebody is going to set a pthread
size of 0 in their environment variables, just for fun, and then plots
using a PLOTTYPE of POINTS and with various colors will inexplicably
begin to fail.

To help see what is going on with PLplot, It would help to break it downto a small test case where the input data has some extra threadeddimensions, then run with the PDL_AUTOPTHREAD_TARG=4PDL_AUTOPTHREAD_SIZE=0 arguments, and then call theget_autopthread_actual() to verify the number of pthreads that wereactually executed.

When developing the processor multi-threading patch, I found that thefft and the pnmout functions were not thread-safe. They would sometimescrash (segfault) when running the test cases, but not always. It ispossible that PLplot test cases would have to be run multiple times tosee any problems.

In any case, if PLplot is known to be not thread-safe, then it is bestto put the NoPthread flag in the PPcode for the PLplot package so thatpthreading won't be attempted. This is what was done for the fft and thepnmout functions.


Also, some clarifications from your message above:

John explicitly recommends
against a pthread size of 0, and a pthread size of 1 refers to 1
megabyte of data.

There is nothing wrong with running with PDL_AUTOPTHREAD_SIZE=0. Thewill make code attempt to do multiple pthreads on any size PDL. Sincethere is some overhead involved in creating multiple pthreads, thereprobably won't be a speed benefit to doing multiple pthreads small PDLarrays. However, I haven't done a whole lot of benchmarking to verify this.

Also the PDL_AUTOPTHREAD_SIZE refers to the number of elements in thePDL, not the size of the PDL in bytes. For example, doing azeroes(5000,5000) will create an double-precision array with about 24MegElements, but with a size in memory of 24*8 = 190Mbyes





_______________________________________________
PDL-porters mailing list
PDL-porters@jach.hawaii.edu
http://mailman.jach.hawaii.edu/mailman/listinfo/pdl-porters

Re: [Pdl-porters] Multithreading passes on my machine; suspicious about PLplot

Reply via email to