Now that my code is running, in case anyone is still following this thread: 
is it possible that there has been a regression in parallel speed in 0.4? I 
recall that running my code in 0.3.11 resulted in a more or less linear 
speedup on my local machine (i.e. adding seven worker processes on a machine 
with 4 physical cores improved the runtime by about a factor of 3), while 
in 0.4 the same ratio is a little under 2.45. 
