And we found that the code runs fine on Haswell. A KNL compiler bug not a
On Wed, Feb 14, 2018 at 3:58 PM, Mark Adams <mfad...@lbl.gov> wrote:
>> Your point about data decomposition is a good one. Even if you want to
>> run with threads, you must decompose your data intelligently
>> to get good performance. Can't you do the MPI shared work and still pass
>> it off as work necessary for threading anyway?
> We don't have any resources to change the code. Baky is an application PD
> and just has time and interest to work with me to optimize parameters. We
> are just grabbing low hanging fruit. Then we can see where we are and
> quantify the potential benefits of implementing a better data model.
>> What most experimenters take for granted before they begin their
>> experiments is infinitely more interesting than any results to which their
>> experiments lead.
>> -- Norbert Wiener
>> https://www.cse.buffalo.edu/~knepley/ <http://www.caam.rice.edu/~mk51/>