>
> 2D finite difference can be comm intensive is the mesh is too small for each
> processor to have a fair amount of work to do before needing the neighboring
> values from a "far" node.
>

Actually it seems that with VX50 the same node may be the "far" node.
At least that's what I see
from the NUMA Analysis test from TAU Wiki:
http://www.nic.uoregon.edu/tau-wiki/Guide:Opteron_NUMA_Analysis

Do not have the numbers at hand but that was the impression.

>
> How do you identify the specific instruction using a profiler, this is
> something that interests me.
>

I am using the Performance Analyzer that comes with Sun Studio 12. It
provides a per instruction profile view of the disassembly.
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Reply via email to