> 2D finite difference can be comm intensive if the mesh is too small for each
> processor to have a fair amount of work to do before needing the neighboring
> values from a "far" node.

Actually, it seems that with the VX50 the same node may be the "far" node. At
least that is what I see from the NUMA analysis test on the TAU Wiki:

http://www.nic.uoregon.edu/tau-wiki/Guide:Opteron_NUMA_Analysis

I do not have the numbers at hand, but that was my impression.

> How do you identify the specific instruction using a profiler? This is
> something that interests me.

I am using the Performance Analyzer that comes with Sun Studio 12. It provides
a per-instruction profile view of the disassembly.
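
To put rough numbers on the quoted point about small meshes, here is a minimal sketch. It assumes a square n x n block of cells per processor, a 5-point stencil, and a one-cell-wide halo on each of the four sides. The function name and these assumptions are mine for illustration only; they are not taken from the code under discussion.

```python
def halo_to_work_ratio(n: int) -> float:
    """Ratio of halo cells fetched to interior cells updated per step.

    Assumes an n x n block per processor and a one-cell-wide halo on
    each of the four sides (5-point stencil, corners not needed).
    """
    if n <= 0:
        raise ValueError("n must be a positive block edge length")
    halo_cells = 4 * n      # one row or column of neighbor values per side
    work_cells = n * n      # cells this processor updates each step
    return halo_cells / work_cells


if __name__ == "__main__":
    # Print the ratio for a few block sizes; it falls off as 4 / n.
    for n in (4, 16, 64, 256):
        print(n, halo_to_work_ratio(n))
```

Under these assumptions the ratio is 4/n, so a very small block spends proportionally far more effort on neighbor values than a large one. It says nothing about how expensive each remote access is on a given NUMA machine.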