Hi all,

I’m testing my application on a SMP workstation (dual Intel Xeon E5-2697 V4 2.3 
GHz Intel Broadwell (boost 2.8-3.1GHz) processors 128GB RAM) and am seeing a 4x 
performance drop compared to a cluster system with 2.6GHz Intel Haswell with 20 
cores / node and 128GB RAM/node.  Both applications have been compiled using 
OpenMPI 1.6.4.  I have tried running:

mpirun -np 20 $EXECUTABLE $INPUT_FILE
mpirun -np 20 --mca btl self,sm $EXECUTABLE $INPUT_FILE

and others, but cannot achieve the same performance on the workstation as is 
seen on the cluster.  The workstation outperforms on other non-MPI but 
multi-threaded applications, so I don’t think it’s a hardware issue.

Any help you can provide would be appreciated.

Thanks,
cap79
_______________________________________________
users mailing list
users@lists.open-mpi.org
https://rfd.newmexicoconsortium.org/mailman/listinfo/users

Reply via email to