Hi all, I’m testing my application on a SMP workstation (dual Intel Xeon E5-2697 V4 2.3 GHz Intel Broadwell (boost 2.8-3.1GHz) processors 128GB RAM) and am seeing a 4x performance drop compared to a cluster system with 2.6GHz Intel Haswell with 20 cores / node and 128GB RAM/node. Both applications have been compiled using OpenMPI 1.6.4. I have tried running:
mpirun -np 20 $EXECUTABLE $INPUT_FILE mpirun -np 20 --mca btl self,sm $EXECUTABLE $INPUT_FILE and others, but cannot achieve the same performance on the workstation as is seen on the cluster. The workstation outperforms on other non-MPI but multi-threaded applications, so I don’t think it’s a hardware issue. Any help you can provide would be appreciated. Thanks, cap79 _______________________________________________ users mailing list users@lists.open-mpi.org https://rfd.newmexicoconsortium.org/mailman/listinfo/users