I'm getting poor performance on OpenMPI tasks on a new AMD 7401P EPYC
server. I suspect hwloc providing a poor topology may have something to do
with it as I receive this warning below when creating a job.
Requested data files available at http://static.skysight.io/out.tgz
Cheers,
Matthew
Following up on this,
Indeed with a recent kernel the error message goes away.
The poor performance stays though (a few percent difference between 4.13
and 4.15rc5), and I'm at a loss as to whether it's related to MPI or not.
I see oddities such as locking the job to the first 12 cores yield 100%
g