Hi,
today I tried to build openmpi-dev-685-g881b1dc on my machines
(Solaris 10 Sparc, Solaris 10 x86_64, and openSUSE Linux 12.1
x86_64) with gcc-4.9.2 and the new Solaris Studio 12.4 compilers.
I succedded on Linux but failed on both Solaris systems for both
compilers with the same error.
...
That is strange, not sure why that is happening. I will try to reproduce with
your program on my system. Also, perhaps you could rerun with –mca
mpi_common_cuda_verbose 100 and send me that output.
Thanks
From: users [mailto:users-boun...@open-mpi.org] On Behalf Of Xun Gong
Sent: Sunday,
On 12/17/2014 07:04 PM, Eric Chamberland wrote:
Hi!
Here is a "poor man's fix" that works for me (the idea is not from me,
thanks to Thomas H.):
#1- char* lCwd = getcwd(0,0);
#2- chdir(lPathToFile);
#3- MPI_File_open(...,lFileNameWithoutTooLongPath,...);
#4- chdir(lCwd);
#5- ...
I think
I think I found a bug in your program with how you were allocating the GPU
buffers. I will send you a version offlist with the fix.
Also, there is no need to rerun with the flags I had mentioned below.
Rolf
From: Rolf vandeVaart
Sent: Monday, January 12, 2015 9:38 AM
To: us...@open-mpi.org
Jeff,
thanks for all the good catches.
MPI_Type_create_resized is not required in this example because
send/recv are called
with count=1.
Generally speaking, if count > 1, MPI_Type_create_resized is required
because
the compiler might add some padding at the end of the type.
Cheers,
Gilles
Hi Brice,
Thanks for your reply. I will look into it.
Regards,
Pradeep
On 9 January 2015 at 10:42, Brice Goglin wrote:
> Hello
>
> Assuming the NUMA distance matrix is available, the distance between a CPU
> and a PCI device is basically the distance between the NUMA