Ralph, Where would these be, in /dev? Chuck From: Ralph Castain [mailto:r...@open-mpi.org] Sent: Thursday, December 17, 2015 04:13 PM To: Open MPI Users <us...@open-mpi.org> Subject: Re: [OMPI users] Need help resolving "error obtaining device context for mlx4_0"
You might want to check the permissions on the MLX device directory - according to that error message, the permissions are preventing you from accessing the device. Without getting access, we don’t have a way to communicate across nodes - you can run on one node using shared memory, but not multiple nodes. So it looks like there is some device-level permissions issue in play. On Dec 17, 2015, at 2:39 PM, Bathke, Chuck <bat...@lanl.gov<mailto:bat...@lanl.gov>> wrote: Hi, I have a system of AMD blades that I am trying to run MCNP6 on using OPENMPI. I installed openmpi-1.6.5. I also have installed Intel FORTRAN and C compiles. I compiled MCNP6 using FC="mpif90" CC="mpicc" … It runs just fine when I run it on a 1-hour test case on just one blade. I need to run it on several blades, but it issues an error and crashes and burns. I have sought help here, but no one seems to know how to fix it. I have mounted /opt and /home on bud and bud6 on the corresponding /opt and /home on bud4, at their suggestion. That did not fix anything. Please look at the attached file (created with bud4>tar -zcf info.tgz mpihT3) that holds the data that is requested at https://www.open-mpi.org/community/help/ and in bullet 13 onhttps://www.open-mpi.org/community/help/ . Can you look at it and suggest a solution? I suspect that it is something trivial that does not stand out and say, “look here you idiot.” Thanks. Charles "Chuck" Bathke MS-C921 Los Alamos National Laboratory P.O. Box 1663 Los Alamos, NM 87545 Phone:(505)667-7214 Cell:(505)695-5709 Fax: 505-665-2897 Location: TA-16, Building 0200, Room 125 NEN-5 Group Office: 505-667-0914 <info.tgz>_______________________________________________ users mailing list us...@open-mpi.org<mailto:us...@open-mpi.org> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users Link to this post: http://www.open-mpi.org/community/lists/users/2015/12/28178.php