You might want to check the permissions on the MLX device directory - according 
to that error message, the permissions are preventing you from accessing the 
device. Without getting access, we don’t have a way to communicate across nodes 
- you can run on one node using shared memory, but not multiple nodes.

So it looks like there is some device-level permissions issue in play.


> On Dec 17, 2015, at 2:39 PM, Bathke, Chuck <bat...@lanl.gov> wrote:
> 
> Hi,
>    I have a system of AMD blades that I am trying to run MCNP6 on using 
> OPENMPI. I installed openmpi-1.6.5. I also have installed Intel FORTRAN and C 
> compiles. I compiled MCNP6 using FC="mpif90" CC="mpicc" … It runs just fine 
> when I run it on a 1-hour test case on just one blade. I need to run it on 
> several blades, but it issues an error and crashes and burns. I have sought 
> help here, but no one seems to know how to fix it. I have mounted /opt and 
> /home on bud and bud6 on the corresponding /opt and /home on bud4, at their 
> suggestion. That did not fix anything. Please look at the attached file 
> (created with bud4>tar -zcf info.tgz mpihT3) that holds the data that is 
> requested at https://www.open-mpi.org/community/help/ 
> <https://www.open-mpi.org/community/help/> and in bullet 13 
> onhttps://www.open-mpi.org/community/help/ 
> <https://www.open-mpi.org/community/help/> . Can you look at it and suggest a 
> solution? I suspect that it is something trivial that does not stand out and 
> say, “look here you idiot.” Thanks.
>  
>  
>                                     Charles "Chuck" Bathke
>  
> MS-C921
> Los Alamos National Laboratory
> P.O. Box 1663
> Los Alamos, NM 87545
> Phone:(505)667-7214
> Cell:(505)695-5709
> Fax: 505-665-2897
> Location: TA-16, Building 0200, Room 125
> NEN-5 Group Office: 505-667-0914
>  
> <info.tgz>_______________________________________________
> users mailing list
> us...@open-mpi.org <mailto:us...@open-mpi.org>
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users 
> <http://www.open-mpi.org/mailman/listinfo.cgi/users>
> Link to this post: 
> http://www.open-mpi.org/community/lists/users/2015/12/28178.php 
> <http://www.open-mpi.org/community/lists/users/2015/12/28178.php>

Reply via email to