To be honest, it’s been a very long time since I had an IB machine. Howard, 
Nathan, or someone who has one - can you answer?



> On Dec 17, 2015, at 3:53 PM, Bathke, Chuck <bat...@lanl.gov> wrote:
> 
> Ralph,
> Where would these be, in /dev?
> Chuck
>  
> From: Ralph Castain [mailto:r...@open-mpi.org] 
> Sent: Thursday, December 17, 2015 04:13 PM
> To: Open MPI Users <us...@open-mpi.org> 
> Subject: Re: [OMPI users] Need help resolving "error obtaining device context 
> for mlx4_0" 
>  
> You might want to check the permissions on the MLX device directory - 
> according to that error message, the permissions are preventing you from 
> accessing the device. Without getting access, we don’t have a way to 
> communicate across nodes - you can run on one node using shared memory, but 
> not multiple nodes.
> 
> So it looks like there is some device-level permissions issue in play.
> 
> 
>> On Dec 17, 2015, at 2:39 PM, Bathke, Chuck <bat...@lanl.gov 
>> <mailto:bat...@lanl.gov>> wrote:
>> 
>> Hi,
>>    I have a system of AMD blades that I am trying to run MCNP6 on using 
>> OPENMPI. I installed openmpi-1.6.5. I also have installed Intel FORTRAN and 
>> C compiles. I compiled MCNP6 using FC="mpif90" CC="mpicc" … It runs just 
>> fine when I run it on a 1-hour test case on just one blade. I need to run it 
>> on several blades, but it issues an error and crashes and burns. I have 
>> sought help here, but no one seems to know how to fix it. I have mounted 
>> /opt and /home on bud and bud6 on the corresponding /opt and /home on bud4, 
>> at their suggestion. That did not fix anything. Please look at the attached 
>> file (created with bud4>tar -zcf info.tgz mpihT3) that holds the data that 
>> is requested at https://www.open-mpi.org/community/help/ 
>> <https://www.open-mpi.org/community/help/> and in bullet 13 
>> onhttps://www.open-mpi.org/community/help/ 
>> <https://www.open-mpi.org/community/help/> . Can you look at it and suggest 
>> a solution? I suspect that it is something trivial that does not stand out 
>> and say, “look here you idiot.” Thanks.
>>  
>>  
>>                                     Charles "Chuck" Bathke
>>  
>> MS-C921
>> Los Alamos National Laboratory
>> P.O. Box 1663
>> Los Alamos, NM 87545
>> Phone:(505)667-7214
>> Cell:(505)695-5709
>> Fax: 505-665-2897
>> Location: TA-16, Building 0200, Room 125
>> NEN-5 Group Office: 505-667-0914
>>  
>> <info.tgz>_______________________________________________
>> users mailing list
>> us...@open-mpi.org <mailto:us...@open-mpi.org>
>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users 
>> <http://www.open-mpi.org/mailman/listinfo.cgi/users>
>> Link to this post: 
>> http://www.open-mpi.org/community/lists/users/2015/12/28178.php 
>> <http://www.open-mpi.org/community/lists/users/2015/12/28178.php>
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
> Link to this post: 
> http://www.open-mpi.org/community/lists/users/2015/12/28180.php

Reply via email to