Hi Rob, On Tue, Nov 25, 2008 at 9:46 AM, Robert Dunkley <[EMAIL PROTECTED]> wrote: > Hi Hal, > > Machine A is powered on. It was after powering down machine B and OpenSM > with it that Machine A went weird.
> /sys/class/infiniband/mthca0 exists on Machine A, contents is: > board_id fw_ver hw_rev node_guid ports sys_image_guid > device hca_type node_desc node_type subsystem uevent What about machine B ? Do these files exist ? Also what is the port state (down or init or something else) ? -- Hal > Thanks, > > Rob > > -----Original Message----- > From: Hal Rosenstock [mailto:[EMAIL PROTECTED] > Sent: 25 November 2008 14:46 > To: Robert Dunkley > Cc: [email protected] > Subject: Re: [ofa-general] Mellanox Gen3, Linux and ibpanic - "Resource > Temporarily unavailable" > > On Tue, Nov 25, 2008 at 9:20 AM, Robert Dunkley <[EMAIL PROTECTED]> > wrote: >> Hi everyone, >> >> I'm using a setup of two machines (Lets call them A and B) directly >> connected by 1 cable. Each machine has a Mellanox MT25204 (Gen3 > Mellanox >> PCI-E Infiniband card) and uses IPOIB, they run Centos 5.2 with OFED > 1.3 >> installed, Machine B runs OpenSM. >> >> All was working fine. I shutdown Machine A did some maintenance and > then >> powered it on again, everything is OK again. I then shutdown Machine B >> (The one running OpenSM), this seemed to really upset Machine A. After >> booting Machine B again, Machine B looks OK with the port down and in >> polling state. > > Is this with machine A powered off ? > >> Machine A however gives the following error if I run >> ibstat: ibpanic: [11406] main: stat of IB device 'mthca0' failed: >> (Resource temporarily unavailable) > > Does /sys/class/infiniband/mthca0 exist on machine A ? If so, what > files are there ? > > -- Hal > >> I don't want to reboot Machine A as it must synch data with Machine B >> over the Infiniband link first. Does anyone have any idea how to fix >> machine A? >> >> Thanks, >> >> Rob >> >> The SAQ Group >> >> Registered Office: 18 Chapel Street, Petersfield, Hampshire GU32 3DZ >> SEMTEC Limited Trading as SAQ is Registered in England & Wales >> Company Number: 06481952 >> >> >> >> http://www.saqnet.co.uk AS29219 >> >> SAQ Group Delivers high quality, honestly priced communication and > I.T. services to UK Business. >> >> DSL : Domains : Email : Hosting : CoLo : Servers : Racks : Transit : > Backups : Managed Networks : Remote Support. >> >> Find us in http://www.thebestof.co.uk/petersfield >> >> _______________________________________________ >> general mailing list >> [email protected] >> http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general >> >> To unsubscribe, please visit > http://openib.org/mailman/listinfo/openib-general >> > _______________________________________________ general mailing list [email protected] http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
