On Mon, Sep 22, 2008 at 2:43 PM, Roger Spellman <[EMAIL PROTECTED]> wrote: > Thanks, Hal. > > Below is the output to ibstat and ibstatus. It shows that the rate is > 2.5 Gb/sec, rather than 10 Gb/sec. > > Is there a way to get it to renegotiate the rate, short of rebooting?
Try ibportstate reset on the switch peer port. You could also replug the cable on that link. > [EMAIL PROTECTED] lib64]# ibstat > CA 'mthca0' > CA type: MT25204 > Number of ports: 1 > Firmware version: 1.2.936 > Hardware version: a0 > Node GUID: 0x0002c9020026e4c0 > System image GUID: 0x0002c9020026e4c3 > Port 1: > State: Active > Physical state: LinkUp > Rate: 2 > Base lid: 19 > LMC: 0 > SM lid: 1 > Capability mask: 0x02510a68 > Port GUID: 0x0002c9020026e4c1 > [EMAIL PROTECTED] lib64]# ibstatus > Infiniband device 'mthca0' port 1 status: > default gid: fe80:0000:0000:0000:0002:c902:0026:e4c1 > base lid: 0x13 > sm lid: 0x1 > state: 4: ACTIVE > phys state: 5: LinkUp > rate: 2.5 Gb/sec (1X) > > > > >> It's likely a rate issue where the negotiated port rate is not the >> broadcast group rate. Yes, it's a rate problem (the link is coming up a 1X SDR which is 2.5 Gbps whereas I suspect that the group is 10 Gbps so it can't join. -- Hal >> What does ibstat or ibstatus show when the join fails ? Also, what >> about saquery -g ? > >> > >> > Rebooting the node that failed to join the group always seems to > solve >> > the problem. > >> Yes, that's consistent with the negotiated rate being a problem. > >> -- Hal > _______________________________________________ general mailing list [email protected] http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
