On Wed, 2008-05-14 at 07:05 -0700, [EMAIL PROTECTED] wrote: > On Wed, May 14, 2008 at 10:25:48AM +0300, Eli Cohen wrote: > > > On Tue, 2008-05-13 at 18:21 -0700, [EMAIL PROTECTED] wrote: > > > We're getting panics like this one on big clusters: > > > > > > skb_over_panic: text:ffffffff8821f32e len:160 put:100 > > > head:ffff810372b0f000 data:ffff810372b0f01c tail:ffff810372b0f0bc > > > end:ffff810372b0f080 dev:ib0 > > > > RX SKBs are large enough to contain 100 bytes... this looks like > > corruption. > > Exactly. One thing that can help discover memory corruptions and other bug is to use a debug kernel. Is it possible that you will configure a few nodes for debug kernel?
_______________________________________________ general mailing list [email protected] http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
