> kernel: ib_mthca 0000:06:00.0: Catastrophic error detected: unknown error
 > kernel: ib_mthca 0000:06:00.0:   buf[00]: ffffffff

Looks like an error on the PCI bus.

 > kernel: ib_mthca 0000:01:00.0: Catastrophic error detected: internal parity 
 > error
 > kernel: ib_mthca 0000:01:00.0:   buf[00]: 05000000

probably what it says it is -- a parity error inside the HCA.

Both point to a physical problem to me -- HCA not perfectly seated in
PCI slot, power supply flaky, thermal issue, something like that.

 - R.
_______________________________________________
general mailing list
general@lists.openfabrics.org
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Reply via email to