> kernel: ib_mthca 0000:06:00.0: Catastrophic error detected: unknown error > kernel: ib_mthca 0000:06:00.0: buf[00]: ffffffff
Looks like an error on the PCI bus. > kernel: ib_mthca 0000:01:00.0: Catastrophic error detected: internal parity > error > kernel: ib_mthca 0000:01:00.0: buf[00]: 05000000 probably what it says it is -- a parity error inside the HCA. Both point to a physical problem to me -- HCA not perfectly seated in PCI slot, power supply flaky, thermal issue, something like that. - R. _______________________________________________ general mailing list general@lists.openfabrics.org http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general