Hi all, I observe such problem spawning randomly on my nodes:
kernel: ib_ipath 0000:03:00.0: RXE parity, Eager TID port 0 idx 0x33c expected 20447819, but got 20047819. kernel: ib_ipath 0000:03:00.0: infinipath0: RXE parity Eager TID not recoverable, read 20047819, expected 20447819 kernel: ib_ipath 0000:03:00.0: infinipath0: RXE parity, Eager TID error is not recoverable That's for qlogic cards, Mellanox ones seem to be much, much more stable. As I need to stress every card I just looking for a tool to make memory chips there under heavy load and, unfortunately, with not much luck. So what's the tool for diagnosing the cards? ---------------------------------------------------------------------- >> Sprawdz swoja najblizsza przyszlosc! >> http://link.interia.pl/f1f0b _______________________________________________ general mailing list [email protected] http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
