May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: Internal error detected: May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[00]: ffffffff May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[01]: ffffffff May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[02]: ffffffff May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[03]: ffffffff May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[04]: ffffffff May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[05]: ffffffff May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[06]: ffffffff May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[07]: ffffffff May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[08]: ffffffff May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[09]: ffffffff May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[0a]: ffffffff May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[0b]: ffffffff May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[0c]: ffffffff May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[0d]: ffffffff May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[0e]: ffffffff May 9 08:38:54 compute-0-4.local kernel: mlx4_core 0000:02:00.0: buf[0f]: ffffffff
The HCA in question is a Connect-X and the problem only seems to happen with this node. -- Michael Heinz Principal Engineer, Qlogic Corporation King of Prussia, Pennsylvania
_______________________________________________ general mailing list [email protected] http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
