Jean-Francois.Neyroud wrote:
If I attemp to query at the same time the performance counters on all nodes on a cluster ( 40 nodes) . perfquery causes kernel to be stuck in ib_unregister_mad_agent() function. Impossible to send CTRL-C or CTRL-Z to perfquery, it is stuck in the kernel.
maybe with
$ dmesg -c
$ echo 1 > /proc/sysrq-trigger
$ echo t > /proc/sysrq-trigger

and then looking on the related kernel threads stacks from the dmesg (eg of ib_madX threads, etc) you would get more info that you can share.

Or.


_______________________________________________
general mailing list
[email protected]
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Reply via email to