Jean-Francois.Neyroud wrote:
If I attemp to query at the same time the performance counters on all nodes on a cluster ( 40 nodes) . perfquery causes kernel to be stuck in ib_unregister_mad_agent() function. Impossible to send CTRL-C or CTRL-Z to perfquery, it is stuck in the kernel.
maybe with $ dmesg -c $ echo 1 > /proc/sysrq-trigger $ echo t > /proc/sysrq-trigger
and then looking on the related kernel threads stacks from the dmesg (eg of ib_madX threads, etc) you would get more info that you can share.
Or. _______________________________________________ general mailing list [email protected] http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general
