Helen> Roland, Thank you for your response. That fixed my initial
Helen> buffer allocation failure. After we tuned the Lustre and
Helen> reran same IOZONE tests again, we got the following
Helen> problem. Was there an actual network interrupt? If so, the
Helen> problem is not obvious now; the two nodes are pinging over
Helen> IPoIB. Please advice.
That's very odd. This message:
Helen> NETDEV WATCHDOG: ib0: transmit timed out
Helen> ib0: transmit timeout: latency 1846
says that we are not seeing send completions from the HCA. However,
are you saying that even when you are seeing this message, ping over
IPoIB is working?
- R.
_______________________________________________
openib-general mailing list
[email protected]
http://openib.org/mailman/listinfo/openib-general
To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general