Shirley Ma wrote:
>> Although, I don't think that it's necessarily worth doing, since errors should be very rare.
>
> Agree. Since Galaxy hits this problem on PPC, we've tried different approaches to fix this problem. None of them work well. And it's hard to change the existing architecture. So I would like to work on a patch to address the error.

A patch for this would be accepted. I should note that other modules, such as the CM, follow this same error recovery approach: if an error occurs while trying to initialize any of the ports, the entire device is not used by that module.
IPoIB appears to handle each port separately, however, so that a failure on one port does not mark the others as invalid. Roland should know for certain, but at least that's the way the code looks to me.
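
To make the difference between the two recovery policies concrete, here is a minimal user-space sketch. It is not the actual CM or IPoIB code. All names in it (port_init_fn, init_all_or_nothing, init_per_port, fail_on_port_2, MAX_PORTS) are hypothetical and exist only for illustration, and it assumes ports are numbered starting at 1.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MAX_PORTS 8 /* hypothetical upper bound, for this sketch only */

/* Hypothetical per-port init hook: returns 0 on success, nonzero on error. */
typedef int (*port_init_fn)(int port);

/*
 * All-or-nothing policy: the first port that fails to initialize causes
 * every port on the device to be marked unusable.
 * Returns the number of usable ports (either 0 or nports).
 */
static size_t init_all_or_nothing(port_init_fn init, size_t nports, bool usable[])
{
    assert(nports <= MAX_PORTS);

    for (size_t i = 0; i < nports; i++) {
        if (init((int)i + 1) != 0) {
            /* Roll back: give up on the whole device. */
            for (size_t j = 0; j < nports; j++)
                usable[j] = false;
            return 0;
        }
        usable[i] = true;
    }
    return nports;
}

/*
 * Per-port policy: a failure on one port only marks that port unusable;
 * the remaining ports are still initialized and used.
 * Returns the number of usable ports.
 */
static size_t init_per_port(port_init_fn init, size_t nports, bool usable[])
{
    size_t ok = 0;

    assert(nports <= MAX_PORTS);

    for (size_t i = 0; i < nports; i++) {
        usable[i] = (init((int)i + 1) == 0);
        if (usable[i])
            ok++;
    }
    return ok;
}

/* Example hook that simulates an initialization error on port 2 only. */
static int fail_on_port_2(int port)
{
    return port == 2 ? -1 : 0;
}
```

With a two-port device and fail_on_port_2 as the hook, init_all_or_nothing leaves both ports unusable, while init_per_port keeps port 1 usable and only drops port 2. A patch along the lines discussed would move a module from the first policy toward the second.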
- Sean
