Hello all,

I have an LNET routing question. I’ve attached a quick diagram of the current 
setup; but basically I have two core networks (one infiniband and one ethernet) 
with a set of LNET routers in between. There is storage and clients on both 
sides of these routers and all clients need to see all/most storage. All 
connections, configurations, etc are all working.

The question is, if an LNET router goes down (which does cause some amount of 
reconnect or remapping for any clients attempting to use those routes) would 
this cause any issues or delays for a client’s connection to non-routed 
storage? Put slightly different, if a job on the ethernet clients is actively 
using ethernet storage and the lnet routers go down, will job be affected? What 
about a new job just launching when that lnet router is down?

In addition, what does “check_routers_before_use” actually do and does it 
change the scenarios I mentioned? (e.g. If an ethernet client has 
“check_routers_before_use” would every file request start with a ping to the 
routers even if it’s not leaving it’s core network?)

Thanks!


—

Makia Minich
Principal Architect
System Fabric Works
"Fabric Computing that Works”

"Oh, I don't know. I think everything is just as it should be, y'know?”
- Frank Fairfield

_______________________________________________
lustre-discuss mailing list
[email protected]
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

Reply via email to