Re: [Beowulf] IP address mapping for new cluster

Carsten Aulbert Mon, 06 Aug 2007 16:40:18 -0700

Hi Larry, (sorry for the late reply)

first of all thank you very much for the feedback!


Larry Stewart wrote:

I was going to say "how often do you really deal with the A.B.C.D ratherthan DNS names anyway?" but I'vejust spent a couple of weeks doing just that and it really is convenientwhen you are in the weeds.


That was our thought as well, thus the "idea".

One comment is that nearly all software that deals with dotted quadsprints in decimal, which makesbinary encodings of the meaning awkward. So using 4 bit fields for theX and Y coordinates is hardto translate in your head. Instead, making the third octet be(row*20)+column would be a lot easieron the brain and supports 12 rows. This is why we do things likeA.B.200+<module ID>.100+<node ID>/18.It's a little awkward to get started, but then it is trivial to map inyour brain from IP to function
and position.

Right now the current plan allows up to 10 rows, thus 20 seems to be agood number here as well :)

The next issue is how all this gets initialized. Pretty much the onlyway to do it is to have the DHCPservers configured to map MAC addresses to IP addresses in a stableway. We don't really have thatproblem because pretty much the only interfaces that have random MACaddresses are the moduleservice processors. The MAC address maps to the manufacturing serialnumber, which is essentialfor tracking faults, but the position (slot ID/module ID) is reported inthe DHCP request in a <vendor>
field and the DHCP server knows what to do.
It seems like when you install something, you will have to enter its MACaddresses into the DHCPserver database and map to a stable IP address given database knowlegeof the position and function
of the device.

Yes, we will require our vendor to hand over a list (text file) of allMAC addresses of the cluster, i.e. two on board NICs plus MAC from IPMIcard.

For us, there were a number of benefits in going to "IP address maps tofunction": * Humans can debug given the IP addresses alone
* No DNS lookups required in performance critical paths
* Higher level configuration files for things like SLURM can be nearlystatic


So far so good.

Nevertheless, is the benefit of mapping IP to physical location reallyvaluable? Trying tomaintain this given the probable frequency of swapping out boxes willcause trouble withDHCP and ARP. Either you make the leases short and wait for them toexpire beforepowering on a replacement, or you have to go around manually flushingleases and arptables. Ugh. Instead, it may make more sense to give a type of devicea stable IP addresswithout regard to position, and to maintain a database mapping MAC/IP tolocationseparately. For a few 1000's of devices, grepping the location filewill be faster thanwalking over to the right rack anyway. We have this problem withmodules. The serviceguys want to swap modules in the backplane to see if a problem followsit and it has
cost us some DHCP hackery to let the addressing respond smoothly.

So far our experience with slightly smaller clusters suggest that theDHCP problem *might* occur, but usually we have a few "spare nodes"which are switched off during regular operations (at least officially;)). If a node dies and is send back for service we will simply leavethe "hole" on the rack and switch on the spare node at its position -again at least officially. After the box returns we can simply reinstallit back in its own place. Thus lease times should thus not be an issue.

So far it seems we will have enough spare room to house all real andspare nodes, thus it should not be a problem (keeping my fingers crossed).


Anyone else seeing a big problem in this idea?

Cheers

Carsten
_______________________________________________
Beowulf mailing list, [email protected]
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Re: [Beowulf] IP address mapping for new cluster

Reply via email to