First off, I want to thank all you gurus who have had a hand in this.  This is an awesome management tool and I'm excited to get it going...

 

I've been testing OSCAR on a 33-node cluster running RedHat 9.0.  Setting up the server went very well: I could build an image, reboot my nodes, and have the TFTP boot pick up the image and push it out, and the nodes would come up great.  After that, I decided to use OSCAR to help with my planned migration to Fedora Core 2.  That's where I'm running into problems.

 

I've rebuilt my head node with Fedora Core 2, applied all the RPM updates, and then installed OSCAR.  The nodes all see the TFTP server and start to load the OS.  The problem now is that no node will finish loading its image.  They all die at different spots in the load with errors like "rsync: read error: connection reset by peer", and then the install dumps out to the "systemimager.org/support" message.

 

I thought my gigabit switch might be overloaded, so I connected the head node directly to a slave node with a cross-over cable and pushed the image out again.  I still have the same problem: it crashes every time.  It looks like it has something to do with rsync.  Is there anything else I can try?  Thanks for your support.

 

Mike
