Thanks Mike, I'll give the UYOK options if this continues to be a problem.
I did however just manage to get one of my compute nodes to boot up
properly. All I did was manually define the Hard Drive parameters in the
BIOS instead of letting it auto-detect, and reimaged.  I'm not 100% sure if
this solved the issue or if something else just 'clicked' during the
re-imaging, but it booted up fine.
Now I'm getting errors when I try to run the post_install scripts step,
specifically relating to Torque:

create queue workq
Configuration of TORQUE queues failed, check the logs at /var/spool/pbs at
/opt/oscar/packages/torque/scripts/post_install line
316
Script /opt/oscar/packages/torque/scripts/post_install
exitted badly with exit code '2' at ./post_install line 49 Couldn't run
'post_install' script for torque at ./post_install line 50 Some of the post
install scripts failed, please check your logs for more info at
./post_install line 55
--> Step 7: Failed to properly complete the cluster
install; please check the logs

Which logs am I suppose to check to track down this issue? I looked around
the /var/spool/pbs folder, but couldn't find anything of relevance.
I'm about to ./start-over and reinstall the server fresh since I might have
messed something up while trying to fix the GRUB issue, I'll follow-up once
I finish later today.

-Milo

-----Original Message-----
From: [EMAIL PROTECTED]
[mailto:[EMAIL PROTECTED] On Behalf Of Michael
Edwards
Sent: Wednesday, May 09, 2007 12:39 PM
To: oscar-users@lists.sourceforge.net
Subject: Re: [Oscar-users] Compute nodes freeze at GRUB prompt

The RAID messages are normal spam.  SIS tries loading a lot of drivers
which don't work because you don't have that kind of hardware.  Now if
you see some hardware modules that you think should load and don't,
then that is more of a problem.

The grub messages I am less sure about...

Did you try using the UYOK option on the "Setup Networking" step?
This uses the kernel and related files from the head node.  This
frequently solves hardware related issues when the head and compute
nodes have more or less the same set of hardware.

When the right storage drivers don't get loaded, the messages that pop
up are often misleading.

On 5/9/07, Milo <[EMAIL PROTECTED]> wrote:
>
>
>
>
> Hi All,
>
>
>
> I've been trying to get a bunch of old P2's clustered together using
Fedora
> Core 5 and an old 10Mbit switch.  The install has went relatively trouble
> free upto this point. Once any of my compute nodes get successfully
imaged,
> they hang at the GRUB prompt on boot-up. The keyboard cursor blinks, but I
> can't enter any input and the system just sits there.
>
>
>
> I looked through the imaging log, and found a few errors relating to
> software RAID drivers. Don't think it's the problem, but I'll paste the
> output here aswell:
>
> Load software RAID modules.
>
> insmod: cannot insert
> `/lib/modules/2.6.18-boel_v3.7.5/kernel/drivers/md/md-mod.ko':
> File exists (-1): File exists
>
> insmod: cannot insert
> `/lib/modules/2.6.18-boel_v3.7.5/kernel/drivers/md/md-mod.ko':
> File exists (-1): File exists
>
> modprobe: module raid5 not found.
>
> modprobe: failed to load module raid5
>
> modprobe: module raid6 not found.
>
> modprobe: failed to load module raid6
>
> insmod: cannot insert
> `/lib/modules/2.6.18-boel_v3.7.5/kernel/drivers/md/md-mod.ko':
> File exists (-1): File exists
>
> Load device mapper driver (for LVM).
>
> Load additional filesystem drivers.
>
> modprobe: module fat not found.
>
> modprobe: failed to load module fat
>
> modprobe: module vfat not found.
>
> modprobe: failed to load module vfat
>
>
>
>
>
> I also get some errors near the end of the log that are probably related
to
> this issue:
>
>
>
> Editing files for actual disk configuration...
>
> /dev/hda -> /dev/hda
>
> /etc/fstab
>
> /etc/systemconfig/systemconfig.conf
>
>
>
> mount /dev /a/dev -o bind || shellout
>
> Use of uninitialized value in concatenation (.) or string at
> /usr/lib/systemconfig/Initrd/RH.pm line 69.
>
> install_device not specified.
>
> Probing devices to guess BIOS drives. This may take a long time.
>
>
>
> install_device not specified.
>
> grep: /boot/grub/device.map: No such file or directory
>
> mv: cannot stat `/boot/grub/device.map': No such file or directory
>
> Probing devices to guess BIOS drives. This may take a long time.
>
> Installation finished. No error reported.
>
> This is the contents of the device map /boot/grub/device.map.
>
> Check if this is correct or not. If any of the lines is incorrect,
>
> fix it and re-run the script `grub-install'.
>
>
>
> (hd0) /dev/hda
>
> Use of uninitialized value in concatenation (.) or string at
> /usr/lib/systemconfig/Boot/Grub.pm line 346.
>
> Probing devices to guess BIOS drives. This may take a long time.
>
>
>
>
>  Thanks Guys, any and all input/help is muchly appreciated.  I would have
> attached the full imaging install log to this post if I could have, but a
> copy is not stored on the server I'm told, an I can't boot my compute
nodes
> to get at the local copy there..
>
>
>
>
>
> -Milo
>
> SharcNET Head Office @ The University of Western Ontario
>
>
> -------------------------------------------------------------------------
> This SF.net email is sponsored by DB2 Express
> Download DB2 Express C - the FREE version of DB2 express and take
> control of your XML. No limits. Just data. Click to get it now.
> http://sourceforge.net/powerbar/db2/
> _______________________________________________
> Oscar-users mailing list
> Oscar-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/oscar-users
>
>

-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Oscar-users mailing list
Oscar-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/oscar-users


-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Oscar-users mailing list
Oscar-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/oscar-users

Reply via email to