Hugo Hernandez <[EMAIL PROTECTED]> wrote:

> Hello folks,
> We have a medium cluster of about 600 slots running Solaris 10 6/06 X86 and 
> we experienced a power failure causing most (all) of the nodes went into a 
> error due to the abruptly power off.   After booting them did not boot in 
> Multi-User mode but asking to boot in failsafe mode.   Here is the output of 
> one of the nodes:
>
> [b][b]SunOS Release 5.10 Version Generic_118855-14 64-bit
> Copyright 1983-2005 Sun Microsystems, Inc.  All rights reserved.
> Use is subject to license terms.
>
> WARNING - The following files in / differ from the boot archive:
>     /etc/rtc_config

You seem to detect a very sore point in the new grub based boot.
I thought that this could not happen in a "stable" environment
as I did previously have this kind of problems on a development
machine (typically after I did test a new driver binary).

You run "only" 600 Solaris instances, but big hosting provisers
as e.g. Strato may run 30000 instances....currently sparc. What 
happenes after Sparc also uses a grub based boot for ZFS?

I would call this  aserious bug and I propose to add code to the
boor recovery scripts that automatically try to create a new boot 
archive and reboot. This seems to be a similar problem as UFS 
inconsistency in the root FS that after fsck did repair the root FS
needed a reboot instead of a "mount -o remount,rw /"

Jörg

-- 
 EMail:[EMAIL PROTECTED] (home) Jörg Schilling D-13353 Berlin
       [EMAIL PROTECTED]                (uni)  
       [EMAIL PROTECTED]     (work) Blog: http://schily.blogspot.com/
 URL:  http://cdrecord.berlios.de/old/private/ ftp://ftp.berlios.de/pub/schily
_______________________________________________
opensolaris-discuss mailing list
[email protected]

Reply via email to