Hi Dag,

Please see my responses below.

Thanks,

Yasha

On 01/30/2012 03:35 PM, Dag Wieers wrote:
On Mon, 30 Jan 2012, Yasha Karant wrote:

We had a massive power failure, beyond what the UPS could handle.
Despite attempts to find a way for the system to shut down gracefully,
it simply powered down without unmounting the disk partitions.
Nominally, the backup local UPS I am using (APC Back-UPS 650) has an
interface Port DB-9 RS-232 but I have not found any Linux application
that reliably would communicate with this model of UPS (that is,
emulate the same behavior as the application available from APC for MS
Win that senses the RS-232 information from APC, waits the appropriate
time, and then shutdown -- anyone found one?).

Upon boot, automatic fsck failed, and a request was posted for root
password. However, no more than one character of the password would be
accepted, causing an endless loop to this condition and not allowing
me control of the system (run fsck manually).

Hi Yasha,

I wonder:

- whether you were in fact using a journalling filesystem (because it
should even recover from power failure like that when it is journalled)


For the most part, for a number of reasons, we are sticking with ext2, with -- in the case of my workstation -- fuse ntfs for MS Windows which is required for particular unpleasantries. We have not made a production change to ext4, but are considering the transition. We are attempting to get detailed data on the reliability of the ext4 filesystem, how much overhead (loss of data capacity) ext4 requires, and the performance change (loss?) between ext2 and ext4. Any detailed, preferably quantitative, comparisons on production systems will be appreciated.

- what was mounted on /mnt/sysimage (as normally this is your
root-filesystem during installation, not during runtime)


I did a manual umount from the rescue running image, and verified with mount that /mnt/sysimage was not mounted . Nonetheless, when the production system attempted to reboot, it reported that the /dev/sda5 was not cleanly unmounted, and started automatic fsck. This fsck failed, with the request that I manually run fsck -- an operation I could not do as the root password was not accepted, being truncated by the root password input procedure.

- why you couldn't disable the filesystem in /etc/fstab, reboot and fix
it after the system would have booted normally

Indeed, from the rescue running image, a manual invocation of fsck solved the issue -- including fsck -y /dev/sda* , etc., to get all of the partitions. When the swap partition was found, fsck complained but still finished.


- why a filesystem like /mnt/sysimage is configured to stop the
boot-process when it has issues (man fstab, check sixth field)


The filesystem /mnt/sysimage never appears during the regular boot of the system, only from the rescue image. In this particular situation, I did a manual umount of /mnt/sysimage but the regular boot sequence reported that /dev/sda5 (the partition the rescue system mounted on /mnt/sysimage) had not been cleanly unmounted (repeating what I wrote above).

Do the above comments clear the fog?

Thanks in advance for clearing the fog,

Reply via email to