Hi,

On Thu, Sep 06, 2007 at 12:24:14PM +0200, Igor D'Astolfo wrote:
> Hi,
>    I'm using linux-ha to put MySQL in high availability.
> I configured 2 nodes with MySQL in HA, with 3 resources in a group 
> colocated and ordered:
> 
> * the ip bound to the service
> * the partition with data (on a shared storage), formatted with reiserfs
> * the mysql service
> 
> The HA setup works well, I can migrate the service between the nodes
> without problems.
> But yesterday I had a big issue: the node that was running the resource
> group went down because of a power loss and left the data partition unclean.
> 
> After the default timeouts, the other node took over the resources and
> restarted the service. BUT the partition was not clean. This wasn't
> evident to me, so the server continued to work for about two hours and
> then the kernel started to report oopses on the filesystem and mysql
> stopped responding.
> I had to unmount the partition, run fsck.reiserfs --rebuild-tree,
> remount the partition and restore from backup some files that were lost
> in the repair.
> 
> My question is whether it's possible to check the partition before
> mounting it on the other node, or whether there's another way to
> configure the partition to avoid such problems.

This is arguably a case of software failing in an unexpected way.
Journaled filesystems should guarantee the integrity of data and
metadata. That's why one uses them, and also to avoid very
time-consuming filesystem checks on boot. Unfortunately, there is
usually no quick way to find out whether the filesystem is good.

It is of course possible to do a filesystem check before mounting
it, but that costs time. It would also make the startup procedure
heavily dependent on the filesystem's size and type. Sometimes the
check could even run for hours, so the timeouts would be really
tricky to estimate. At any rate, perhaps this could be made an
option, leaving it to the user to decide whether their filesystem
needs extra checking on mount.
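
To make the idea concrete, here is a rough sketch of such an
optional check-before-mount step. It is only an illustration and is
not taken from any existing resource agent: the function name
check_then_mount, the one-hour default timeout and the injectable
"run" argument are all made up for this example. It assumes that
reiserfsck is installed, that its --check mode only reports problems
without repairing them, and that --yes suppresses the confirmation
prompt.

```python
import subprocess


def check_then_mount(device, mountpoint, check_timeout=3600,
                     run=subprocess.run):
    """Check a reiserfs device read-only and mount it only if it is clean.

    Returns True if the filesystem was checked and mounted, else False.
    `run` is injectable (hypothetical convenience) so the logic can be
    exercised without touching a real device.
    """
    try:
        # Assumption: --check reports corruption but does not repair it,
        # and --yes answers reiserfsck's interactive confirmation.
        checked = run(["reiserfsck", "--check", "--yes", device],
                      timeout=check_timeout)
    except subprocess.TimeoutExpired:
        # The check ran past its budget: fail the start rather than
        # mount a filesystem of unknown state.
        return False

    if checked.returncode != 0:
        # Any non-zero exit is treated as "not clean"; an operator has
        # to decide whether to repair or restore from backup.
        return False

    mounted = run(["mount", "-t", "reiserfs", device, mountpoint])
    return mounted.returncode == 0
```

The timeout is the tricky part mentioned above: it has to be chosen
per filesystem, and the start timeout of the resource has to be
larger than it, otherwise the cluster gives up before the check
does.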

Dejan

> Regards
> 
> _______________________________________________
> Linux-HA mailing list
> [email protected]
> http://lists.linux-ha.org/mailman/listinfo/linux-ha
> See also: http://linux-ha.org/ReportingProblems
