Consistency check of indexes takes too long

Christoph Kiehl Tue, 14 Aug 2007 01:48:27 -0700

Hi,

we've got very big indexes/workspaces on our production servers, which have from 3,000,000 to 8,000,000 nodes and are still growing because of the creation of versions and the addition of new nodes.

When the VM in which Jackrabbit lives crashes during a write operation, Jackrabbit nicely applies the redo log on restart, which is done quite quickly, but then it starts its consistency check. This check takes from 30 minutes to 2 hours depending on the repository. During this time our application is offline, which we would of course like to avoid ;) Our system uses a bundle Oracle persistence manager, which probably doesn't make things better.

I had a quick glance at the consistency check code and it seems like there is nothing that could be substantially optimized in that place. I thought it might be possible to check only those index segments that were used while replaying the redo log, but given how the consistency check works this is impossible.

I think the only way to speed up startup is to avoid the occurrence of the errors that the check is looking for in the first place. Since the redo log mechanism seems quite good, I'm not sure whether those errors (MissingAncestor, MultipleEntries, NodeDeleted, UnknownParent) can still occur. Could you maybe elaborate on the situations where you expect those errors to arise?

For now I'm thinking about disabling consistency checks altogether by default and running them in a maintenance window at night. Unfortunately this might be a bit dangerous if parts of the application rely on certain nodes being found by queries :/
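
For reference, a rough sketch of what disabling the check at startup could look like in the workspace configuration. This assumes a Jackrabbit version whose SearchIndex supports the enableConsistencyCheck and forceConsistencyCheck parameters; the index path is only a placeholder, so please check the names against the version actually deployed:

```xml
<!-- Sketch only: parameter names assume a SearchIndex that supports them -->
<SearchIndex class="org.apache.jackrabbit.core.query.lucene.SearchIndex">
  <!-- placeholder index location -->
  <param name="path" value="${wsp.home}/index"/>
  <!-- skip the consistency check on startup, even after a redo log was applied -->
  <param name="enableConsistencyCheck" value="false"/>
  <!-- do not force a check on every startup -->
  <param name="forceConsistencyCheck" value="false"/>
</SearchIndex>
```

For the nightly maintenance window, both values would be switched to "true" before the scheduled restart so that the full check runs there instead.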

WDYT?


Cheers,
Christoph
