consistency guarantees of Jackrabbit/Lucene indexes

Johannes Boneschanscher Wed, 13 May 2009 06:37:09 -0700

Hi All,

I have searched the Internet and the Jackrabbit codebase for information about the recoverability of the Lucene index in a cluster scenario, but I'm not certain whether it is really recoverable. I hope someone can enlighten me.

To make failover of the Jackrabbit machine possible, we keep the index files of each node and the FileDataStore on a network share. We use JNDIDatabaseJournal for clustering two nodes on the same machine. The version of Jackrabbit is 1.4 (the pom.properties inside Jackrabbit-Core reads: Fri Jan 11 14:41:29 EET 2008, version=1.4).

As far as I understand from JCR-204 (http://issues.apache.org/jira/browse/JCR-204), which is still open, some measures have been taken to make indexes recoverable.

Also JCR-905 (closed) and JCR-778 (closed) seem related.

In the past we had issues with Jackrabbit when the connection to the network share was unstable and the index became corrupted. We are trying to avoid that (by moving it to a SAN with iSCSI). However, reindexing the entire repository takes a lot of time, because we also index the content with almost all text extractors (see: http://jackrabbit.apache.org/api/1.4/org/apache/jackrabbit/extractor/package-summary.html), so we would like to know whether Jackrabbit can completely recover from this kind of situation. (BTW: we currently solve this by restarting the AppServer Jackrabbit is running on, and then the auto recovery kicks in; I guess it is this one: http://svn.apache.org/viewvc/jackrabbit/branches/1.3/jackrabbit-core/src/main/java/org/apache/jackrabbit/core/query/lucene/Recovery.java?view=log&pathrev=544247#rev544247)

If it can recover, why is JCR-204 still open? If it cannot recover, we would have to use a local disk and could no longer cluster the machine, and (if I can find the time) I'll try to fix the issue.


Regards,

Johannes
